Importing Datasets¶
The datasets module allows easy access to existing standard test collections, particulary those from TREC. In particular, each defined dataset can download and provide easy access to:
files containing the documents of the corpus
topics (queries), as a dataframe, ready for retrieval
relevance assessments (aka, labels or qrels), as a dataframe, ready for evaluation
ready-made Terrier indices, where appropriate
- pyterrier.datasets.list_datasets()[source]¶
Returns a dataframe of all datasets, listing which topics, qrels, corpus files or indices are available. By default, filters to only datasets with both a corpus and topics in English.
- pyterrier.datasets.find_datasets()[source]¶
A grep-like method to help identify datasets. Filters the output of list_datasets() based on the name containing the query
- class pyterrier.datasets.Dataset[source]¶
Represents a dataset (test collection) for indexing or retrieval. A common use-case is to use the Dataset within an Experiment:
dataset = pt.get_dataset("trec-robust-2004") pt.Experiment([br1, br2], dataset.get_topics(), dataset.get_qrels(), eval_metrics=["map", "recip_rank"])
- get_corpus()[source]¶
Returns the location of the files to allow indexing the corpus, i.e. it returns a list of filenames.
- get_corpus_iter(verbose=True)[source]¶
Returns an iter of dicts for this collection. If verbose=True, a tqdm pbar shows the progress over this iterator.
- Return type
Iterator
[Dict
[str
,Any
]]
- get_corpus_lang()[source]¶
Returns the ISO 639-1 language code for the corpus, or None for multiple/other/unknown
- Return type
Optional
[str
]
- get_index(variant=None, **kwargs)[source]¶
Returns the IndexRef of the index to allow retrieval. Only a few datasets provide indices ready made.
- get_topics(variant=None)[source]¶
Returns the topics, as a dataframe, ready for retrieval.
- Return type
DataFrame
- get_topics_lang()[source]¶
Returns the ISO 639-1 language code for the topics, or None for multiple/other/unknown
- Return type
Optional
[str
]
- get_qrels(variant=None)[source]¶
Returns the qrels, as a dataframe, ready for evaluation.
- Return type
DataFrame
Examples¶
Many of the PyTerrier unit tests are based on the Vaswani NPL test collection, a corpus of scientific abstract from ~11,000 documents. PyTerrier provides a ready-made index on the Terrier Data Repository. This allows experiments to be easily conducted:
dataset = pt.get_dataset("vaswani")
bm25 = pt.BatchRetrieve.from_dataset(dataset, "terrier_stemmed", wmodel="BM25")
dph = pt.BatchRetrieve.from_dataset(dataset, "terrier_stemmed", wmodel="DPH")
pt.Experiment(
[bm25, dph],
dataset.get_topics(),
dataset.get_qrels(),
eval_metrics=["map"]
)
Indexing and then retrieval of documents from the MSMARCO document corpus can be achieved as follows:
dataset = pt.get_dataset("trec-deep-learning-docs")
indexer = pt.TRECCollectionIndexer("./index")
# this downloads the file msmarco-docs.trec.gz
indexref = indexer.index(dataset.get_corpus())
index = pt.IndexFactory.of(indexref)
DPH_br = pt.BatchRetrieve(index, wmodel="DPH") % 100
BM25_br = pt.BatchRetrieve(index, wmodel="BM25") % 100
# this runs an experiment to obtain results on the TREC 2019 Deep Learning track queries and qrels
pt.Experiment(
[DPH_br, BM25_br],
dataset.get_topics("test"),
dataset.get_qrels("test"),
eval_metrics=["recip_rank", "ndcg_cut_10", "map"])
For more details on use of MSMARCO, see our MSMARCO leaderboard submission notebooks.
You can also index datasets that include a corpus using IterDictIndexer and get_corpus_iter:
dataset = pt.datasets.get_dataset('irds:cord19/trec-covid')
indexer = pt.index.IterDictIndexer('./cord19-index')
indexref = indexer.index(dataset.get_corpus_iter(), fields=('title', 'abstract'))
index = pt.IndexFactory.of(indexref)
DPH_br = pt.BatchRetrieve(index, wmodel="DPH") % 100
BM25_br = pt.BatchRetrieve(index, wmodel="BM25") % 100
# this runs an experiment to obtain results on the TREC COVID queries and qrels
pt.Experiment(
[DPH_br, BM25_br],
dataset.get_topics('title'),
dataset.get_qrels(),
eval_metrics=["P.5", "P.10", "ndcg_cut.10", "map"])
Available Datasets¶
The table below lists the provided datasets, detailing the attributes available for each dataset.
In each column, True designates the presence of a single artefact of that type, while a list denotes the available variants.
Datasets with the irds:
prefix are from the ir_datasets package; further
documentation on these datasets can be found here.
dataset |
corpus |
index |
topics |
qrels |
info_url |
---|---|---|---|---|---|
50pct |
[‘ex2’, ‘ex3’] |
[training, validation] |
[training, validation] |
||
antique |
True |
[train, test] |
[train, test] |
||
vaswani |
True |
True |
True |
True |
|
msmarco_document |
True |
True |
[train, dev, test, test-2020, leaderboard-2020] |
[train, dev, test, test-2020] |
|
msmarcov2_document |
True |
[train, dev1, dev2, valid1, valid2, trec_2021] |
[train, dev1, dev2, valid1, valid2] |
||
msmarco_passage |
True |
True |
[train, dev, dev.small, eval, eval.small, test-2019, test-2020] |
[train, dev, test-2019, test-2020, dev.small] |
|
msmarcov2_passage |
True |
[train, dev1, dev2, trec_2021] |
[train, dev1, dev2] |
||
trec-robust-2004 |
True |
True |
|||
trec-robust-2005 |
True |
True |
|||
trec-terabyte |
[2004, 2005, 2006, 2004-2006, 2006-np, 2005-np] |
[2004, 2005, 2006, 2004-2006, 2005-np, 2006-np] |
|||
trec-precision-medicine |
[2017, 2018, 2019, 2020] |
[qrels-2017-abstracts, qrels-2017-abstracts-sample, qrels-2017-trials, qrels-2018-abstracts, qrels-2018-abstracts-sample, qrels-2018-trials, qrels-2018-trials-sample, qrels-2019-abstracts, qrels-2019-trials, qrels-2019-abstracts-sample, qrels-2019-trials-sample] |
|||
trec-covid |
[round4, round5] |
True |
[round1, round2, round3, round4, round5] |
[round1, round2, round3, round3-cumulative, round4, round4-cumulative, round5] |
|
trec-wt2g |
True |
True |
|||
trec-wt10g |
[trec9, trec10-adhoc, trec10-hp] |
[trec9, trec10-adhoc, trec10-hp] |
|||
trec-wt-2002 |
[td, np] |
[np, td] |
|||
trec-wt-2003 |
[td, np] |
[np, td] |
|||
trec-wt-2004 |
[all, np, hp, td] |
[hp, td, np, all] |
|||
trec-wt-2009 |
True |
[adhoc, adhoc.catA, adhoc.catB] |
|||
trec-wt-2010 |
True |
[‘adhoc’] |
|||
trec-wt-2011 |
True |
[‘adhoc’] |
|||
trec-wt-2012 |
True |
[‘adhoc’] |
|||
irds:antique |
True |
||||
irds:antique/test |
True |
True |
True |
||
irds:antique/test/non-offensive |
True |
True |
True |
https://ir-datasets.com/antique.html#antique/test/non-offensive |
|
irds:antique/train |
True |
True |
True |
||
irds:antique/train/split200-train |
True |
True |
True |
https://ir-datasets.com/antique.html#antique/train/split200-train |
|
irds:antique/train/split200-valid |
True |
True |
True |
https://ir-datasets.com/antique.html#antique/train/split200-valid |
|
irds:aquaint |
True |
||||
irds:aquaint/trec-robust-2005 |
True |
[title, description, narrative] |
True |
https://ir-datasets.com/aquaint.html#aquaint/trec-robust-2005 |
|
irds:argsme |
|||||
irds:argsme/1.0 |
True |
||||
irds:argsme/1.0-cleaned |
True |
||||
irds:argsme/2020-04-01/debateorg |
True |
https://ir-datasets.com/argsme.html#argsme/2020-04-01/debateorg |
|||
irds:argsme/2020-04-01/debatepedia |
True |
https://ir-datasets.com/argsme.html#argsme/2020-04-01/debatepedia |
|||
irds:argsme/2020-04-01/debatewise |
True |
https://ir-datasets.com/argsme.html#argsme/2020-04-01/debatewise |
|||
irds:argsme/2020-04-01/idebate |
True |
https://ir-datasets.com/argsme.html#argsme/2020-04-01/idebate |
|||
irds:argsme/2020-04-01/parliamentary |
True |
https://ir-datasets.com/argsme.html#argsme/2020-04-01/parliamentary |
|||
irds:argsme/2020-04-01/processed |
True |
https://ir-datasets.com/argsme.html#argsme/2020-04-01/processed |
|||
irds:argsme/2020-04-01 |
True |
||||
irds:beir |
|||||
irds:beir/arguana |
True |
True |
True |
||
irds:beir/climate-fever |
True |
True |
True |
||
irds:beir/cqadupstack/android |
True |
[text, tags] |
True |
||
irds:beir/cqadupstack/english |
True |
[text, tags] |
True |
||
irds:beir/cqadupstack/gaming |
True |
[text, tags] |
True |
||
irds:beir/cqadupstack/gis |
True |
[text, tags] |
True |
||
irds:beir/cqadupstack/mathematica |
True |
[text, tags] |
True |
https://ir-datasets.com/beir.html#beir/cqadupstack/mathematica |
|
irds:beir/cqadupstack/physics |
True |
[text, tags] |
True |
||
irds:beir/cqadupstack/programmers |
True |
[text, tags] |
True |
https://ir-datasets.com/beir.html#beir/cqadupstack/programmers |
|
irds:beir/cqadupstack/stats |
True |
[text, tags] |
True |
||
irds:beir/cqadupstack/tex |
True |
[text, tags] |
True |
||
irds:beir/cqadupstack/unix |
True |
[text, tags] |
True |
||
irds:beir/cqadupstack/webmasters |
True |
[text, tags] |
True |
https://ir-datasets.com/beir.html#beir/cqadupstack/webmasters |
|
irds:beir/cqadupstack/wordpress |
True |
[text, tags] |
True |
https://ir-datasets.com/beir.html#beir/cqadupstack/wordpress |
|
irds:beir/dbpedia-entity |
True |
True |
|||
irds:beir/dbpedia-entity/dev |
True |
True |
True |
||
irds:beir/dbpedia-entity/test |
True |
True |
True |
||
irds:beir/fever |
True |
True |
|||
irds:beir/fever/dev |
True |
True |
True |
||
irds:beir/fever/test |
True |
True |
True |
||
irds:beir/fever/train |
True |
True |
True |
||
irds:beir/fiqa |
True |
True |
|||
irds:beir/fiqa/dev |
True |
True |
True |
||
irds:beir/fiqa/test |
True |
True |
True |
||
irds:beir/fiqa/train |
True |
True |
True |
||
irds:beir/hotpotqa |
True |
True |
|||
irds:beir/hotpotqa/dev |
True |
True |
True |
||
irds:beir/hotpotqa/test |
True |
True |
True |
||
irds:beir/hotpotqa/train |
True |
True |
True |
||
irds:beir/msmarco |
True |
True |
|||
irds:beir/msmarco/dev |
True |
True |
True |
||
irds:beir/msmarco/test |
True |
True |
True |
||
irds:beir/msmarco/train |
True |
True |
True |
||
irds:beir/nfcorpus |
True |
[text, url] |
|||
irds:beir/nfcorpus/dev |
True |
True |
True |
||
irds:beir/nfcorpus/test |
True |
True |
True |
||
irds:beir/nfcorpus/train |
True |
True |
True |
||
irds:beir/nq |
True |
True |
True |
||
irds:beir/quora |
True |
True |
|||
irds:beir/quora/dev |
True |
True |
True |
||
irds:beir/quora/test |
True |
True |
True |
||
irds:beir/scidocs |
True |
[text, authors, year, cited_by, references] |
True |
||
irds:beir/scifact |
True |
True |
|||
irds:beir/scifact/test |
True |
True |
True |
||
irds:beir/scifact/train |
True |
True |
True |
||
irds:beir/trec-covid |
True |
[text, query, narrative] |
True |
||
irds:beir/webis-touche2020 |
True |
[text, description, narrative] |
True |
||
irds:beir/webis-touche2020/v2 |
True |
[text, description, narrative] |
True |
||
irds:c4 |
|||||
irds:c4/en-noclean-tr |
True |
||||
irds:c4/en-noclean-tr/trec-misinfo-2021 |
True |
[text, description, narrative, disclaimer, stance, evidence] |
https://ir-datasets.com/c4.html#c4/en-noclean-tr/trec-misinfo-2021 |
||
irds:car |
|||||
irds:car/v1.5 |
True |
||||
irds:car/v1.5/test200 |
True |
[text, title, headings] |
True |
||
irds:car/v1.5/train/fold0 |
True |
[text, title, headings] |
True |
||
irds:car/v1.5/train/fold1 |
True |
[text, title, headings] |
True |
||
irds:car/v1.5/train/fold2 |
True |
[text, title, headings] |
True |
||
irds:car/v1.5/train/fold3 |
True |
[text, title, headings] |
True |
||
irds:car/v1.5/train/fold4 |
True |
[text, title, headings] |
True |
||
irds:car/v1.5/trec-y1 |
True |
[text, title, headings] |
|||
irds:car/v1.5/trec-y1/auto |
True |
[text, title, headings] |
True |
||
irds:car/v1.5/trec-y1/manual |
True |
[text, title, headings] |
True |
||
irds:car/v2.0 |
True |
||||
irds:highwire |
True |
||||
irds:highwire/trec-genomics-2006 |
True |
True |
[start, length, relevance] |
https://ir-datasets.com/highwire.html#highwire/trec-genomics-2006 |
|
irds:highwire/trec-genomics-2007 |
True |
True |
[start, length, relevance] |
https://ir-datasets.com/highwire.html#highwire/trec-genomics-2007 |
|
irds:medline |
|||||
irds:medline/2004 |
True |
||||
irds:medline/2004/trec-genomics-2004 |
True |
[title, need, context] |
True |
https://ir-datasets.com/medline.html#medline/2004/trec-genomics-2004 |
|
irds:medline/2004/trec-genomics-2005 |
True |
True |
True |
https://ir-datasets.com/medline.html#medline/2004/trec-genomics-2005 |
|
irds:medline/2017 |
True |
||||
irds:medline/2017/trec-pm-2017 |
True |
[disease, gene, demographic, other] |
True |
https://ir-datasets.com/medline.html#medline/2017/trec-pm-2017 |
|
irds:medline/2017/trec-pm-2018 |
True |
[disease, gene, demographic] |
True |
https://ir-datasets.com/medline.html#medline/2017/trec-pm-2018 |
|
irds:clinicaltrials |
|||||
irds:clinicaltrials/2017 |
True |
https://ir-datasets.com/clinicaltrials.html#clinicaltrials/2017 |
|||
irds:clinicaltrials/2017/trec-pm-2017 |
True |
[disease, gene, demographic, other] |
True |
https://ir-datasets.com/clinicaltrials.html#clinicaltrials/2017/trec-pm-2017 |
|
irds:clinicaltrials/2017/trec-pm-2018 |
True |
[disease, gene, demographic] |
True |
https://ir-datasets.com/clinicaltrials.html#clinicaltrials/2017/trec-pm-2018 |
|
irds:clinicaltrials/2019 |
True |
https://ir-datasets.com/clinicaltrials.html#clinicaltrials/2019 |
|||
irds:clinicaltrials/2019/trec-pm-2019 |
True |
[disease, gene, demographic] |
True |
https://ir-datasets.com/clinicaltrials.html#clinicaltrials/2019/trec-pm-2019 |
|
irds:clinicaltrials/2021 |
True |
https://ir-datasets.com/clinicaltrials.html#clinicaltrials/2021 |
|||
irds:clinicaltrials/2021/trec-ct-2021 |
True |
True |
True |
https://ir-datasets.com/clinicaltrials.html#clinicaltrials/2021/trec-ct-2021 |
|
irds:clinicaltrials/2021/trec-ct-2022 |
True |
True |
https://ir-datasets.com/clinicaltrials.html#clinicaltrials/2021/trec-ct-2022 |
||
irds:clirmatrix |
|||||
irds:clueweb09/catb |
True |
||||
irds:clueweb09/catb/trec-web-2009 |
True |
[query, description, type, subtopics] |
[relevance, method, iprob] |
https://ir-datasets.com/clueweb09.html#clueweb09/catb/trec-web-2009 |
|
irds:clueweb09/catb/trec-web-2009/diversity |
True |
[query, description, type, subtopics] |
[relevance, subtopic_id] |
https://ir-datasets.com/clueweb09.html#clueweb09/catb/trec-web-2009/diversity |
|
irds:clueweb09/catb/trec-web-2010 |
True |
[query, description, type, subtopics] |
True |
https://ir-datasets.com/clueweb09.html#clueweb09/catb/trec-web-2010 |
|
irds:clueweb09/catb/trec-web-2010/diversity |
True |
[query, description, type, subtopics] |
[relevance, subtopic_id] |
https://ir-datasets.com/clueweb09.html#clueweb09/catb/trec-web-2010/diversity |
|
irds:clueweb09/catb/trec-web-2011 |
True |
[query, description, type, subtopics] |
True |
https://ir-datasets.com/clueweb09.html#clueweb09/catb/trec-web-2011 |
|
irds:clueweb09/catb/trec-web-2011/diversity |
True |
[query, description, type, subtopics] |
[relevance, subtopic_id] |
https://ir-datasets.com/clueweb09.html#clueweb09/catb/trec-web-2011/diversity |
|
irds:clueweb09/catb/trec-web-2012 |
True |
[query, description, type, subtopics] |
True |
https://ir-datasets.com/clueweb09.html#clueweb09/catb/trec-web-2012 |
|
irds:clueweb09/catb/trec-web-2012/diversity |
True |
[query, description, type, subtopics] |
[relevance, subtopic_id] |
https://ir-datasets.com/clueweb09.html#clueweb09/catb/trec-web-2012/diversity |
|
irds:clueweb09/en |
True |
||||
irds:clueweb09/en/trec-web-2009 |
True |
[query, description, type, subtopics] |
[relevance, method, iprob] |
https://ir-datasets.com/clueweb09.html#clueweb09/en/trec-web-2009 |
|
irds:clueweb09/en/trec-web-2009/diversity |
True |
[query, description, type, subtopics] |
[relevance, subtopic_id] |
https://ir-datasets.com/clueweb09.html#clueweb09/en/trec-web-2009/diversity |
|
irds:clueweb09/en/trec-web-2010 |
True |
[query, description, type, subtopics] |
True |
https://ir-datasets.com/clueweb09.html#clueweb09/en/trec-web-2010 |
|
irds:clueweb09/en/trec-web-2010/diversity |
True |
[query, description, type, subtopics] |
[relevance, subtopic_id] |
https://ir-datasets.com/clueweb09.html#clueweb09/en/trec-web-2010/diversity |
|
irds:clueweb09/en/trec-web-2011 |
True |
[query, description, type, subtopics] |
True |
https://ir-datasets.com/clueweb09.html#clueweb09/en/trec-web-2011 |
|
irds:clueweb09/en/trec-web-2011/diversity |
True |
[query, description, type, subtopics] |
[relevance, subtopic_id] |
https://ir-datasets.com/clueweb09.html#clueweb09/en/trec-web-2011/diversity |
|
irds:clueweb09/en/trec-web-2012 |
True |
[query, description, type, subtopics] |
True |
https://ir-datasets.com/clueweb09.html#clueweb09/en/trec-web-2012 |
|
irds:clueweb09/en/trec-web-2012/diversity |
True |
[query, description, type, subtopics] |
[relevance, subtopic_id] |
https://ir-datasets.com/clueweb09.html#clueweb09/en/trec-web-2012/diversity |
|
irds:clueweb12 |
True |
||||
irds:clueweb12/b13 |
True |
||||
irds:clueweb12/b13/clef-ehealth |
True |
True |
[relevance, trustworthiness, understandability] |
https://ir-datasets.com/clueweb12.html#clueweb12/b13/clef-ehealth |
|
irds:clueweb12/b13/ntcir-www-1 |
True |
True |
True |
https://ir-datasets.com/clueweb12.html#clueweb12/b13/ntcir-www-1 |
|
irds:clueweb12/b13/ntcir-www-2 |
True |
[title, description] |
True |
https://ir-datasets.com/clueweb12.html#clueweb12/b13/ntcir-www-2 |
|
irds:clueweb12/b13/ntcir-www-3 |
True |
[title, description] |
https://ir-datasets.com/clueweb12.html#clueweb12/b13/ntcir-www-3 |
||
irds:clueweb12/b13/trec-misinfo-2019 |
True |
[title, cochranedoi, description, narrative] |
[relevance, effectiveness, redibility] |
https://ir-datasets.com/clueweb12.html#clueweb12/b13/trec-misinfo-2019 |
|
irds:clueweb12/trec-web-2013 |
True |
[query, description, type, subtopics] |
True |
https://ir-datasets.com/clueweb12.html#clueweb12/trec-web-2013 |
|
irds:clueweb12/trec-web-2013/diversity |
True |
[query, description, type, subtopics] |
[relevance, subtopic_id] |
https://ir-datasets.com/clueweb12.html#clueweb12/trec-web-2013/diversity |
|
irds:clueweb12/trec-web-2014 |
True |
[query, description, type, subtopics] |
True |
https://ir-datasets.com/clueweb12.html#clueweb12/trec-web-2014 |
|
irds:clueweb12/trec-web-2014/diversity |
True |
[query, description, type, subtopics] |
[relevance, subtopic_id] |
https://ir-datasets.com/clueweb12.html#clueweb12/trec-web-2014/diversity |
|
irds:codec |
True |
[query, domain, guidelines] |
True |
||
irds:codec/economics |
True |
[query, domain, guidelines] |
True |
||
irds:codec/history |
True |
[query, domain, guidelines] |
True |
||
irds:codec/politics |
True |
[query, domain, guidelines] |
True |
||
irds:cord19 |
True |
||||
irds:cord19/fulltext |
True |
||||
irds:cord19/fulltext/trec-covid |
True |
[title, description, narrative] |
True |
https://ir-datasets.com/cord19.html#cord19/fulltext/trec-covid |
|
irds:cord19/trec-covid |
True |
[title, description, narrative] |
True |
||
irds:cord19/trec-covid/round1 |
True |
[title, description, narrative] |
True |
https://ir-datasets.com/cord19.html#cord19/trec-covid/round1 |
|
irds:cord19/trec-covid/round2 |
True |
[title, description, narrative] |
True |
https://ir-datasets.com/cord19.html#cord19/trec-covid/round2 |
|
irds:cord19/trec-covid/round3 |
True |
[title, description, narrative] |
True |
https://ir-datasets.com/cord19.html#cord19/trec-covid/round3 |
|
irds:cord19/trec-covid/round4 |
True |
[title, description, narrative] |
True |
https://ir-datasets.com/cord19.html#cord19/trec-covid/round4 |
|
irds:cord19/trec-covid/round5 |
True |
[title, description, narrative] |
True |
https://ir-datasets.com/cord19.html#cord19/trec-covid/round5 |
|
irds:cranfield |
True |
True |
True |
||
irds:disks45 |
|||||
irds:disks45/nocr |
True |
||||
irds:disks45/nocr/trec-robust-2004 |
True |
[title, description, narrative] |
True |
https://ir-datasets.com/disks45.html#disks45/nocr/trec-robust-2004 |
|
irds:disks45/nocr/trec-robust-2004/fold1 |
True |
True |
True |
https://ir-datasets.com/disks45.html#disks45/nocr/trec-robust-2004/fold1 |
|
irds:disks45/nocr/trec-robust-2004/fold2 |
True |
True |
True |
https://ir-datasets.com/disks45.html#disks45/nocr/trec-robust-2004/fold2 |
|
irds:disks45/nocr/trec-robust-2004/fold3 |
True |
True |
True |
https://ir-datasets.com/disks45.html#disks45/nocr/trec-robust-2004/fold3 |
|
irds:disks45/nocr/trec-robust-2004/fold4 |
True |
True |
True |
https://ir-datasets.com/disks45.html#disks45/nocr/trec-robust-2004/fold4 |
|
irds:disks45/nocr/trec-robust-2004/fold5 |
True |
True |
True |
https://ir-datasets.com/disks45.html#disks45/nocr/trec-robust-2004/fold5 |
|
irds:disks45/nocr/trec7 |
True |
[title, description, narrative] |
True |
||
irds:disks45/nocr/trec8 |
True |
[title, description, narrative] |
True |
||
irds:dpr-w100 |
True |
||||
irds:dpr-w100/natural-questions/dev |
True |
[text, answers] |
True |
https://ir-datasets.com/dpr-w100.html#dpr-w100/natural-questions/dev |
|
irds:dpr-w100/natural-questions/train |
True |
[text, answers] |
True |
https://ir-datasets.com/dpr-w100.html#dpr-w100/natural-questions/train |
|
irds:dpr-w100/trivia-qa/dev |
True |
[text, answers] |
True |
https://ir-datasets.com/dpr-w100.html#dpr-w100/trivia-qa/dev |
|
irds:dpr-w100/trivia-qa/train |
True |
[text, answers] |
True |
https://ir-datasets.com/dpr-w100.html#dpr-w100/trivia-qa/train |
|
irds:gov |
True |
||||
irds:gov/trec-web-2002 |
True |
[title, description, narrative] |
True |
||
irds:gov/trec-web-2002/named-page |
True |
True |
True |
https://ir-datasets.com/gov.html#gov/trec-web-2002/named-page |
|
irds:gov/trec-web-2003 |
True |
[title, description] |
True |
||
irds:gov/trec-web-2003/named-page |
True |
True |
True |
https://ir-datasets.com/gov.html#gov/trec-web-2003/named-page |
|
irds:gov/trec-web-2004 |
True |
True |
True |
||
irds:gov2 |
True |
||||
irds:gov2/trec-mq-2008 |
True |
True |
[relevance, method, iprob] |
||
irds:gov2/trec-tb-2004 |
True |
[title, description, narrative] |
True |
||
irds:gov2/trec-tb-2005 |
True |
[title, description, narrative] |
True |
||
irds:gov2/trec-tb-2005/efficiency |
True |
True |
True |
https://ir-datasets.com/gov2.html#gov2/trec-tb-2005/efficiency |
|
irds:gov2/trec-tb-2005/named-page |
True |
True |
True |
https://ir-datasets.com/gov2.html#gov2/trec-tb-2005/named-page |
|
irds:gov2/trec-tb-2006 |
True |
[title, description, narrative] |
True |
||
irds:gov2/trec-tb-2006/efficiency |
True |
True |
True |
https://ir-datasets.com/gov2.html#gov2/trec-tb-2006/efficiency |
|
irds:gov2/trec-tb-2006/efficiency/10k |
True |
True |
https://ir-datasets.com/gov2.html#gov2/trec-tb-2006/efficiency/10k |
||
irds:gov2/trec-tb-2006/efficiency/stream1 |
True |
True |
https://ir-datasets.com/gov2.html#gov2/trec-tb-2006/efficiency/stream1 |
||
irds:gov2/trec-tb-2006/efficiency/stream2 |
True |
True |
https://ir-datasets.com/gov2.html#gov2/trec-tb-2006/efficiency/stream2 |
||
irds:gov2/trec-tb-2006/efficiency/stream3 |
True |
True |
True |
https://ir-datasets.com/gov2.html#gov2/trec-tb-2006/efficiency/stream3 |
|
irds:gov2/trec-tb-2006/efficiency/stream4 |
True |
True |
https://ir-datasets.com/gov2.html#gov2/trec-tb-2006/efficiency/stream4 |
||
irds:gov2/trec-tb-2006/named-page |
True |
True |
True |
https://ir-datasets.com/gov2.html#gov2/trec-tb-2006/named-page |
|
irds:kilt |
True |
||||
irds:kilt/codec |
True |
[query, domain, guidelines] |
True |
||
irds:kilt/codec/economics |
True |
[query, domain, guidelines] |
True |
||
irds:kilt/codec/history |
True |
[query, domain, guidelines] |
True |
||
irds:kilt/codec/politics |
True |
[query, domain, guidelines] |
True |
||
irds:lotte |
|||||
irds:lotte/lifestyle/dev |
True |
||||
irds:lotte/lifestyle/dev/forum |
True |
True |
True |
https://ir-datasets.com/lotte.html#lotte/lifestyle/dev/forum |
|
irds:lotte/lifestyle/dev/search |
True |
True |
True |
https://ir-datasets.com/lotte.html#lotte/lifestyle/dev/search |
|
irds:lotte/lifestyle/test |
True |
||||
irds:lotte/lifestyle/test/forum |
True |
True |
True |
https://ir-datasets.com/lotte.html#lotte/lifestyle/test/forum |
|
irds:lotte/lifestyle/test/search |
True |
True |
True |
https://ir-datasets.com/lotte.html#lotte/lifestyle/test/search |
|
irds:lotte/pooled/dev |
True |
||||
irds:lotte/pooled/dev/forum |
True |
True |
True |
||
irds:lotte/pooled/dev/search |
True |
True |
True |
||
irds:lotte/pooled/test |
True |
||||
irds:lotte/pooled/test/forum |
True |
True |
True |
||
irds:lotte/pooled/test/search |
True |
True |
True |
||
irds:lotte/recreation/dev |
True |
||||
irds:lotte/recreation/dev/forum |
True |
True |
True |
https://ir-datasets.com/lotte.html#lotte/recreation/dev/forum |
|
irds:lotte/recreation/dev/search |
True |
True |
True |
https://ir-datasets.com/lotte.html#lotte/recreation/dev/search |
|
irds:lotte/recreation/test |
True |
||||
irds:lotte/recreation/test/forum |
True |
True |
True |
https://ir-datasets.com/lotte.html#lotte/recreation/test/forum |
|
irds:lotte/recreation/test/search |
True |
True |
True |
https://ir-datasets.com/lotte.html#lotte/recreation/test/search |
|
irds:lotte/science/dev |
True |
||||
irds:lotte/science/dev/forum |
True |
True |
True |
||
irds:lotte/science/dev/search |
True |
True |
True |
||
irds:lotte/science/test |
True |
||||
irds:lotte/science/test/forum |
True |
True |
True |
||
irds:lotte/science/test/search |
True |
True |
True |
https://ir-datasets.com/lotte.html#lotte/science/test/search |
|
irds:lotte/technology/dev |
True |
||||
irds:lotte/technology/dev/forum |
True |
True |
True |
https://ir-datasets.com/lotte.html#lotte/technology/dev/forum |
|
irds:lotte/technology/dev/search |
True |
True |
True |
https://ir-datasets.com/lotte.html#lotte/technology/dev/search |
|
irds:lotte/technology/test |
True |
||||
irds:lotte/technology/test/forum |
True |
True |
True |
https://ir-datasets.com/lotte.html#lotte/technology/test/forum |
|
irds:lotte/technology/test/search |
True |
True |
True |
https://ir-datasets.com/lotte.html#lotte/technology/test/search |
|
irds:lotte/writing/dev |
True |
||||
irds:lotte/writing/dev/forum |
True |
True |
True |
||
irds:lotte/writing/dev/search |
True |
True |
True |
||
irds:lotte/writing/test |
True |
||||
irds:lotte/writing/test/forum |
True |
True |
True |
||
irds:lotte/writing/test/search |
True |
True |
True |
https://ir-datasets.com/lotte.html#lotte/writing/test/search |
|
irds:msmarco-passage |
True |
||||
irds:msmarco-passage/dev |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/dev |
|
irds:msmarco-passage/dev/judged |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/dev/judged |
|
irds:msmarco-passage/dev/small |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/dev/small |
|
irds:msmarco-passage/eval |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/eval |
||
irds:msmarco-passage/eval/small |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/eval/small |
||
irds:msmarco-passage/train |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/train |
|
irds:msmarco-passage/train/judged |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/train/judged |
|
irds:msmarco-passage/train/medical |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/train/medical |
|
irds:msmarco-passage/train/split200-train |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/train/split200-train |
|
irds:msmarco-passage/train/split200-valid |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/train/split200-valid |
|
irds:msmarco-passage/train/triples-small |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/train/triples-small |
|
irds:msmarco-passage/train/triples-v2 |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/train/triples-v2 |
|
irds:msmarco-passage/trec-dl-2019 |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/trec-dl-2019 |
|
irds:msmarco-passage/trec-dl-2019/judged |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/trec-dl-2019/judged |
|
irds:msmarco-passage/trec-dl-2020 |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/trec-dl-2020 |
|
irds:msmarco-passage/trec-dl-2020/judged |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/trec-dl-2020/judged |
|
irds:msmarco-passage/trec-dl-hard |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/trec-dl-hard |
|
irds:msmarco-passage/trec-dl-hard/fold1 |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/trec-dl-hard/fold1 |
|
irds:msmarco-passage/trec-dl-hard/fold2 |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/trec-dl-hard/fold2 |
|
irds:msmarco-passage/trec-dl-hard/fold3 |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/trec-dl-hard/fold3 |
|
irds:msmarco-passage/trec-dl-hard/fold4 |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/trec-dl-hard/fold4 |
|
irds:msmarco-passage/trec-dl-hard/fold5 |
True |
True |
True |
https://ir-datasets.com/msmarco-passage.html#msmarco-passage/trec-dl-hard/fold5 |
|
irds:mmarco |
|||||
irds:mr-tydi |
|||||
irds:mr-tydi/en |
True |
True |
True |
||
irds:mr-tydi/en/dev |
True |
True |
True |
||
irds:mr-tydi/en/test |
True |
True |
True |
||
irds:mr-tydi/en/train |
True |
True |
True |
||
irds:msmarco-document |
True |
||||
irds:msmarco-document/anchor-text |
True |
https://ir-datasets.com/msmarco-document.html#msmarco-document/anchor-text |
|||
irds:msmarco-document/dev |
True |
True |
True |
https://ir-datasets.com/msmarco-document.html#msmarco-document/dev |
|
irds:msmarco-document/eval |
True |
True |
https://ir-datasets.com/msmarco-document.html#msmarco-document/eval |
||
irds:msmarco-document/orcas |
True |
True |
True |
https://ir-datasets.com/msmarco-document.html#msmarco-document/orcas |
|
irds:msmarco-document/train |
True |
True |
True |
https://ir-datasets.com/msmarco-document.html#msmarco-document/train |
|
irds:msmarco-document/trec-dl-2019 |
True |
True |
True |
https://ir-datasets.com/msmarco-document.html#msmarco-document/trec-dl-2019 |
|
irds:msmarco-document/trec-dl-2019/judged |
True |
True |
True |
https://ir-datasets.com/msmarco-document.html#msmarco-document/trec-dl-2019/judged |
|
irds:msmarco-document/trec-dl-2020 |
True |
True |
True |
https://ir-datasets.com/msmarco-document.html#msmarco-document/trec-dl-2020 |
|
irds:msmarco-document/trec-dl-2020/judged |
True |
True |
True |
https://ir-datasets.com/msmarco-document.html#msmarco-document/trec-dl-2020/judged |
|
irds:msmarco-document/trec-dl-hard |
True |
True |
True |
https://ir-datasets.com/msmarco-document.html#msmarco-document/trec-dl-hard |
|
irds:msmarco-document/trec-dl-hard/fold1 |
True |
True |
True |
https://ir-datasets.com/msmarco-document.html#msmarco-document/trec-dl-hard/fold1 |
|
irds:msmarco-document/trec-dl-hard/fold2 |
True |
True |
True |
https://ir-datasets.com/msmarco-document.html#msmarco-document/trec-dl-hard/fold2 |
|
irds:msmarco-document/trec-dl-hard/fold3 |
True |
True |
True |
https://ir-datasets.com/msmarco-document.html#msmarco-document/trec-dl-hard/fold3 |
|
irds:msmarco-document/trec-dl-hard/fold4 |
True |
True |
True |
https://ir-datasets.com/msmarco-document.html#msmarco-document/trec-dl-hard/fold4 |
|
irds:msmarco-document/trec-dl-hard/fold5 |
True |
True |
True |
https://ir-datasets.com/msmarco-document.html#msmarco-document/trec-dl-hard/fold5 |
|
irds:msmarco-document-v2 |
True |
||||
irds:msmarco-document-v2/anchor-text |
True |
https://ir-datasets.com/msmarco-document-v2.html#msmarco-document-v2/anchor-text |
|||
irds:msmarco-document-v2/dev1 |
True |
True |
True |
https://ir-datasets.com/msmarco-document-v2.html#msmarco-document-v2/dev1 |
|
irds:msmarco-document-v2/dev2 |
True |
True |
True |
https://ir-datasets.com/msmarco-document-v2.html#msmarco-document-v2/dev2 |
|
irds:msmarco-document-v2/train |
True |
True |
True |
https://ir-datasets.com/msmarco-document-v2.html#msmarco-document-v2/train |
|
irds:msmarco-document-v2/trec-dl-2019 |
True |
True |
True |
https://ir-datasets.com/msmarco-document-v2.html#msmarco-document-v2/trec-dl-2019 |
|
irds:msmarco-document-v2/trec-dl-2019/judged |
True |
True |
True |
https://ir-datasets.com/msmarco-document-v2.html#msmarco-document-v2/trec-dl-2019/judged |
|
irds:msmarco-document-v2/trec-dl-2020 |
True |
True |
True |
https://ir-datasets.com/msmarco-document-v2.html#msmarco-document-v2/trec-dl-2020 |
|
irds:msmarco-document-v2/trec-dl-2020/judged |
True |
True |
True |
https://ir-datasets.com/msmarco-document-v2.html#msmarco-document-v2/trec-dl-2020/judged |
|
irds:msmarco-document-v2/trec-dl-2021 |
True |
True |
True |
https://ir-datasets.com/msmarco-document-v2.html#msmarco-document-v2/trec-dl-2021 |
|
irds:msmarco-document-v2/trec-dl-2021/judged |
True |
True |
True |
https://ir-datasets.com/msmarco-document-v2.html#msmarco-document-v2/trec-dl-2021/judged |
|
irds:msmarco-document-v2/trec-dl-2022 |
True |
True |
https://ir-datasets.com/msmarco-document-v2.html#msmarco-document-v2/trec-dl-2022 |
||
irds:msmarco-passage-v2 |
True |
||||
irds:msmarco-passage-v2/dev1 |
True |
True |
True |
https://ir-datasets.com/msmarco-passage-v2.html#msmarco-passage-v2/dev1 |
|
irds:msmarco-passage-v2/dev2 |
True |
True |
True |
https://ir-datasets.com/msmarco-passage-v2.html#msmarco-passage-v2/dev2 |
|
irds:msmarco-passage-v2/train |
True |
True |
True |
https://ir-datasets.com/msmarco-passage-v2.html#msmarco-passage-v2/train |
|
irds:msmarco-passage-v2/trec-dl-2021 |
True |
True |
True |
https://ir-datasets.com/msmarco-passage-v2.html#msmarco-passage-v2/trec-dl-2021 |
|
irds:msmarco-passage-v2/trec-dl-2021/judged |
True |
True |
True |
https://ir-datasets.com/msmarco-passage-v2.html#msmarco-passage-v2/trec-dl-2021/judged |
|
irds:msmarco-passage-v2/trec-dl-2022 |
True |
True |
https://ir-datasets.com/msmarco-passage-v2.html#msmarco-passage-v2/trec-dl-2022 |
||
irds:msmarco-qna |
True |
||||
irds:msmarco-qna/dev |
True |
[text, type, answers] |
True |
||
irds:msmarco-qna/eval |
True |
[text, type] |
|||
irds:msmarco-qna/train |
True |
[text, type, answers] |
True |
||
irds:neumarco |
|||||
irds:nfcorpus |
True |
||||
irds:nfcorpus/dev |
True |
[title, all] |
True |
||
irds:nfcorpus/dev/nontopic |
True |
True |
True |
||
irds:nfcorpus/dev/video |
True |
[title, desc] |
True |
||
irds:nfcorpus/test |
True |
[title, all] |
True |
||
irds:nfcorpus/test/nontopic |
True |
True |
True |
https://ir-datasets.com/nfcorpus.html#nfcorpus/test/nontopic |
|
irds:nfcorpus/test/video |
True |
[title, desc] |
True |
||
irds:nfcorpus/train |
True |
[title, all] |
True |
||
irds:nfcorpus/train/nontopic |
True |
True |
True |
https://ir-datasets.com/nfcorpus.html#nfcorpus/train/nontopic |
|
irds:nfcorpus/train/video |
True |
[title, desc] |
True |
||
irds:natural-questions |
True |
||||
irds:natural-questions/dev |
True |
True |
[relevance, short_answers, yes_no_answer] |
https://ir-datasets.com/natural-questions.html#natural-questions/dev |
|
irds:natural-questions/train |
True |
True |
[relevance, short_answers, yes_no_answer] |
https://ir-datasets.com/natural-questions.html#natural-questions/train |
|
irds:nyt |
True |
||||
irds:nyt/trec-core-2017 |
True |
[title, description, narrative] |
True |
||
irds:nyt/wksup |
True |
True |
True |
||
irds:nyt/wksup/train |
True |
True |
True |
||
irds:nyt/wksup/valid |
True |
True |
True |
||
irds:pmc |
|||||
irds:pmc/v1 |
True |
||||
irds:pmc/v1/trec-cds-2014 |
True |
[type, description, summary] |
True |
||
irds:pmc/v1/trec-cds-2015 |
True |
[type, description, summary] |
True |
||
irds:pmc/v2 |
True |
||||
irds:pmc/v2/trec-cds-2016 |
True |
[type, note, description, summary] |
True |
||
irds:touche-image |
|||||
irds:touche-image/2022-06-13 |
True |
https://ir-datasets.com/touche-image.html#touche-image/2022-06-13 |
|||
irds:argsme/2020-04-01/touche-2020-task-1 |
True |
[title, description, narrative] |
True |
https://ir-datasets.com/argsme.html#argsme/2020-04-01/touche-2020-task-1 |
|
irds:clueweb12/touche-2020-task-2 |
True |
[title, description, narrative] |
True |
https://ir-datasets.com/clueweb12.html#clueweb12/touche-2020-task-2 |
|
irds:argsme/2020-04-01/touche-2021-task-1 |
True |
True |
[relevance, quality] |
https://ir-datasets.com/argsme.html#argsme/2020-04-01/touche-2021-task-1 |
|
irds:clueweb12/touche-2021-task-2 |
True |
[title, description, narrative] |
[relevance, quality] |
https://ir-datasets.com/clueweb12.html#clueweb12/touche-2021-task-2 |
|
irds:argsme/2020-04-01/processed/touche-2022-task-1 |
True |
[title, description, narrative] |
[relevance, quality, coherence] |
https://ir-datasets.com/argsme.html#argsme/2020-04-01/processed/touche-2022-task-1 |
|
irds:clueweb12/touche-2022-task-2 |
True |
[title, objects, description, narrative] |
[relevance, quality, stance] |
https://ir-datasets.com/clueweb12.html#clueweb12/touche-2022-task-2 |
|
irds:touche-image/2022-06-13/touche-2022-task-3 |
True |
[title, description, narrative] |
True |
https://ir-datasets.com/touche-image.html#touche-image/2022-06-13/touche-2022-task-3 |
|
irds:argsme/1.0/touche-2020-task-1/uncorrected |
True |
[title, description, narrative] |
True |
https://ir-datasets.com/argsme.html#argsme/1.0/touche-2020-task-1/uncorrected |
|
irds:argsme/2020-04-01/touche-2020-task-1/uncorrected |
True |
[title, description, narrative] |
True |
https://ir-datasets.com/argsme.html#argsme/2020-04-01/touche-2020-task-1/uncorrected |
|
irds:clueweb12/touche-2022-task-2/expanded-doc-t5-query |
True |
[title, objects, description, narrative] |
[relevance, quality, stance] |
https://ir-datasets.com/clueweb12.html#clueweb12/touche-2022-task-2/expanded-doc-t5-query |
|
irds:trec-robust04 |
True |
[title, description, narrative] |
True |
||
irds:trec-robust04/fold1 |
True |
True |
True |
https://ir-datasets.com/trec-robust04.html#trec-robust04/fold1 |
|
irds:trec-robust04/fold2 |
True |
True |
True |
https://ir-datasets.com/trec-robust04.html#trec-robust04/fold2 |
|
irds:trec-robust04/fold3 |
True |
True |
True |
https://ir-datasets.com/trec-robust04.html#trec-robust04/fold3 |
|
irds:trec-robust04/fold4 |
True |
True |
True |
https://ir-datasets.com/trec-robust04.html#trec-robust04/fold4 |
|
irds:trec-robust04/fold5 |
True |
True |
True |
https://ir-datasets.com/trec-robust04.html#trec-robust04/fold5 |
|
irds:tripclick |
True |
||||
irds:tripclick/logs |
True |
||||
irds:tripclick/test |
True |
True |
|||
irds:tripclick/test/head |
True |
True |
|||
irds:tripclick/test/tail |
True |
True |
|||
irds:tripclick/test/torso |
True |
True |
|||
irds:tripclick/train |
True |
True |
True |
||
irds:tripclick/train/head |
True |
True |
True |
||
irds:tripclick/train/head/dctr |
True |
True |
True |
https://ir-datasets.com/tripclick.html#tripclick/train/head/dctr |
|
irds:tripclick/train/hofstaetter-triples |
True |
True |
True |
https://ir-datasets.com/tripclick.html#tripclick/train/hofstaetter-triples |
|
irds:tripclick/train/tail |
True |
True |
True |
||
irds:tripclick/train/torso |
True |
True |
True |
https://ir-datasets.com/tripclick.html#tripclick/train/torso |
|
irds:tripclick/val |
True |
True |
True |
||
irds:tripclick/val/head |
True |
True |
True |
||
irds:tripclick/val/head/dctr |
True |
True |
True |
https://ir-datasets.com/tripclick.html#tripclick/val/head/dctr |
|
irds:tripclick/val/tail |
True |
True |
True |
||
irds:tripclick/val/torso |
True |
True |
True |
||
irds:vaswani |
True |
True |
True |
||
irds:wapo |
|||||
irds:wapo/v2 |
True |
||||
irds:wapo/v2/trec-core-2018 |
True |
[title, description, narrative] |
True |
||
irds:wapo/v2/trec-news-2018 |
True |
[doc_id, url] |
True |
||
irds:wapo/v2/trec-news-2019 |
True |
[doc_id, url] |
True |
||
irds:wapo/v3/trec-news-2020 |
[doc_id, url] |
True |
|||
irds:wikiclir |
|||||
irds:wikiclir/en-simple |
True |
True |
True |
||
irds:wikir |
|||||
irds:wikir/en1k |
True |
||||
irds:wikir/en1k/test |
True |
True |
True |
||
irds:wikir/en1k/training |
True |
True |
True |
||
irds:wikir/en1k/validation |
True |
True |
True |
||
irds:wikir/en59k |
True |
||||
irds:wikir/en59k/test |
True |
True |
True |
||
irds:wikir/en59k/training |
True |
True |
True |
||
irds:wikir/en59k/validation |
True |
True |
True |
||
irds:wikir/en78k |
True |
||||
irds:wikir/en78k/test |
True |
True |
True |
||
irds:wikir/en78k/training |
True |
True |
True |
||
irds:wikir/en78k/validation |
True |
True |
True |
||
irds:wikir/ens78k |
True |
||||
irds:wikir/ens78k/test |
True |
True |
True |
||
irds:wikir/ens78k/training |
True |
True |
True |
||
irds:wikir/ens78k/validation |
True |
True |
True |
||
irds:trec-fair |
|||||
irds:trec-fair/2021 |
True |
||||
irds:trec-fair/2021/train |
True |
[text, keywords, scope, homepage] |
True |
||
irds:trec-fair/2021/eval |
True |
[text, keywords, scope] |
True |
||
irds:trec-fair/2022 |
True |
||||
irds:trec-fair/2022/train |
True |
[text, url] |
True |
||
irds:trec-fair-2021 |
True |
||||
irds:trec-fair-2021/train |
True |
[text, keywords, scope, homepage] |
True |
https://ir-datasets.com/trec-fair-2021.html#trec-fair-2021/train |
|
irds:trec-fair-2021/eval |
True |
[text, keywords, scope] |
True |
https://ir-datasets.com/trec-fair-2021.html#trec-fair-2021/eval |
|
irds:trec-cast |
|||||
irds:trec-cast/v0 |
True |
||||
irds:trec-cast/v0/train |
True |
[raw_utterance, topic_number, turn_number, topic_title, topic_description] |
True |
||
irds:trec-cast/v0/train/judged |
True |
True |
True |
https://ir-datasets.com/trec-cast.html#trec-cast/v0/train/judged |
|
irds:trec-cast/v1 |
True |
||||
irds:trec-cast/v1/2019 |
True |
[raw_utterance, topic_number, turn_number, topic_title, topic_description] |
True |
||
irds:trec-cast/v1/2019/judged |
True |
True |
True |
https://ir-datasets.com/trec-cast.html#trec-cast/v1/2019/judged |
|
irds:trec-cast/v1/2020 |
True |
[raw_utterance, automatic_rewritten_utterance, manual_rewritten_utterance, manual_canonical_result_id, topic_number, turn_number] |
True |
||
irds:trec-cast/v1/2020/judged |
True |
True |
True |
https://ir-datasets.com/trec-cast.html#trec-cast/v1/2020/judged |
|
irds:hc4 |
|||||
irds:neuclir |
|||||
irds:neuclir/1 |
|||||
trec-deep-learning-docs |
True |
True |
[train, dev, test, test-2020, leaderboard-2020] |
[train, dev, test, test-2020] |
|
trec-deep-learning-passages |
True |
True |
[train, dev, dev.small, eval, eval.small, test-2019, test-2020] |
[train, dev, test-2019, test-2020, dev.small] |