DBLP
DBLP provides open bibliographic information on computer science journals and conferences.
pyterrier-services provides access to DBLP APIs through DblpApi.
Example:
>>> from pyterrier_services import DblpApi
>>> dblp = DblpApi()
>>> retr = dblp.retriever(num_results=5)
>>> retr.search('pyterrier')
# qid query docno score rank title authors year type
# 0 1 pyterrier journals/corr/abs-2412-05339 -0.0 0 PyTerrier-GenRank: The PyTerrier Plugin for Re... [Kaustubh D. Dhole] 2024 Informal and Other Publications
# 1 1 pyterrier conf/cikm/MacdonaldTMO21 -1.0 1 PyTerrier: Declarative Experimentation in Pyth... [Craig Macdonald, Nicola Tonellotto, Sean MacA... 2021 Conference and Workshop Papers
# 2 1 pyterrier conf/clef/Tavakolpoursaleh21 -2.0 2 PyTerrier-based Research Data Recommendations ... [Narges Tavakolpoursaleh, Johann Schaible] 2021 Conference and Workshop Papers
# 3 1 pyterrier conf/ictir/MacdonaldT20 -3.0 3 Declarative Experimentation in Information Ret... [Craig Macdonald, Nicola Tonellotto] 2020 Conference and Workshop Papers
# 4 1 pyterrier journals/corr/abs-2007-14271 -4.0 4 Declarative Experimentation in Information Ret... [Craig Macdonald, Nicola Tonellotto] 2020 Informal and Other Publications
- class pyterrier_services.DblpApi
Represents a reference to the DBLP search API.
- retriever(*, num_results=100, entity_type=DblpEntityType.publication, verbose=True)
Returns a Transformer that retrieves from DBLP.
- Return type:
Transformer
- Parameters:
num_results – The number of results to retrieve. Defaults to 100.
entity_type – The type of entity to search over. Defaults to DblpEntityType.publication.
verbose – Whether to log the progress. Defaults to True.
- bibtex_loader(*, bib_type=DblpBibType.standard, verbose=True)
Returns a Transformer that loads BibTeX data from DBLP.
- Return type:
Transformer
- Parameters:
bib_type – The type of BibTeX to load. Defaults to DblpBibType.standard.
verbose – Whether to log the progress. Defaults to True.
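Since retriever() and bibtex_loader() both return a Transformer, they can presumably be chained with PyTerrier's `>>` composition operator. The sketch below is a minimal illustration, not the library's documented recipe: build_bibtex_pipeline is a hypothetical helper name, and it assumes the loader can consume the retriever's output frame (for instance, keyed on docno), which this page does not confirm.

```python
def build_bibtex_pipeline(api, num_results=5):
    """Compose a DBLP retriever with a BibTeX loader.

    `api` is expected to be a DblpApi instance, or any object exposing the
    same retriever() and bibtex_loader() methods.
    Assumption: the loader accepts the retriever's output as its input;
    inspect the resulting columns before relying on this.
    """
    retriever = api.retriever(num_results=num_results, verbose=False)
    loader = api.bibtex_loader(verbose=False)
    # `>>` is PyTerrier's "then" operator: the retriever's output is
    # passed to the loader.
    return retriever >> loader
```

With the library installed, `build_bibtex_pipeline(DblpApi()).search('pyterrier')` would issue live requests to DBLP, so it is best tried with a small num_results first.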
- search(query, *, entity_type=DblpEntityType.publication, offset=0, limit=100, return_next=False, return_total=False)
Searches DBLP with the provided query.
- Return type:
Union[DataFrame, Tuple[DataFrame, int], Tuple[DataFrame, int, int]]
- Parameters:
query – The search query.
entity_type – The type of entity to search over. Defaults to DblpEntityType.publication.
offset – The offset of the first result to retrieve. Defaults to 0.
limit – The maximum number of results to retrieve. Defaults to 100.
return_next – Whether to return the next query URL. Defaults to False.
return_total – Whether to return the total number of results. Defaults to False.
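The offset and limit parameters suggest a simple way to page through a large result set. The following is a minimal sketch under stated assumptions: fetch_pages is a hypothetical helper, not part of pyterrier-services, and it takes the search function as an argument so that it only relies on the `query`, `offset`, and `limit` parameters documented above.

```python
def fetch_pages(search_fn, query, page_size=100, max_results=300):
    """Collect successive result pages by advancing `offset`.

    `search_fn` is expected to behave like DblpApi.search: it is called as
    search_fn(query, offset=..., limit=...) and returns a sized page
    (a DataFrame in the real API). Paging stops on an empty or short page,
    or once `max_results` rows have been requested.
    """
    pages = []
    offset = 0
    while offset < max_results:
        # Never request more rows than the remaining budget allows.
        limit = min(page_size, max_results - offset)
        page = search_fn(query, offset=offset, limit=limit)
        if len(page) == 0:
            break
        pages.append(page)
        if len(page) < limit:
            break  # short page: no further results expected
        offset += len(page)
    return pages
```

With the real API, something like `pd.concat(fetch_pages(DblpApi().search, 'pyterrier'), ignore_index=True)` would merge the pages into a single frame. The return_total flag could bound the loop instead, but the order of the returned tuple is not spelled out here, so the sketch avoids it.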
- class pyterrier_services.DblpRetriever(*, api=None, num_results=100, entity_type=DblpEntityType.publication, verbose=True)
A Transformer retriever that queries the DBLP search API.
- Parameters:
api – The DBLP API service. Defaults to a new instance of DblpApi.
num_results – The number of results to retrieve per query. Defaults to 100.
entity_type – The type of entity to search over. Defaults to DblpEntityType.publication.
verbose – Whether to log the progress. Defaults to True.