DBLP

DBLP provides open bibliographic information on computer science journals and conferences.

pyterrier-services provides access to DBLP APIs through DblpApi.

Example:

Retrieve from the DBLP API
>>> from pyterrier_services import DblpApi
>>> dblp = DblpApi()
>>> retr = dblp.retriever(num_results=5)
>>> retr.search('pyterrier')
#   qid      query                         docno  score  rank                                              title                                            authors  year                             type
# 0   1  pyterrier  journals/corr/abs-2412-05339   -0.0     0  PyTerrier-GenRank: The PyTerrier Plugin for Re...                                [Kaustubh D. Dhole]  2024  Informal and Other Publications
# 1   1  pyterrier      conf/cikm/MacdonaldTMO21   -1.0     1  PyTerrier: Declarative Experimentation in Pyth...  [Craig Macdonald, Nicola Tonellotto, Sean MacA...  2021   Conference and Workshop Papers
# 2   1  pyterrier  conf/clef/Tavakolpoursaleh21   -2.0     2  PyTerrier-based Research Data Recommendations ...         [Narges Tavakolpoursaleh, Johann Schaible]  2021   Conference and Workshop Papers
# 3   1  pyterrier       conf/ictir/MacdonaldT20   -3.0     3  Declarative Experimentation in Information Ret...               [Craig Macdonald, Nicola Tonellotto]  2020   Conference and Workshop Papers
# 4   1  pyterrier  journals/corr/abs-2007-14271   -4.0     4  Declarative Experimentation in Information Ret...               [Craig Macdonald, Nicola Tonellotto]  2020  Informal and Other Publications
class pyterrier_services.DblpApi

Represents a reference to the DBLP search API.

retriever(*, num_results=100, entity_type=DblpEntityType.publication, verbose=True)

Returns a Transformer that retrieves from DBLP.

Return type:

Transformer

Parameters:
  • num_results – The number of results to retrieve. Defaults to 100.

  • entity_type – The type of entity to search over. Defaults to DblpEntityType.publication.

  • verbose – Whether to log the progress. Defaults to True.
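
As a rough sketch of batch use, the Transformer returned by retriever() can be applied to a frame of several queries at once. This assumes the usual PyTerrier convention of `qid` and `query` input columns, which matches the output shown in the example above. `make_topics` and `run_dblp` are hypothetical helper names, and calling `run_dblp` needs network access.

```python
def make_topics(queries):
    """Build PyTerrier-style query records: one dict per query with 'qid' and 'query'."""
    return [{"qid": str(i), "query": q} for i, q in enumerate(queries, start=1)]


def run_dblp(queries, num_results=5):
    """Retrieve the top results from DBLP for each query (requires network access)."""
    # Imported lazily so make_topics() works without these packages installed.
    import pandas as pd
    from pyterrier_services import DblpApi

    topics = pd.DataFrame(make_topics(queries))
    retr = DblpApi().retriever(num_results=num_results, verbose=False)
    # transform() is the standard PyTerrier entry point for a DataFrame of queries.
    return retr.transform(topics)
```

For instance, `run_dblp(["pyterrier", "dense retrieval"])` would be expected to return a single frame holding up to `num_results` rows per query.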

bibtex_loader(*, bib_type=DblpBibType.standard, verbose=True)

Returns a Transformer that loads bibtex data from DBLP.

Return type:

Transformer

Parameters:
  • bib_type – The type of BibTeX to load. Defaults to DblpBibType.standard.

  • verbose – Whether to log the progress. Defaults to True.
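
One plausible way to use the loader is to chain it after a retriever with PyTerrier's `>>` operator. The sketch below assumes that the loader accepts the retriever's output frame (presumably keyed on `docno`); the name of the column it adds is not stated here, so the demo prints the columns instead of assuming one. `build_bibtex_pipeline` and `demo` are hypothetical names.

```python
def build_bibtex_pipeline(api, num_results=5):
    """Compose retrieval and BibTeX loading into one PyTerrier pipeline."""
    # `>>` is PyTerrier's composition operator: the retriever's output frame
    # is passed on as the input of the BibTeX loader.
    return api.retriever(num_results=num_results) >> api.bibtex_loader()


def demo():
    """Run the pipeline for one query (requires network access)."""
    from pyterrier_services import DblpApi  # imported lazily

    pipeline = build_bibtex_pipeline(DblpApi())
    results = pipeline.search("pyterrier")
    # Inspect the columns rather than assuming the name of the BibTeX field.
    print(results.columns.tolist())
    return results
```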

search(query, *, entity_type=DblpEntityType.publication, offset=0, limit=100, return_next=False, return_total=False)

Searches DBLP for entities of the given type that match the provided query.

Return type:

Union[DataFrame, Tuple[DataFrame, int], Tuple[DataFrame, int, int]]

Parameters:
  • query – The search query.

  • entity_type – The type of entity to search over. Defaults to DblpEntityType.publication.

  • offset – The offset of the first result to retrieve. Defaults to 0.

  • limit – The maximum number of results to retrieve. Defaults to 100.

  • return_next – Whether to return the next query URL. Defaults to False.

  • return_total – Whether to return the total number of results. Defaults to False.
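
The offset and limit parameters suggest simple paging. The following is a minimal sketch, not part of the library: `search_all` is a hypothetical helper that takes any callable with the `offset` and `limit` keywords documented above (such as `DblpApi().search`) and assumes that a page shorter than `limit` means the results are exhausted.

```python
def search_all(search_fn, query, page_size=100, max_results=500):
    """Collect result pages from `search_fn` until a short page or `max_results`."""
    pages = []
    offset = 0
    while offset < max_results:
        # Never request more than what is still allowed by max_results.
        limit = min(page_size, max_results - offset)
        page = search_fn(query, offset=offset, limit=limit)
        if len(page) == 0:
            break
        pages.append(page)
        if len(page) < limit:
            # A short page is taken to mean the result list is exhausted.
            break
        offset += len(page)
    return pages
```

With the real API this might be called as `pages = search_all(DblpApi().search, "pyterrier", page_size=50)`. Each page is then a DataFrame, and a non-empty list of pages can be merged with `pandas.concat(pages, ignore_index=True)`.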

class pyterrier_services.DblpRetriever(*, api=None, num_results=100, entity_type=DblpEntityType.publication, verbose=True)

A Transformer retriever that queries the DBLP search API.

Parameters:
  • api – The DBLP API service. Defaults to a new instance of DblpApi.

  • num_results – The number of results to retrieve per query. Defaults to 100.

  • entity_type – The type of entity to search over. Defaults to DblpEntityType.publication.

  • verbose – Whether to log the progress. Defaults to True.