Semantic Scholar Open Research Corpus (S2ORC)

GPTKB entity

Statements (26)
Predicate Object
gptkbp:instanceOf gptkb:dataset
gptkbp:abbreviation gptkb:S2ORC
gptkbp:access open
gptkbp:contains bibliographic information
abstracts
citation graphs
full text of scientific papers
metadata of scientific papers
gptkbp:creator gptkb:Allen_Institute_for_AI
gptkbp:describedBy large-scale dataset of scientific papers
gptkbp:domain scientific literature
https://www.w3.org/2000/01/rdf-schema#label Semantic Scholar Open Research Corpus (S2ORC)
gptkbp:language English
gptkbp:license gptkb:CC_BY-NC_2.0
gptkbp:publishedIn Lo, K., Wang, L. L., Neumann, M., Kinney, R., & Weld, D. S. (2020). S2ORC: The Semantic Scholar Open Research Corpus. ACL 2020.
gptkbp:relatedTo gptkb:Semantic_Scholar
gptkbp:releaseDate 2020
gptkbp:size 81.1 million papers, with structured full text for 8.1 million open-access papers
gptkbp:usedFor gptkb:machine_learning
information retrieval
citation analysis
natural language processing research
scientific document analysis
gptkbp:website https://allenai.org/data/s2orc
gptkbp:bfsParent gptkb:Open_Research_Corpus
gptkbp:bfsLayer 6