Where does the data come from?
Pure's data comes from various internal and external sources, ensuring a comprehensive and accurate representation of research activities, outputs, and achievements. The primary sources are:
• Internal systems: HR (fall 2025)
• Certain bibliographic databases with which the library has a license and other free open access sources. See below for the current list of sources.
• External researcher profiles: ORCID
• Manual input by the CEU Pure Implementation team
Current list of external sources:
- Scopus, an abstract and citation database, covers 22,000 titles from more than 5,000 international publishers. It contains over 56 million records from peer-reviewed research literature in the scientific, technical, medical, social sciences, and arts and humanities fields.
- EBSCOhost databases are the most-used, premium online information resources for tens of thousands of institutions worldwide, representing millions of end users.
- Digital Commons is an institutional repository platform for journals, conference proceedings, open educational resources, and more. It is used by over 500 academic institutions, healthcare centers, public libraries, and research centers to manage and showcase their institution's research and scholarship.
- Mendeley is a free reference manager and academic social network that helps scientist organize their research, collaborate with others online, and discover the latest research. It holds more than 20 million records.
- Embase is a biomedical database with over 25 million indexed records and covers relevant and up-to-date biomedical research, from drugs and diseases from pre-clinical studies to critical toxicological information, in more than 7600 peer-reviewed journals, 2500 conferences and 300.000 conference abstracts.
- Contains more than 19 million records from MEDLINE and from medical scientific journals, dating back from 1948. PubMed contains links to full-texts, either from other databases or the journals' publishers. Content must be "E-pub ahead of print", before imported into Pure.
- CiNii is a bibliographic database service for material in Japanese academic libraries, especially focusing on Japanese works and English works published in Japan
- A system developed by publishers in 2000, to ease linking between references in fulltext documents in online scholarly literature, on the publishers' websites. Crossref uses DOI to transmit link information.
- arXiv is an Open Access database with full-text access and contains more than 650.000 e-prints. It covers the domains of physics, mathematics, computer science, quantitative biology, quantitative economy and statistics.
- The SAO/NASA Astrophysics Data System (ADS) is a Digital Library portal for researchers in Astronomy and Physics, operated by the Smithsonian Astrophysical Observatory (SAO) under a NASA grant. The ADS maintains three bibliographic databases containing more than 9.1 million records: Astronomy and Astrophysics, Physics, and arXiv e-prints.
- Espacenet offers free access to more than 90 million patent documents worldwide, containing information about inventions and technical developments from 1836 to today.
- ORCID is a nonprofit organization helping create a world in which all who participate in research, scholarship and innovation are uniquely identified and connected to their contributions and affiliations, across disciplines, borders, and time. Enabling ORCID as an online import source will allow the import of research output meta data available on ORCID's public API, for more information please see the Release notes.
- SSRN is a worldwide collaboration of over 352,400 authors and more than 2.2 million users that is devoted to the rapid worldwide dissemination of research.
- The dblp computer science bibliography is an online database containing open bibliographic information on more than 5 million computer science publications, including journals, proceedings and monographs.
Answer
Pure's data comes from various internal and external sources, ensuring a comprehensive and accurate representation of research activities, outputs, and achievements. The primary sources are:
• Internal systems: HR (fall 2025)
• Certain bibliographic databases with which the library has a license and other free open access sources. See below for the current list of sources.
• External researcher profiles: ORCID
• Manual input by the CEU Pure Implementation team
Current list of external sources:
- Scopus, an abstract and citation database, covers 22,000 titles from more than 5,000 international publishers. It contains over 56 million records from peer-reviewed research literature in the scientific, technical, medical, social sciences, and arts and humanities fields.
- EBSCOhost databases are the most-used, premium online information resources for tens of thousands of institutions worldwide, representing millions of end users.
- Digital Commons is an institutional repository platform for journals, conference proceedings, open educational resources, and more. It is used by over 500 academic institutions, healthcare centers, public libraries, and research centers to manage and showcase their institution's research and scholarship.
- Mendeley is a free reference manager and academic social network that helps scientist organize their research, collaborate with others online, and discover the latest research. It holds more than 20 million records.
- Embase is a biomedical database with over 25 million indexed records and covers relevant and up-to-date biomedical research, from drugs and diseases from pre-clinical studies to critical toxicological information, in more than 7600 peer-reviewed journals, 2500 conferences and 300.000 conference abstracts.
- Contains more than 19 million records from MEDLINE and from medical scientific journals, dating back from 1948. PubMed contains links to full-texts, either from other databases or the journals' publishers. Content must be "E-pub ahead of print", before imported into Pure.
- CiNii is a bibliographic database service for material in Japanese academic libraries, especially focusing on Japanese works and English works published in Japan
- A system developed by publishers in 2000, to ease linking between references in fulltext documents in online scholarly literature, on the publishers' websites. Crossref uses DOI to transmit link information.
- arXiv is an Open Access database with full-text access and contains more than 650.000 e-prints. It covers the domains of physics, mathematics, computer science, quantitative biology, quantitative economy and statistics.
- The SAO/NASA Astrophysics Data System (ADS) is a Digital Library portal for researchers in Astronomy and Physics, operated by the Smithsonian Astrophysical Observatory (SAO) under a NASA grant. The ADS maintains three bibliographic databases containing more than 9.1 million records: Astronomy and Astrophysics, Physics, and arXiv e-prints.
- Espacenet offers free access to more than 90 million patent documents worldwide, containing information about inventions and technical developments from 1836 to today.
- ORCID is a nonprofit organization helping create a world in which all who participate in research, scholarship and innovation are uniquely identified and connected to their contributions and affiliations, across disciplines, borders, and time. Enabling ORCID as an online import source will allow the import of research output meta data available on ORCID's public API, for more information please see the Release notes.
- SSRN is a worldwide collaboration of over 352,400 authors and more than 2.2 million users that is devoted to the rapid worldwide dissemination of research.
- The dblp computer science bibliography is an online database containing open bibliographic information on more than 5 million computer science publications, including journals, proceedings and monographs.