Applying thesauruses in expanding user search queries: From local use to linked data
https://doi.org/10.33186/1027-3689-2022-12-85-103
Abstract
Subject search in natural language is the most difficult kind of search because of phraseological ambiguity. To address this, information systems draw on terms from controlled vocabularies such as thesauruses. The authors examine classifications, thesauruses, subject headings, and authority files in the context of the open networked space of Linked Open Data (LOD). Links between these vocabularies make it possible to expand (complement) user queries with terms from other vocabularies and to navigate to resources in other libraries’ systems. The authors explore the practical application of the EUROVOC and GEMET thesauruses to expand search queries submitted by users of RNPLS&T’s Single Open Information Archive (SOIA), the Portal of Electronic Library (PEL) of the Parliamentary Library of the RF Federal Assembly, and the thematic database “Ecology: Science and technologies”, whose records could potentially be linked. The authors present the findings of the study and describe the problems it revealed.
The article was prepared within the framework of the Government Order “Information support of scientific research of scientists and specialists on the basis of the RNPLS&T Open Archive as the scientific knowledge aggregation system” (FNEG-2022-003) for the years 2022–2024.
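The query expansion outlined in the abstract can be illustrated with a minimal sketch. It assumes a toy in-memory thesaurus whose fields borrow SKOS property names (preferred label, alternative labels, narrower, exact match). The concept identifiers, labels, and the helper names `expand_query` and `to_boolean_query` are hypothetical and are not taken from EUROVOC, GEMET, or the systems discussed in the article.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    """A toy thesaurus concept using SKOS-style property names."""
    pref_label: str
    alt_labels: list[str] = field(default_factory=list)
    narrower: list[str] = field(default_factory=list)     # ids in the same thesaurus
    exact_match: list[str] = field(default_factory=list)  # ids in another thesaurus


# Hypothetical toy data: ids and labels are invented for illustration only.
THESAURUS = {
    "a:1": Concept("air pollution", ["atmospheric pollution"], ["a:2"], ["b:7"]),
    "a:2": Concept("smog"),
    "b:7": Concept("air contamination"),
}


def expand_query(query: str, thesaurus: dict[str, Concept]) -> list[str]:
    """Return the query plus labels of matching, narrower, and linked concepts."""
    needle = query.strip().lower()
    terms = [needle]
    for concept in thesaurus.values():
        labels = [concept.pref_label, *concept.alt_labels]
        if needle not in (label.lower() for label in labels):
            continue
        terms.extend(labels)
        # Follow one hop of narrower and cross-thesaurus links.
        for linked_id in concept.narrower + concept.exact_match:
            linked = thesaurus.get(linked_id)
            if linked is not None:
                terms.extend([linked.pref_label, *linked.alt_labels])
    # Deduplicate while keeping first-seen order.
    return list(dict.fromkeys(term.lower() for term in terms))


def to_boolean_query(terms: list[str]) -> str:
    """Join expanded terms into an OR query with quoted phrases."""
    return " OR ".join(f'"{term}"' for term in terms)
```

A query that matches no concept is returned unchanged, so the expansion only ever widens a search. A real system would presumably resolve the links against published linked-data vocabularies rather than a local dictionary.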
Keywords
About the Authors
M. V. Goncharov
Russian Federation
Mikhail V. Goncharov – Cand. Sc. (Engineering), Associate Professor, Leading Researcher, Head, Group for Perspective Research and Analytic Forecasting, Russian National Public Library for Science and Technology; Associate Professor, Moscow State Linguistic University
Moscow
K. A. Kolosov
Russian Federation
Kirill A. Kolosov – Cand. Sc. (Engineering), Leading Researcher, Russian National Public Library for Science and Technology; Associate Professor, Moscow State Linguistic University
Moscow
E. F. Bychkova
Russian Federation
Elena F. Bychkova – Cand. Sc. (Pedagogy), Leading Researcher, Head, Ecology and Sustainable Development Group of Academic Secretary Department
Moscow
For citations:
Goncharov M.V., Kolosov K.A., Bychkova E.F. Applying thesauruses in expanding user search queries: From local use to linked data. Scientific and Technical Libraries. 2022;(12):85-103. (In Russ.) https://doi.org/10.33186/1027-3689-2022-12-85-103