Automatic and semi-automatic metadata generation tools

a reflection between the years 2010 and 2020

Authors

Jean Carlos Borges Brito, Dalton Lopes Martins

DOI:

https://doi.org/10.5433/1981-8920.2024v29n1p68

Keywords:

Bibliographic review, Metadata, Automatic and semi-automatic generation, Possibilities, Limitations

Abstract

Objective: To identify the possibilities and limitations of using the analyzed automatic and semi-automatic metadata generation tools.
Methodology: This exploratory investigation consisted of a bibliographic review, with searches carried out in the Scopus, Web of Science, ISTA, LISTA and LISA databases. Data analysis used a mixed method, combining quantitative and qualitative approaches. A total of 49 scientific articles were found; after the adopted criteria were applied, 12 were selected for synthesis.
Results: The results revealed several tools and solutions for generating metadata that use various techniques, methods and functions, and covered their implementation and use. The possibilities and limitations of these solutions were identified, with the aim of contributing to their application and improvement in future research.
Conclusions: Automatic and semi-automatic metadata generation tools can assist information professionals in the organized and efficient management of digital collections and improve information retrieval, which reinforces the contribution of this research to the academic and scientific community in the area of Information Science.

Author Biographies

Jean Carlos Borges Brito, University of Brasília

PhD in Information Science from the Universidade de Brasília (UnB), Brasília, Brazil.

Dalton Lopes Martins, University of Brasília

PhD in Information Science from the Universidade de São Paulo (USP). Professor at the Universidade de Brasília (UnB), Brasília, Brazil.

Published

2024-12-11

How to Cite

Brito, J. C. B., & Martins, D. L. (2024). Automatic and semi-automatic metadata generation tools: a reflection between the years 2010 and 2020. Informação & Informação, 29(1), 68–98. https://doi.org/10.5433/1981-8920.2024v29n1p68