Academic Article
Developing a digital archaeology classification system using Natural Language Processing and Machine Learning techniques
- Creator(s)
- Caravale, Alessandra
- Duran-Silva, Nicolau
- Grimau, Berta
- Moscati, Paola
- Rondelli, Bernardo
- Date
- 2023
- Volume
- 34
- Issue
- 2
- Pages
- 9–32
- Language
- eng
- Rights
- CC BY-NC-ND 4.0
- Abstract
- The authors propose a knowledge map for analysing and accessing scientific content related to Digital Archaeology by leveraging various Machine Learning (ML) techniques. The case study concerns the articles published in our international journal «Archeologia e Calcolatori» in the decade from 2011 to 2020 and, as a benchmark, the publications in the ‘Computer Applications and Quantitative Methods in Archaeology’ (CAA) conference proceedings and journal. The titles and abstracts of the publications in these two data sets were classified into subfields of computer science, based on the ACM’s taxonomy, using a supervised approach. They were further analysed with topic modelling techniques to discover emergent topics, Named Entity Recognition to identify archaeologically relevant entities, and geotagging techniques to link articles with the geographical locations they discuss. The results, although preliminary, provide some methodological suggestions: i) the opportunity to build custom analyses by taking advantage of the increasing availability of open data and metadata; ii) the scope of the contribution of archaeology, and in particular of computational archaeology, to the interdisciplinary domain of Heritage Science; iii) the heuristic and predictive role of different ML techniques in gaining multi-faceted access to data analysis and interpretation.
- Cites
- Finding scientific topics
- A controlled vocabulary for research and innovation in the field of Cultural Heritage & Heritage Sciences
- BERT: Pre-training of deep bidirectional transformers for language understanding
- SPECTER: Document-level Representation Learning using Citation-informed Transformers
- Every document has a geographical scope
- CAA2015. Keep the revolution going: proceedings of the 43rd Annual Conference on Computer Applications and Quantitative Methods in Archaeology (Siena 2015)
- Knowledge, Analysis and Innovative Methods for the Study and the Dissemination of Ancient Urban Areas
- Archaeology in the Digital Era
- Introduction to Controlled Vocabularies: Terminology for Art, Architecture, and Other Cultural Works
- Mapping research in assisted reproduction worldwide
- La sfida delle competenze per il Patrimonio Culturale: complementarità, integrazione, interazione
- A topography of climate change research
- Creating a dataset for Named Entity Recognition in the archaeology domain
- IRPET. Report della piattaforma “Tecnologie, Beni Culturali e Cultura”. Le roadmap dello sviluppo e dell’innovazione (RIS3)
- Unsupervised Topic Discovery in User Comments
- SciRepEval: A Multi-Format Benchmark for Scientific Document Representations
- Topic Modelling on Consumer Financial Protection Bureau Data: An Approach Using BERT Based Embeddings
- OpenAlex: A fully-open index of scholarly works, authors, venues, institutions, and concepts
- Mapping STI Ecosystems via Open Data: Overcoming the Limitations of Conflicting Taxonomies. A Case Study for Climate Change Research in Denmark
- Identifying specialisation domains beyond taxonomies: mapping scientific and technological domains of specialisation via semantic analyses
- L'Intelligence collective : Pour une anthropologie du cyberspace
- 30 anni di Archeologia e Calcolatori. Tra memoria e progettualità
