Conference Publication Details
Mandatory Fields
Joorabchi, A.; Mahdi, A. E.
18th Intl. Conf. on Knowledge Engineering and Knowledge Management - EKAW 2012
Automatic Subject Metadata Generation for Scientific Documents using Wikipedia and Genetic Algorithms
2012
October
Published
1
()
Optional Fields
text mining, scientific digital libraries, subject metadata, keyphrase annotation, keyphrase indexing, Wikipedia, genetic algorithms
32
41
Galway, Ireland
08-OCT-12
12-OCT-12
Topical annotation of documents with keyphrases is a proven method
for revealing the subject of scientific and research documents. However, scientific documents that are manually annotated with keyphrases are in the minority. This paper describes a machine learning-based automatic keyphrase annotation method for scientific documents, which utilizes Wikipedia as a thesaurus
for candidate selection from documents’ content and deploys genetic algorithms
to learn a model for ranking and filtering the most probable keyphrases. Reported experimental results show that the performance of our method, evaluated
in terms of inter-consistency with human annotators, is on a par with that
achieved by humans and outperforms rival supervised methods.
http://ekaw2012.ekaw.org/node/153 ; http://www.skynet.ie/~arash/PDFs/AJ_EKAW2012.pdf ; http://www.youtube.com/watch?v=4uOJwApHZUc
Grant Details