Name Disambiguation Analysis Using the Word Sense Disambiguation Method in Hadith

Ageng Prasetio, Mochammad Arif Bijaksana, Arie Ardiyanti Suryani

Abstract


Name disambiguation is the process of resolving identical or similar names that appear in sentences. Such ambiguity occurs in the hadith of Sahih Bukhari: the name "Abdullah bin Amru" appears in hadith no. 27 and again in hadith no. 58. The names are identical, but there is no proof that they refer to the same person. This is an early indication of name ambiguity in the hadith. Based on this problem, this research aims to disambiguate the names of hadith narrators through classification that considers the chain of narrators (perawi). To solve this problem, the authors used Word Sense Disambiguation (WSD), a process that assigns a meaning to a word based on the context in which it appears. To classify the names in the hadith, the authors used the K-Nearest Neighbor (KNN) algorithm; combining WSD and KNN can reduce the ambiguity of names in the hadith. The data used in this study came from the hadith of Sahih Bukhari and went through a pre-processing stage. The result is a collection of hadith numbers grouped by the same predicted name, with an accuracy of 99% at k = 1. Thus, this method can be used for name disambiguation.
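
The idea of classifying an ambiguous name by the context of its narrator chain can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify the features, distance measure, or data layout, so the bag-of-words representation, the cosine similarity, the function names, and the toy chains and labels below (narrator_a, person_1, and so on) are all assumptions made for illustration only.

```python
import math
from collections import Counter


def bag_of_words(chain):
    """Token counts for a whitespace-separated narrator chain (assumed format)."""
    return Counter(chain.lower().split())


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def knn_predict(train, query_chain, k=1):
    """Return the majority label among the k chains most similar to the query."""
    query = bag_of_words(query_chain)
    ranked = sorted(
        train,
        key=lambda item: cosine(bag_of_words(item[0]), query),
        reverse=True,
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy data: the same surface name appears in different chains.
# The labels stand for distinct individuals; none of this comes from the paper.
TRAIN = [
    ("abdullah_bin_amru narrator_a narrator_b", "person_1"),
    ("abdullah_bin_amru narrator_a narrator_c", "person_1"),
    ("abdullah_bin_amru narrator_x narrator_y", "person_2"),
]

prediction = knn_predict(TRAIN, "abdullah_bin_amru narrator_y", k=1)
print(prediction)
```

With k = 1 the prediction is simply the label of the single most similar chain, so the surrounding narrators act as the disambiguating context in the same way that neighbouring words do in WSD.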


Keywords


Ambiguity; Disambiguation; Hadith; WSD




Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.


Edumatic: Jurnal Pendidikan Informatika is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.