Lexical Disambiguation (CKBD): A Tool to Identify and Resolve Semantic Conflicts Using Context Knowledge

Said Al Tahat, Kamsuriah Ahmad

Abstract


The schema matching process is a fundamental step in a schema integration system, and its quality impacts the overall performance of the system. Recently, a large number of schema matching approaches have been developed. Until today, the performance of schema matching is inherently uncertain and requires improvement. The most difficult task is inferring the real-world semantics of data from the information provided by schema labels in their representations. Usually, schemas with identical semantics are represented by different vocabularies and only their own designers can completely understand. A schema may contain synonyms and homonyms words. Therefore, it is necessary to understand how the schema elements are "presented"; it is often hard to get aware meaning associated with elements names, due to the semantic ambiguity of human language. Semantic ambiguity problem means the capability of being understood in two or more possible senses. Having more than one meaning for an individual schema element would cause confusion in interpretation of schema name. This may affect negatively on the matching result. Therefore, this paper aims to resolve this problem of semantic ambiguity and represent the intended meaning of the schema labels name, by introducing the CKBD (Context Knowledge-Based Disambiguation) approach. The CKBD is obtained by integrating two pieces of context knowledge:  semantic domain and more frequency used into a disambiguation processor.  Finally, the CKBD is implemented and is tested in a real dataset.  The result is deeply grounded in the ability to detect schema name intended meaning.


Keywords


natural language processing; semantic ambiguity; schema integration; schema matching; word sense disambiguation.

Full Text:

PDF

References


Hossain, J., Sani, N.F.M., Affendey, L.S., Ishak, I., And Kasmiran, K.A.: ‘Semantic Schema Matching Approaches: A Review’, Journal of Theoretical & Applied Information Technology, 2014, 62, (1)

Tahat, S., and Ahmad, K.: ‘Semi-automated schema integration (icase): A tool to identify and resolve naming conflicts’, Australian Journal of Basic & Applied Sciences, 2013, 7

Ahmad, K., Chiew, H.K., and Samad, R.: ‘Intelligent Schema Integrator (ISI): A tool to solve the problem of naming conflict for schema integration’, in Editor (Ed.)^(Eds.): ‘Book Intelligent Schema Integrator (ISI): A tool to solve the problem of naming conflict for schema integration’ (IEEE, 2011, edn.), pp. 1-5

Ahamed, B.B., Ramkumar, T., and Hariharan, S.: ‘Data integration progression in large data source using mapping affinity’, in Editor (Ed.)^(Eds.): ‘Book Data integration progression in large data source using mapping affinity’ (IEEE, 2014, edn.), pp. 16-21

Blomqvist, E., and Thollander, P.: ‘An integrated dataset of energy efficiency measures published as linked open data’, Energy Efficiency, 2015, 8, (6), pp. 1125-1147

Nicklas, D., Schwarz, T., and Mitschang, B.: ‘A Schema-Based Approach to Enable Data Integration on the Fly’, International Journal of Cooperative Information Systems, 2017, 26, (01), pp. 1650010

Kettouch, M.S., Luca, C., Hobbs, M., and Fatima, A.: ‘Data integration approach for semi-structured and structured data (Linked Data)’, in Editor (Ed.)^(Eds.): ‘Book Data integration approach for semi-structured and structured data (Linked Data)’ (IEEE, 2015, edn.), pp. 820-825

He, W., and Da Xu, L.: ‘Integration of distributed enterprise applications: A survey’, IEEE Transactions on Industrial Informatics, 2014, 10, (1), pp. 35-42

Díaz, M., Martín, C., and Rubio, B.: ‘State-of-the-art, challenges, and open issues in the integration of Internet of things and cloud computing’, Journal of Network and Computer Applications, 2016, 67, pp. 99-117

Bergamaschi, S., Beneventano, D., Po, L., and Sorrentino, S.: ‘Automatic normalization and annotation for discovering semantic mappings’: ‘Search computing’ (Springer, 2011), pp. 85-100

Bilke, A.: ‘Duplicate-based Schema Matching’, 2007

Rahm, E., and Bernstein, P.A.: ‘A survey of approaches to automatic schema matching’, the VLDB Journal, 2001, 10, (4), pp. 334-350

Alwan, A.A., Nordin, A., Alzeber, M., and Abualkishik, A.Z.: ‘A Survey of Schema Matching Research using Database Schemas and Instances’, International Journal Of Advanced Computer Science And Applications, 2017, 8, (10), pp. 102-111

Nguyen, Q.V.H., Nguyen, T.T., Miklos, Z., Aberer, K., Gal, A., and Weidlich, M.: ‘Pay-as-you-go reconciliation in schema matching networks’, in Editor (Ed.)^(Eds.): ‘Book Pay-as-you-go reconciliation in schema matching networks’ (IEEE, 2014, edn.), pp. 220-231

Gillani, S., Naeem, M., Habibullah, R., and Qayyum, A.: ‘Semantic schema matching using DBpedia’, International Journal of Intelligent Systems and Applications, 2013, 5, (4), pp. 72

Rachman, M.A.F., and Saptawati, G.A.P.: ‘Database integration based on combination schema matching approach (case study: Multi-database of district health information system)’, in Editor (Ed.)^(Eds.): ‘Book Database integration based on combination schema matching approach (case study: Multi-database of district health information system)’ (IEEE, 2017, edn.), pp. 430-435

Bhattacharjee, S., and Ghosh, S.K.: ‘Automatic resolution of semantic heterogeneity in GIS: An ontology based approach’: ‘Advanced Computing, Networking and Informatics-Volume 1’ (Springer, 2014), pp. 585-591

Bellström, P.: ‘Schema Integration: How to Integrate Static and Dynamic Database Schemata’ (Karlstads universitet, 2010. 2010)

Unal, O., and Afsarmanesh, H.: ‘Schema matching and integration for data sharing among collaborating organizations’, Journal of Software, 2009, 4, (3), pp. 248-261

Unal, O., and Afsarmanesh, H.: ‘Using linguistic techniques for schema matching’, in Editor (Ed.)^(Eds.): ‘Book Using linguistic techniques for schema matching’ (2006, edn.), pp. 115-120

WSD, W.S.D.: ‘Word Sense Disambiguation’, 2015

Al-Harbi, O., Jusoh, S., and Norwawi, N.: ‘Handling ambiguity problems of natural language interface for question answering’, International Journal of Computer Science Issues (IJCSI), 2012, 9, (3), pp. 17

Po, L.: ‘Improving data integration through disambiguation techniques’, Lecture Notes in Computer Science, 2008, 5039, pp. 372-375

Stevenson, M., and Wilks, Y.: ‘Word sense disambiguation’, The Oxford Handbook of Comp. Linguistics, 2003, pp. 249-265

Ponzetto, S.P., and Navigli, R.: ‘Knowledge-rich word sense disambiguation rivaling supervised systems’, in Editor (Ed.)^(Eds.): ‘Book Knowledge-rich word sense disambiguation rivaling supervised systems’ (Association for Computational Linguistics, 2010, edn.), pp. 1522-1531




DOI: http://dx.doi.org/10.18517/ijaseit.9.1.6387

Refbacks

  • There are currently no refbacks.



Published by INSIGHT - Indonesian Society for Knowledge and Human Development