Home // IARIA Congress 2023, The 2023 IARIA Annual Congress on Frontiers in Science, Technology, Services, and Applications // View article


Estimating Text Similarity based on Semantic Concept Embeddings

Authors:
Tim vor der Brück
Marc Pouly

Keywords: Concepts; MultiNet; Concept embeddings; Semantic similarity estimation.

Abstract:
Due to their ease of use and high accuracy, Word2Vec (W2V) word embeddings enjoy great success in the semantic representation of words, sentences, and whole documents as well as for semantic similarity estimation. However, they have the shortcoming that they are directly extracted from a surface representation, which does not adequately represent human thought processes and also performs poorly for highly ambiguous words. Therefore, we propose semantic Concept Embeddings (CE) based on the MultiNet Semantic Network (SN) formalism, which addresses both shortcomings. The evaluation on a marketing target group distribution task showed that the accuracy of predicted target groups can be increased by combining traditional word embeddings with semantic CE.

Pages: 208 to 214

Copyright: Copyright (c) IARIA, 2023

Publication date: November 13, 2023

Published in: conference

ISBN: 978-1-68558-089-6

Location: Valencia, Spain

Dates: from November 13, 2023 to November 17, 2023