Bienvenue chez nous !
Logo Ex Libris
 Laissez-vous inspirer ! 

Identification of Significant Keywords from Gujarati Text Documents

  • Couverture cartonnée
  • 164 Nombre de pages
(0) Donner la première évaluation
Évaluations
(0)
(0)
(0)
(0)
(0)
Afficher toutes les évaluations
Information Retrieval(IR) systems are gaining importance due to wide range of applications like recommender systems, search engine... Lire la suite
CHF 96.00
Habituellement expédié sous 2 à 4 jours ouvrés.
Commande avec livraison dans une succursale

Description

Information Retrieval(IR) systems are gaining importance due to wide range of applications like recommender systems, search engines, etc., however, most of the IR systems use statistical methods built on top of bag-of-words approach for text retrieval. Graph-of-words approach is an alternative to bag-of-words approach that uses graph theoretic methods to rank keywords and related documents. We represent text documents as graphs whose vertices correspond to the unique terms belonging to the document. The edges represent co-occurrences between the terms. The underlying assumption is that the terms that co-occur have some sort of semantic relationship that can be harnessed for IR systems. The significant terms can be extracted using graph centrality measures. In this book, we have proposed a novel graph-of-words indexing technique using eigenvector scores that uses case separation for Gujarati language. We compared the performance of IR systems of our approach over the classical bag-of-words approach, mean average precision (MAP) values obtained in our experiments show that our approach has shown significant improvement over classical approaches.

Auteur

Dr. Hardik Joshi is an Assistant Professor with the Dept. of Computer Sc., Gujarat University, India. His research interests include Natural Language Processing and Information Retrieval.

Informations sur le produit

Titre: Identification of Significant Keywords from Gujarati Text Documents
Auteur:
Code EAN: 9786200082763
ISBN: 978-620-0-08276-3
Format: Couverture cartonnée
Editeur: LAP Lambert Academic Publishing
Genre: Informatique
nombre de pages: 164
Poids: g
Taille: H220mm x B220mm x T150mm
Année: 2019