Title: Development of Different Word Vectors and Testing Using Text Classification Algorithms for Telugu
Abstract: Word embedding methods represent words numerically. Text data cannot be processed directly by machine learning or deep learning algorithms; these algorithms process numerical data efficiently, so word embedding techniques are needed to transform text data into numerical form. One-hot encoding vectors of real-valued numbers are simple and easy to generate, while researchers now widely use Word2vec for semantic word representation. In the literature review, we found that fewer tools and resources are available for Indian languages compared to European languages. We therefore construct word embeddings (vectors) using one-hot encoding and the Word2vec strategy. In this paper, we evaluate these vectors using supervised machine learning algorithms for sentiment classification. We pursue a two-step approach: the first step is to generate a vocabulary from a news corpus and create word vectors using various word embedding methods; the second step is to validate the vector quality using machine learning algorithms. We preprocessed the corpus we received and obtained 178,210 types and 929,594 tokens, so our vocabulary contains 178,210 unique words. We used a labeled corpus, i.e., movie review sentences, together with the vocabulary to develop sentence vectors. Using the one-hot encoding and Word2vec vector models, we translated sentences into vectors. Once the labeled sentences were translated into vectors, three machine learning algorithms were trained and evaluated. Finally, we compared the outcomes.
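The pipeline the abstract describes (build a vocabulary of unique words, then turn each sentence into a vector from one-hot word vectors) can be sketched as below. This is a minimal illustration, not the authors' code: it assumes whitespace tokenization and a bag-of-words sum of one-hot vectors as the sentence representation.

```python
def build_vocab(corpus):
    """Map each unique word (type) in the corpus to an integer index."""
    vocab = {}
    for sentence in corpus:
        for word in sentence.split():
            vocab.setdefault(word, len(vocab))
    return vocab

def one_hot(word, vocab):
    """One-hot vector: 1 at the word's index, 0 everywhere else."""
    vec = [0] * len(vocab)
    vec[vocab[word]] = 1
    return vec

def sentence_vector(sentence, vocab):
    """Sum of one-hot vectors, i.e. a bag-of-words count vector."""
    vec = [0] * len(vocab)
    for word in sentence.split():
        if word in vocab:  # skip out-of-vocabulary words
            vec[vocab[word]] += 1
    return vec

# Tiny hypothetical example (the paper uses Telugu movie reviews).
corpus = ["good movie", "bad movie"]
vocab = build_vocab(corpus)
print(vocab)                                      # {'good': 0, 'movie': 1, 'bad': 2}
print(sentence_vector("good good movie", vocab))  # [2, 1, 0]
```

The resulting sentence vectors (here, or from averaged Word2vec embeddings) are what get fed to the classifiers as training features.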
Publication Year: 2022
Publication Date: 2022-01-01
Language: en
Type: book-chapter
Indexed In: ['crossref']
Access and Citation
Cited By Count: 2