Concatenative Text-to-Speech Synthesis System for Communication Recognition

Rahul Kumar Jaiswal; Rajesh Kumar Dubey
{'id': 'https://openalex.org/W4205598694', 'doi': 'https://doi.org/10.1109/iceca52323.2021.9675855', 'title': 'Concatenative Text-to-Speech Synthesis System for Communication Recognition', 'display_name': 'Concatenative Text-to-Speech Synthesis System for Communication Recognition', 'publication_year': 2021, 'publication_date': '2021-12-02', 'ids': {'openalex': 'https://openalex.org/W4205598694', 'doi': 'https://doi.org/10.1109/iceca52323.2021.9675855'}, 'language': 'en', 'primary_location': {'is_oa': False, 'landing_page_url': 'https://doi.org/10.1109/iceca52323.2021.9675855', 'pdf_url': None, 'source': {'id': 'https://openalex.org/S4363607835', 'display_name': '2021 5th International Conference on Electronics, Communication and Aerospace Technology (ICECA)', 'issn_l': None, 'issn': None, 'is_oa': False, 'is_in_doaj': False, 'is_core': False, 'host_organization': None, 'host_organization_name': None, 'host_organization_lineage': [], 'host_organization_lineage_names': [], 'type': 'conference'}, 'license': None, 'license_id': None, 'version': None, 'is_accepted': False, 'is_published': False}, 'type': 'article', 'type_crossref': 'proceedings-article', 'indexed_in': ['crossref'], 'open_access': {'is_oa': False, 'oa_status': 'closed', 'oa_url': None, 'any_repository_has_fulltext': False}, 'authorships': [{'author_position': 'first', 'author': {'id': 'https://openalex.org/A5050008382', 'display_name': 'Rahul Kumar Jaiswal', 'orcid': 'https://orcid.org/0000-0003-3800-7235'}, 'institutions': [{'id': 'https://openalex.org/I200650556', 'display_name': 'University of Agder', 'ror': 'https://ror.org/03x297z98', 'country_code': 'NO', 'type': 'education', 'lineage': ['https://openalex.org/I200650556']}], 'countries': ['NO'], 'is_corresponding': False, 'raw_author_name': 'Rahul Kumar Jaiswal', 'raw_affiliation_strings': ['Faculty of Engineering and Science, University of Agder, Grimstad, Norway'], 'affiliations': [{'raw_affiliation_string': 'Faculty of Engineering and Science, University of Agder, Grimstad, Norway', 'institution_ids': ['https://openalex.org/I200650556']}]}, {'author_position': 'last', 'author': {'id': 'https://openalex.org/A5002172554', 'display_name': 'Rajesh Kumar Dubey', 'orcid': 'https://orcid.org/0000-0003-0028-507X'}, 'institutions': [{'id': 'https://openalex.org/I4210133503', 'display_name': 'Central University of Haryana', 'ror': 'https://ror.org/03mtwkv54', 'country_code': 'IN', 'type': 'education', 'lineage': ['https://openalex.org/I4210133503']}], 'countries': ['IN'], 'is_corresponding': False, 'raw_author_name': 'Rajesh Kumar Dubey', 'raw_affiliation_strings': ['School of Engineering and Technology, Central University of Haryana, Mahendragarh, India'], 'affiliations': [{'raw_affiliation_string': 'School of Engineering and Technology, Central University of Haryana, Mahendragarh, India', 'institution_ids': ['https://openalex.org/I4210133503']}]}], 'institution_assertions': [], 'countries_distinct_count': 2, 'institutions_distinct_count': 2, 'corresponding_author_ids': [], 'corresponding_institution_ids': [], 'apc_list': None, 'apc_paid': None, 'fwci': 0.549, 'has_fulltext': False, 'cited_by_count': 4, 'citation_normalized_percentile': {'value': 0.801724, 'is_in_top_1_percent': False, 'is_in_top_10_percent': False}, 'cited_by_percentile_year': {'min': 77, 'max': 80}, 'biblio': {'volume': None, 'issue': None, 'first_page': None, 'last_page': None}, 'is_retracted': False, 'is_paratext': False, 'primary_topic': {'id': 'https://openalex.org/T10201', 'display_name': 'Speech Recognition Technology', 'score': 0.999, 'subfield': {'id': 'https://openalex.org/subfields/1702', 'display_name': 'Artificial Intelligence'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, 'topics': [{'id': 'https://openalex.org/T10201', 'display_name': 'Speech Recognition Technology', 'score': 0.999, 'subfield': {'id': 'https://openalex.org/subfields/1702', 'display_name': 'Artificial Intelligence'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, {'id': 'https://openalex.org/T12031', 'display_name': 'Dialogue Act Modeling for Spoken Language Systems', 'score': 0.9955, 'subfield': {'id': 'https://openalex.org/subfields/1702', 'display_name': 'Artificial Intelligence'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, {'id': 'https://openalex.org/T10181', 'display_name': 'Statistical Machine Translation and Natural Language Processing', 'score': 0.9882, 'subfield': {'id': 'https://openalex.org/subfields/1702', 'display_name': 'Artificial Intelligence'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}], 'keywords': [{'id': 'https://openalex.org/keywords/spoken-dialogue-systems', 'display_name': 'Spoken Dialogue Systems', 'score': 0.559221}, {'id': 'https://openalex.org/keywords/end-to-end-speech-recognition', 'display_name': 'End-to-End Speech Recognition', 'score': 0.554618}, {'id': 'https://openalex.org/keywords/automatic-speech-recognition', 'display_name': 'Automatic Speech Recognition', 'score': 0.540449}, {'id': 'https://openalex.org/keywords/user-simulation', 'display_name': 'User Simulation', 'score': 0.531983}, {'id': 'https://openalex.org/keywords/natural-language-generation', 'display_name': 'Natural Language Generation', 'score': 0.5272}], 'concepts': [{'id': 'https://openalex.org/C14999030', 'wikidata': 'https://www.wikidata.org/wiki/Q16346', 'display_name': 'Speech synthesis', 'level': 2, 'score': 0.87839603}, {'id': 'https://openalex.org/C134537474', 'wikidata': 'https://www.wikidata.org/wiki/Q17144832', 'display_name': 'Naturalness', 'level': 2, 'score': 0.81982267}, {'id': 'https://openalex.org/C41008148', 'wikidata': 'https://www.wikidata.org/wiki/Q21198', 'display_name': 'Computer science', 'level': 0, 'score': 0.8103828}, {'id': 'https://openalex.org/C28490314', 'wikidata': 'https://www.wikidata.org/wiki/Q189436', 'display_name': 'Speech recognition', 'level': 1, 'score': 0.67956406}, {'id': 'https://openalex.org/C504749915', 'wikidata': 'https://www.wikidata.org/wiki/Q9010971', 'display_name': 'Speech technology', 'level': 3, 'score': 0.6065676}, {'id': 'https://openalex.org/C177291462', 'wikidata': 'https://www.wikidata.org/wiki/Q423038', 'display_name': 'Active listening', 'level': 2, 'score': 0.60391265}, {'id': 'https://openalex.org/C2780451532', 'wikidata': 'https://www.wikidata.org/wiki/Q759676', 'display_name': 'Task (project management)', 'level': 2, 'score': 0.60101444}, {'id': 'https://openalex.org/C554936623', 'wikidata': 'https://www.wikidata.org/wiki/Q199657', 'display_name': 'Reading (process)', 'level': 2, 'score': 0.5246368}, {'id': 'https://openalex.org/C2776608160', 'wikidata': 'https://www.wikidata.org/wiki/Q4785462', 'display_name': 'Natural (archaeology)', 'level': 2, 'score': 0.48794407}, {'id': 'https://openalex.org/C2778348673', 'wikidata': 'https://www.wikidata.org/wiki/Q739302', 'display_name': 'Production (economics)', 'level': 2, 'score': 0.46770877}, {'id': 'https://openalex.org/C61328038', 'wikidata': 'https://www.wikidata.org/wiki/Q3358061', 'display_name': 'Speech processing', 'level': 2, 'score': 0.4273022}, {'id': 'https://openalex.org/C204201278', 'wikidata': 'https://www.wikidata.org/wiki/Q1332614', 'display_name': 'Voice activity detection', 'level': 3, 'score': 0.41125357}, {'id': 'https://openalex.org/C204321447', 'wikidata': 'https://www.wikidata.org/wiki/Q30642', 'display_name': 'Natural language processing', 'level': 1, 'score': 0.38580215}, {'id': 'https://openalex.org/C41895202', 'wikidata': 'https://www.wikidata.org/wiki/Q8162', 'display_name': 'Linguistics', 'level': 1, 'score': 0.09715128}, {'id': 'https://openalex.org/C127413603', 'wikidata': 'https://www.wikidata.org/wiki/Q11023', 'display_name': 'Engineering', 'level': 0, 'score': 0.09389359}, {'id': 'https://openalex.org/C138885662', 'wikidata': 'https://www.wikidata.org/wiki/Q5891', 'display_name': 'Philosophy', 'level': 0, 'score': 0.0}, {'id': 'https://openalex.org/C121332964', 'wikidata': 'https://www.wikidata.org/wiki/Q413', 'display_name': 'Physics', 'level': 0, 'score': 0.0}, {'id': 'https://openalex.org/C139719470', 'wikidata': 'https://www.wikidata.org/wiki/Q39680', 'display_name': 'Macroeconomics', 'level': 1, 'score': 0.0}, {'id': 'https://openalex.org/C46312422', 'wikidata': 'https://www.wikidata.org/wiki/Q11024', 'display_name': 'Communication', 'level': 1, 'score': 0.0}, {'id': 'https://openalex.org/C201995342', 'wikidata': 'https://www.wikidata.org/wiki/Q682496', 'display_name': 'Systems engineering', 'level': 1, 'score': 0.0}, {'id': 'https://openalex.org/C166957645', 'wikidata': 'https://www.wikidata.org/wiki/Q23498', 'display_name': 'Archaeology', 'level': 1, 'score': 0.0}, {'id': 'https://openalex.org/C62520636', 'wikidata': 'https://www.wikidata.org/wiki/Q944', 'display_name': 'Quantum mechanics', 'level': 1, 'score': 0.0}, {'id': 'https://openalex.org/C144024400', 'wikidata': 'https://www.wikidata.org/wiki/Q21201', 'display_name': 'Sociology', 'level': 0, 'score': 0.0}, {'id': 'https://openalex.org/C162324750', 'wikidata': 'https://www.wikidata.org/wiki/Q8134', 'display_name': 'Economics', 'level': 0, 'score': 0.0}, {'id': 'https://openalex.org/C95457728', 'wikidata': 'https://www.wikidata.org/wiki/Q309', 'display_name': 'History', 'level': 0, 'score': 0.0}], 'mesh': [], 'locations_count': 1, 'locations': [{'is_oa': False, 'landing_page_url': 'https://doi.org/10.1109/iceca52323.2021.9675855', 'pdf_url': None, 'source': {'id': 'https://openalex.org/S4363607835', 'display_name': '2021 5th International Conference on Electronics, Communication and Aerospace Technology (ICECA)', 'issn_l': None, 'issn': None, 'is_oa': False, 'is_in_doaj': False, 'is_core': False, 'host_organization': None, 'host_organization_name': None, 'host_organization_lineage': [], 'host_organization_lineage_names': [], 'type': 'conference'}, 'license': None, 'license_id': None, 'version': None, 'is_accepted': False, 'is_published': False}], 'best_oa_location': None, 'sustainable_development_goals': [{'score': 0.63, 'display_name': 'Quality education', 'id': 'https://metadata.un.org/sdg/4'}], 'grants': [], 'datasets': [], 'versions': [], 'referenced_works_count': 21, 'referenced_works': ['https://openalex.org/W1544407768', 'https://openalex.org/W1570629387', 'https://openalex.org/W2015876361', 'https://openalex.org/W2102737569', 'https://openalex.org/W2132130797', 'https://openalex.org/W2275192249', 'https://openalex.org/W2308409568', 'https://openalex.org/W2326830712', 'https://openalex.org/W2566053671', 'https://openalex.org/W2635977642', 'https://openalex.org/W2697504846', 'https://openalex.org/W2734544569', 'https://openalex.org/W2963091184', 'https://openalex.org/W2963809228', 'https://openalex.org/W2996286887', 'https://openalex.org/W3038342317', 'https://openalex.org/W3092028330', 'https://openalex.org/W3115402873', 'https://openalex.org/W3159740474', 'https://openalex.org/W3161782335', 'https://openalex.org/W4249399411'], 'related_works': ['https://openalex.org/W642007152', 'https://openalex.org/W596245619', 'https://openalex.org/W2946856121', 'https://openalex.org/W2550171623', 'https://openalex.org/W2320148263', 'https://openalex.org/W2184127972', 'https://openalex.org/W2164147372', 'https://openalex.org/W2008120082', 'https://openalex.org/W1914543332', 'https://openalex.org/W168921769'], 'abstract_inverted_index': {'Text-to-speech': [0], '(TTS)': [1], 'synthesis': [2, 36, 68, 114, 123, 162], 'is': [3, 16, 28, 168, 177], 'one': [4], 'of': [5, 10, 24, 34, 82, 95, 98, 183], 'the': [6, 19, 22, 47, 61, 85, 92, 120, 155, 159, 184], 'rapidly': [7], 'emerging': [8], 'areas': [9], 'computer-to-human': [11], 'interaction': [12], 'technology.': [13], 'Human-like': [14], 'speech': [15, 67, 122, 136, 145, 156, 161], 'replicated': [17], 'by': [18, 158, 179], 'computer': [20], 'with': [21], 'introduction': [23], 'input': [25], 'text': [26, 143], 'which': [27, 72], 'usually': [29], 'very': [30], 'natural.': [31], 'Real-life': [32], 'applications': [33], 'TTS': [35, 113, 148], 'technique': [37, 167], 'make': [38], 'users': [39, 99], 'task': [40], 'hassle-free.': [41], 'For': [42], 'example,': [43], 'reading': [44], 'book': [45], 'for': [46, 116, 144], 'visually': [48], 'impaired': [49], 'people,': [50], 'paying': [51], 'electricity': [52], 'bill': [53], 'through': [54, 147], 'automated': [55], 'call-centre,': [56], 'announcing': [57], 'train': [58], 'information': [59], 'at': [60], 'railway': [62], 'station,': [63], 'etc.': [64], 'Traditionally,': [65], 'rule-based': [66], 'methods': [69], 'are': [70, 150], 'deployed': [71], 'find': [73], 'difficulties': [74], 'in': [75, 80, 84, 132, 141, 164], 'obtaining': [76], 'optimal': [77], 'rules,': [78], 'resulting': [79], 'lack': [81], 'naturalness': [83], 'generated': [86, 157, 185], 'synthesized': [87, 186], 'speech.': [88, 187], 'Alternatively,': [89], 'to': [90], 'meet': [91], 'desired': [93], 'quality': [94], 'experience': [96], '(QoE)': [97], 'while': [100], 'using': [101, 119], 'these': [102], 'applications,': [103], 'this': [104], 'paper': [105], 'designs': [106], 'and': [107, 111, 125, 128, 170], 'develops': [108], 'a': [109], 'simple': [110], 'robust': [112], 'system': [115, 149], 'English': [117], 'language': [118], 'concatenative': [121, 160], 'method': [124], 'its': [126, 130], 'variants': [127], 'finds': [129], 'suitability': [131], 'intelligible': [133], 'and/or': [134], 'natural': [135], 'production.': [137], 'Various': [138], 'steps': [139], 'involved': [140], 'processing': [142], 'production': [146], 'described.': [151], 'Results': [152], 'demonstrate': [153], 'that': [154], 'method,': [163], 'particular,': [165], 'unit-selection': [166], 'smoother': [169], 'natural,': [171], 'sounding': [172], 'like': [173], 'human': [174], 'voice.': [175], 'This': [176], 'supported': [178], 'informal': [180], 'listening': [181], 'test': [182]}, 'cited_by_api_url': 'https://api.openalex.org/works?filter=cites:W4205598694', 'counts_by_year': [{'year': 2024, 'cited_by_count': 2}, {'year': 2023, 'cited_by_count': 1}, {'year': 2022, 'cited_by_count': 1}], 'updated_date': '2024-09-18T05:01:55.892994', 'created_date': '2022-01-26'}
Publication Information

Basic Information

Access and Citation

AI Researcher Chatbot

Primary Location

Authors

Topics

Keywords

Related Works