Comparing the naturalness of several approaches for generating F0 contours in German text-to-speech systems

Hansjörg Mixdorff; Dieter Mehnert
Abstract

Generating near-to-natural F0 contours is an important issue in text-to-speech synthesis and contributes vastly to the quality of synthetic speech. In earlier studies by the authors, a model of German intonation was developed that is based on the quantitative Fujisaki model. A typical F0 contour is described as a sequence of major rises and falls, which are modeled by onsets and offsets of accent commands connected to accented syllables. The current paper addresses perception experiments comparing the intonational naturalness of a Fujisaki-model-based TTS and four other German TTS systems with comparably high segmental quality. Natural speech samples were used as a reference. Three of the TTS systems had PSOLA, and one LPC segmentals. Two types of experiments were conducted with 20 subjects: (1) a pair comparison of 15 isolated sentences, (2) a ranking test based on a news passage of about 15 sec produced with each of the systems. Preliminary results from experiment (1) show that on a naturalness scale from 0 to 5, the natural speech samples reach a maximum score of 4.5, with values of 2.8 for the best synthesis, the LPC-based one. The system with Fujisaki-model-based intonation leads the group of PSOLA systems, which is closely clustered at a mean of 2.4.

Published in The Journal of the Acoustical Society of America, vol. 105, no. 2 (Supplement), p. 1302, 1 February 1999. DOI: https://doi.org/10.1121/1.424746

Affiliation (both authors): TU Dresden, Mommsenstr. 13, 01062 Dresden, Germany

Keywords: Stress (linguistics); Pitch contour; Intonation
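The record's abstract describes an F0 contour as a sequence of rises and falls modeled by onsets and offsets of accent commands. As a rough illustration only, the commonly cited form of the Fujisaki model (a base frequency plus phrase and accent components, summed in the log-F0 domain) can be sketched as below. All parameter values here (alpha, beta, gamma, the base frequency, and the command timings and amplitudes) are illustrative assumptions, not values from the study, and the function names are hypothetical; this is not the authors' implementation.

```python
import math


def phrase_response(t, alpha=2.0):
    """Impulse response of the phrase component: alpha^2 * t * exp(-alpha * t) for t >= 0."""
    if t < 0:
        return 0.0
    return alpha * alpha * t * math.exp(-alpha * t)


def accent_response(t, beta=20.0, gamma=0.9):
    """Step response of the accent component, clipped at the ceiling gamma."""
    if t < 0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)


def fujisaki_f0(t, fb, phrase_cmds, accent_cmds, alpha=2.0, beta=20.0, gamma=0.9):
    """Return F0 in Hz at time t (seconds).

    fb          -- base frequency in Hz
    phrase_cmds -- list of (onset_time, magnitude) impulses
    accent_cmds -- list of (onset_time, offset_time, amplitude) steps
    """
    log_f0 = math.log(fb)
    for t0, ap in phrase_cmds:
        log_f0 += ap * phrase_response(t - t0, alpha)
    for t1, t2, aa in accent_cmds:
        # A rise at the command onset and a fall at its offset.
        log_f0 += aa * (accent_response(t - t1, beta, gamma)
                        - accent_response(t - t2, beta, gamma))
    return math.exp(log_f0)


# Assumed example: one phrase command and one accent command spanning 0.3 s to 0.6 s.
FB = 80.0
PHRASES = [(0.0, 0.5)]
ACCENTS = [(0.3, 0.6, 0.4)]
contour = [fujisaki_f0(i * 0.01, FB, PHRASES, ACCENTS) for i in range(150)]
```

In this sketch the accent command raises F0 shortly after its onset and lets it fall back after its offset, which is the rise-fall pattern the abstract ties to accented syllables.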