Title: Improving n-gram models by incorporating enhanced distributions
Abstract: Two methods of improving conventional n-gram statistical language models are examined. The first involves using a new set of n-gram statistics that attempt to improve the ability of a system to identify phrases correctly. The second involves replacing the maximum likelihood unigram component with an optimised distribution. We test these approaches by incorporating them into weighted average [1] and deleted estimate [2] language models trained on a large newspaper corpus. The improvements lead to a reduction in perplexity of 4.5% and 4.9% respectively for these models.
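The abstract describes improving n-gram models that mix a higher-order component with a unigram distribution. The paper's own statistics and optimised unigram are not reproduced here; as a minimal sketch of the kind of baseline being improved, the following shows a linearly interpolated bigram model and a perplexity computation. The interpolation weight `lam` and the toy corpus are illustrative assumptions, and plain linear interpolation is a simplification of the weighted-average and deleted-estimate schemes the abstract cites.

```python
import math
from collections import Counter

def train_counts(tokens):
    """Collect maximum-likelihood unigram and bigram counts."""
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    return unigrams, bigrams

def interp_prob(w_prev, w, unigrams, bigrams, total, lam=0.7):
    """P(w | w_prev) as a weighted mix of the ML bigram and unigram estimates.

    lam is an assumed, fixed interpolation weight; the cited models
    choose their weights far more carefully.
    """
    p_uni = unigrams[w] / total
    p_bi = bigrams[(w_prev, w)] / unigrams[w_prev] if unigrams[w_prev] else 0.0
    return lam * p_bi + (1.0 - lam) * p_uni

def perplexity(tokens, unigrams, bigrams, total, lam=0.7):
    """Per-word perplexity of the interpolated model over a token sequence."""
    log_prob = 0.0
    n = 0
    for w_prev, w in zip(tokens, tokens[1:]):
        log_prob += math.log(interp_prob(w_prev, w, unigrams, bigrams, total, lam))
        n += 1
    return math.exp(-log_prob / n)

# Toy demonstration (evaluating on the training data, purely for illustration)
tokens = "the cat sat on the mat the cat sat".split()
unigrams, bigrams = train_counts(tokens)
total = sum(unigrams.values())
pp = perplexity(tokens, unigrams, bigrams, total)
```

A percentage reduction in perplexity, as reported in the abstract, would be computed by comparing `pp` for a baseline model against the same quantity for the modified model on held-out text.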
Publication Year: 2002
Publication Date: 2002-12-24
Language: en
Type: article
Indexed In: ['crossref']
Access and Citation
Cited By Count: 2