Abstract: Attention-based models have proved their superiority on many NLP tasks, especially for English. Despite the great potential and importance of language models, little attention has been paid to attention-based language modeling for Persian. In this paper, we fine-tuned two language models, namely BERT and Persian GPT-2, on the Persica corpus. We then evaluated these models by computing their perplexity on a 5-million-word dataset. Both models outperform previous SOTA results on the measure of perplexity. Our results indicate that GPT-2 performs slightly better, with an approximately 10 percent improvement in perplexity, and seems to be a better fit for language modeling. We have proposed a modified version of perplexity, bi-perplexity, which can serve as a measure for comparing language models trained with a masked language modeling objective. We have also introduced an innovative way of using BERT as a language model by devising a new strategy for sampling.
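The abstract compares models by perplexity, i.e., the exponential of the average negative log-likelihood a model assigns to the evaluation tokens. As a minimal sketch (not the paper's implementation, and assuming per-token probabilities are already available from some model), the computation looks like:

```python
import math

def perplexity(token_probs):
    """Perplexity = exp of the average negative log-likelihood
    of the tokens under the model. Lower is better."""
    nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(nll)

# Sanity check: a uniform model over a 4-word vocabulary assigns
# p = 0.25 to every token, so its perplexity equals the vocabulary size.
print(perplexity([0.25, 0.25, 0.25, 0.25]))  # → 4.0
```

For a causal model like GPT-2 the probabilities are the left-to-right next-token predictions; for BERT, whose masked objective conditions on both sides, a standard perplexity is not directly defined, which is presumably what motivates the paper's bi-perplexity variant.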
Publication Year: 2022
Publication Date: 2022-02-23
Language: en
Type: article
Indexed In: Crossref