An Empirical Exploration of Recurrent Network Architectures

Rafał Józefowicz; Wojciech Zaremba; Ilya Sutskever
Published in: International Conference on Machine Learning, 2015 (publication date 2015-07-06), pages 2342-2350
Type: proceedings article
Landing page: http://proceedings.mlr.press/v37/jozefowicz15.pdf
OpenAlex record: https://openalex.org/W581956982 (cited by 1317 as of the record's 2024-12-24 update)

Affiliations: Rafał Józefowicz (Google); Wojciech Zaremba (New York University, Facebook); Ilya Sutskever (Google)

Abstract

The Recurrent Neural Network (RNN) is an extremely powerful sequence model that is often difficult to train. The Long Short-Term Memory (LSTM) is a specific RNN architecture whose design makes it much easier to train. While wildly successful in practice, the LSTM's architecture appears to be ad-hoc so it is not clear if it is optimal, and the significance of its individual components is unclear.

In this work, we aim to determine whether the LSTM architecture is optimal or whether much better architectures exist. We conducted a thorough architecture search where we evaluated over ten thousand different RNN architectures, and identified an architecture that outperforms both the LSTM and the recently-introduced Gated Recurrent Unit (GRU) on some but not all tasks. We found that adding a bias of 1 to the LSTM's forget gate closes the gap between the LSTM and the GRU.

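The abstract's remark about a forget-gate bias of 1 can be illustrated with a toy calculation. The sketch below is not taken from the paper. It assumes the standard LSTM cell update, c_t = f_t * c_{t-1} + i_t * g_t with f_t = sigmoid(pre-activation + bias), and it assumes the input-dependent part of the forget gate's pre-activation is about 0 at initialization (small random weights). The helper names `sigmoid` and `retained_fraction` are hypothetical.

```python
import math


def sigmoid(x: float) -> float:
    """Logistic function used by LSTM gates."""
    return 1.0 / (1.0 + math.exp(-x))


def retained_fraction(forget_bias: float, steps: int,
                      preactivation: float = 0.0) -> float:
    """Fraction of an initial cell value that survives `steps` updates.

    Assumption: the forget gate's pre-activation stays at `preactivation`
    for every step, so the gate value f is constant and the old cell
    content decays as f ** steps.
    """
    f = sigmoid(preactivation + forget_bias)
    return f ** steps


# Compare a forget-gate bias of 0 with a bias of 1 over ten time steps.
for bias in (0.0, 1.0):
    gate = sigmoid(bias)
    kept = retained_fraction(bias, steps=10)
    print(f"bias={bias}: gate={gate:.3f}, kept after 10 steps={kept:.5f}")
```

Under these assumptions, a larger forget-gate bias keeps the gate closer to 1 early in training, so more of the cell state (and more gradient) passes through many time steps. That is one plausible reading of why the bias helps, not a result reported here.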