On Automating Hyperparameter Optimization for Deep Learning Applications

Nabila Shawki; R. Rodriguez Nunez; Iyad Obeid; J. Picone
{'id': 'https://openalex.org/W4210794429', 'doi': 'https://doi.org/10.1109/spmb52430.2021.9672266', 'title': 'On Automating Hyperparameter Optimization for Deep Learning Applications', 'display_name': 'On Automating Hyperparameter Optimization for Deep Learning Applications', 'publication_year': 2021, 'publication_date': '2021-12-04', 'ids': {'openalex': 'https://openalex.org/W4210794429', 'doi': 'https://doi.org/10.1109/spmb52430.2021.9672266'}, 'language': 'en', 'primary_location': {'is_oa': False, 'landing_page_url': 'https://doi.org/10.1109/spmb52430.2021.9672266', 'pdf_url': None, 'source': None, 'license': None, 'license_id': None, 'version': None, 'is_accepted': False, 'is_published': False}, 'type': 'article', 'type_crossref': 'proceedings-article', 'indexed_in': ['crossref'], 'open_access': {'is_oa': False, 'oa_status': 'closed', 'oa_url': None, 'any_repository_has_fulltext': False}, 'authorships': [{'author_position': 'first', 'author': {'id': 'https://openalex.org/A5013606385', 'display_name': 'Nabila Shawki', 'orcid': None}, 'institutions': [{'id': 'https://openalex.org/I84392919', 'display_name': 'Temple University', 'ror': 'https://ror.org/00kx1jb78', 'country_code': 'US', 'type': 'education', 'lineage': ['https://openalex.org/I84392919']}], 'countries': ['US'], 'is_corresponding': False, 'raw_author_name': 'N. Shawki', 'raw_affiliation_strings': ['Neural Engineering Data Consortium, Temple University, Philadelphia, Pennsylvania, USA'], 'affiliations': [{'raw_affiliation_string': 'Neural Engineering Data Consortium, Temple University, Philadelphia, Pennsylvania, USA', 'institution_ids': ['https://openalex.org/I84392919']}]}, {'author_position': 'middle', 'author': {'id': 'https://openalex.org/A5032552882', 'display_name': 'R. Rodriguez Nunez', 'orcid': None}, 'institutions': [{'id': 'https://openalex.org/I4210140143', 'display_name': 'Dynamic Systems (United States)', 'ror': 'https://ror.org/043vm9914', 'country_code': 'US', 'type': 'company', 'lineage': ['https://openalex.org/I4210140143']}, {'id': 'https://openalex.org/I84392919', 'display_name': 'Temple University', 'ror': 'https://ror.org/00kx1jb78', 'country_code': 'US', 'type': 'education', 'lineage': ['https://openalex.org/I84392919']}], 'countries': ['US'], 'is_corresponding': False, 'raw_author_name': 'R. Rodriguez Nunez', 'raw_affiliation_strings': ['Dynamical Systems Lab, Temple University, Philadelphia, Pennsylvania, USA'], 'affiliations': [{'raw_affiliation_string': 'Dynamical Systems Lab, Temple University, Philadelphia, Pennsylvania, USA', 'institution_ids': ['https://openalex.org/I4210140143', 'https://openalex.org/I84392919']}]}, {'author_position': 'middle', 'author': {'id': 'https://openalex.org/A5023383141', 'display_name': 'Iyad Obeid', 'orcid': 'https://orcid.org/0000-0002-5796-843X'}, 'institutions': [{'id': 'https://openalex.org/I84392919', 'display_name': 'Temple University', 'ror': 'https://ror.org/00kx1jb78', 'country_code': 'US', 'type': 'education', 'lineage': ['https://openalex.org/I84392919']}], 'countries': ['US'], 'is_corresponding': False, 'raw_author_name': 'I. Obeid', 'raw_affiliation_strings': ['Neural Engineering Data Consortium, Temple University, Philadelphia, Pennsylvania, USA'], 'affiliations': [{'raw_affiliation_string': 'Neural Engineering Data Consortium, Temple University, Philadelphia, Pennsylvania, USA', 'institution_ids': ['https://openalex.org/I84392919']}]}, {'author_position': 'last', 'author': {'id': 'https://openalex.org/A5004343613', 'display_name': 'J. Picone', 'orcid': 'https://orcid.org/0000-0001-6836-7886'}, 'institutions': [{'id': 'https://openalex.org/I84392919', 'display_name': 'Temple University', 'ror': 'https://ror.org/00kx1jb78', 'country_code': 'US', 'type': 'education', 'lineage': ['https://openalex.org/I84392919']}], 'countries': ['US'], 'is_corresponding': False, 'raw_author_name': 'J. Picone', 'raw_affiliation_strings': ['Neural Engineering Data Consortium, Temple University, Philadelphia, Pennsylvania, USA'], 'affiliations': [{'raw_affiliation_string': 'Neural Engineering Data Consortium, Temple University, Philadelphia, Pennsylvania, USA', 'institution_ids': ['https://openalex.org/I84392919']}]}], 'countries_distinct_count': 1, 'institutions_distinct_count': 2, 'corresponding_author_ids': [], 'corresponding_institution_ids': [], 'apc_list': None, 'apc_paid': None, 'fwci': 1.655, 'has_fulltext': False, 'cited_by_count': 14, 'citation_normalized_percentile': {'value': 0.833261, 'is_in_top_1_percent': False, 'is_in_top_10_percent': False}, 'cited_by_percentile_year': {'min': 91, 'max': 92}, 'biblio': {'volume': None, 'issue': None, 'first_page': None, 'last_page': None}, 'is_retracted': False, 'is_paratext': False, 'primary_topic': {'id': 'https://openalex.org/T12535', 'display_name': 'Learning with Noisy Labels in Machine Learning', 'score': 0.9998, 'subfield': {'id': 'https://openalex.org/subfields/1702', 'display_name': 'Artificial Intelligence'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, 'topics': [{'id': 'https://openalex.org/T12535', 'display_name': 'Learning with Noisy Labels in Machine Learning', 'score': 0.9998, 'subfield': {'id': 'https://openalex.org/subfields/1702', 'display_name': 'Artificial Intelligence'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, {'id': 'https://openalex.org/T10036', 'display_name': 'Deep Learning in Computer Vision and Image Recognition', 'score': 0.9969, 'subfield': {'id': 'https://openalex.org/subfields/1707', 'display_name': 'Computer Vision and Pattern Recognition'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, {'id': 'https://openalex.org/T12072', 'display_name': 'Active Learning in Machine Learning Research', 'score': 0.9947, 'subfield': {'id': 'https://openalex.org/subfields/1702', 'display_name': 'Artificial Intelligence'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}], 'keywords': [{'id': 'https://openalex.org/keywords/overfitting', 'display_name': 'Overfitting', 'score': 0.8187262}, {'id': 'https://openalex.org/keywords/hyperparameter', 'display_name': 'Hyperparameter', 'score': 0.81587243}, {'id': 'https://openalex.org/keywords/hyperparameter-optimization', 'display_name': 'Hyperparameter Optimization', 'score': 0.572596}, {'id': 'https://openalex.org/keywords/meta-learning', 'display_name': 'Meta-Learning', 'score': 0.54501}, {'id': 'https://openalex.org/keywords/deep-learning', 'display_name': 'Deep Learning', 'score': 0.537201}, {'id': 'https://openalex.org/keywords/automated-machine-learning', 'display_name': 'Automated Machine Learning', 'score': 0.524772}, {'id': 'https://openalex.org/keywords/robust-learning', 'display_name': 'Robust Learning', 'score': 0.506566}, {'id': 'https://openalex.org/keywords/dropout', 'display_name': 'Dropout (neural networks)', 'score': 0.48764053}], 'concepts': [{'id': 'https://openalex.org/C119857082', 'wikidata': 'https://www.wikidata.org/wiki/Q2539', 'display_name': 'Machine learning', 'level': 1, 'score': 0.86083066}, {'id': 'https://openalex.org/C154945302', 'wikidata': 'https://www.wikidata.org/wiki/Q11660', 'display_name': 'Artificial intelligence', 'level': 1, 'score': 0.8244299}, {'id': 'https://openalex.org/C22019652', 'wikidata': 'https://www.wikidata.org/wiki/Q331309', 'display_name': 'Overfitting', 'level': 3, 'score': 0.8187262}, {'id': 'https://openalex.org/C8642999', 'wikidata': 'https://www.wikidata.org/wiki/Q4171168', 'display_name': 'Hyperparameter', 'level': 2, 'score': 0.81587243}, {'id': 'https://openalex.org/C41008148', 'wikidata': 'https://www.wikidata.org/wiki/Q21198', 'display_name': 'Computer science', 'level': 0, 'score': 0.7748069}, {'id': 'https://openalex.org/C108583219', 'wikidata': 'https://www.wikidata.org/wiki/Q197536', 'display_name': 'Deep learning', 'level': 2, 'score': 0.59675676}, {'id': 'https://openalex.org/C2776145597', 'wikidata': 'https://www.wikidata.org/wiki/Q25339462', 'display_name': 'Dropout (neural networks)', 'level': 2, 'score': 0.48764053}, {'id': 'https://openalex.org/C207685749', 'wikidata': 'https://www.wikidata.org/wiki/Q2088941', 'display_name': 'Domain knowledge', 'level': 2, 'score': 0.4194566}, {'id': 'https://openalex.org/C98045186', 'wikidata': 'https://www.wikidata.org/wiki/Q205663', 'display_name': 'Process (computing)', 'level': 2, 'score': 0.41856438}, {'id': 'https://openalex.org/C50644808', 'wikidata': 'https://www.wikidata.org/wiki/Q192776', 'display_name': 'Artificial neural network', 'level': 2, 'score': 0.28586194}, {'id': 'https://openalex.org/C111919701', 'wikidata': 'https://www.wikidata.org/wiki/Q9135', 'display_name': 'Operating system', 'level': 1, 'score': 0.0}], 'mesh': [], 'locations_count': 1, 'locations': [{'is_oa': False, 'landing_page_url': 'https://doi.org/10.1109/spmb52430.2021.9672266', 'pdf_url': None, 'source': None, 'license': None, 'license_id': None, 'version': None, 'is_accepted': False, 'is_published': False}], 'best_oa_location': None, 'sustainable_development_goals': [{'score': 0.65, 'display_name': 'Quality education', 'id': 'https://metadata.un.org/sdg/4'}], 'grants': [{'funder': 'https://openalex.org/F4320306076', 'funder_display_name': 'National Science Foundation', 'award_id': 'CNS-1726188,1925494'}], 'datasets': [], 'versions': [], 'referenced_works_count': 24, 'referenced_works': ['https://openalex.org/W122178443', 'https://openalex.org/W1538131130', 'https://openalex.org/W2064675550', 'https://openalex.org/W2070410525', 'https://openalex.org/W2732547613', 'https://openalex.org/W2776146695', 'https://openalex.org/W2790089582', 'https://openalex.org/W2883265831', 'https://openalex.org/W2892341857', 'https://openalex.org/W2896348597', 'https://openalex.org/W2897722020', 'https://openalex.org/W2910242301', 'https://openalex.org/W2912851808', 'https://openalex.org/W2949676527', 'https://openalex.org/W2963290013', 'https://openalex.org/W2998993395', 'https://openalex.org/W3006913750', 'https://openalex.org/W3014988774', 'https://openalex.org/W3092578031', 'https://openalex.org/W3133388486', 'https://openalex.org/W3150692941', 'https://openalex.org/W3155716648', 'https://openalex.org/W4210460113', 'https://openalex.org/W4211116959'], 'related_works': ['https://openalex.org/W4362499066', 'https://openalex.org/W4361732492', 'https://openalex.org/W4307195028', 'https://openalex.org/W4304128395', 'https://openalex.org/W4295309597', 'https://openalex.org/W4283697347', 'https://openalex.org/W4210794429', 'https://openalex.org/W3186919929', 'https://openalex.org/W3099765033', 'https://openalex.org/W2576264401'], 'abstract_inverted_index': {'Given': [0], 'a': [1], 'large': [2, 43], 'amount': [3], 'of': [4, 83, 101, 155, 193], 'data': [5], 'and': [6, 44, 79, 91, 109, 201], 'appropriate': [7], 'hyperparameters,': [8], 'deep': [9, 30, 84, 190], 'learning': [10, 31, 77, 85, 208, 220], 'techniques': [11, 33], 'can': [12, 26, 68], 'deliver': [13], 'impressive': [14], 'performance': [15], 'if': [16], 'several': [17], 'challenging': [18], 'issues': [19], 'with': [20, 46, 142, 186], 'training,': [21], 'such': [22], 'as': [23, 55], 'vanishing': [24], 'gradients,': [25], 'be': [27, 69], 'overcome.': [28], 'Often,': [29], 'training': [32, 64], 'produce': [34], 'suboptimal': [35], 'results': [36], 'because': [37, 209], 'the': [38, 63, 81, 99], 'parameter': [39], 'search': [40], 'space': [41], 'is': [42, 203], 'populated': [45], 'many': [47], 'less-than-ideal': [48], 'solutions.': [49], 'Automatic': [50], 'hyperparameter': [51], 'tuning': [52, 196], 'algorithms,': [53], 'known': [54], 'autotuners,': [56], 'offer': [57], 'an': [58, 156], 'attractive': [59], 'alternative': [60], 'for': [61, 140, 150, 168, 184], 'automating': [62], 'process,': [65], 'though': [66, 215], 'they': [67, 176], 'computationally': [70], 'expensive.': [71], 'Additionally,': [72], 'autotuners': [73], 'democratize': [74], 'state-of-the-art': [75], 'machine': [76, 146, 207, 219], 'approaches': [78], 'increase': [80, 129], 'accessibility': [82], 'technology': [86], 'to': [87, 138, 180], 'different': [88], 'scientific': [89], 'communities': [90], 'novice': [92], 'users.': [93], 'In': [94], 'this': [95], 'paper,': [96], 'we': [97, 128], 'investigate': [98], 'efficacy': [100], 'autotuning': [102, 115, 132], 'using': [103], 'Keras': [104], 'Tuner': [105], 'on': [106, 118, 124, 172, 198], 'both': [107], 'synthetic': [108, 119], 'real-world': [110], 'datasets.': [111], 'We': [112], 'show': [113], 'that': [114, 135], 'performed': [116], 'well': [117], 'datasets': [120], 'but': [121, 175], 'was': [122], 'inadequate': [123], 'real': [125], 'data.': [126], 'As': [127], 'model': [130], 'complexity,': [131], 'produces': [133, 211], 'errors': [134], 'are': [136, 166], 'tedious': [137], 'resolve': [139], 'those': [141], 'limited': [143], 'experience': [144, 202], 'in': [145, 189, 206], 'learning.': [147, 191], 'Avoiding': [148], 'overfitting,': [149], 'example,': [151], 'requires': [152, 217], 'extensive': [153, 218], 'knowledge': [154, 200], "algorithm's": [157], 'unique': [158], 'characteristics': [159], '(e.g.,': [160], 'adding': [161], 'dropout': [162], 'layers).': [163], 'Autotuning': [164], 'tools': [165], 'excellent': [167], 'creating': [169], 'baseline': [170], 'models': [171], 'new': [173], 'datasets,': [174], 'need': [177], 'more': [178], 'attention': [179], 'formulate': [181], 'optimal': [182], 'solutions': [183], 'end-users': [185], 'less': [187], 'background': [188], 'Because': [192], 'this,': [194], 'manual': [195], 'based': [197], 'domain': [199], 'still': [204], 'preferred': [205], 'it': [210, 216], 'better': [212], 'performance,': [213], 'even': [214], 'expertise.': [221]}, 'cited_by_api_url': 'https://api.openalex.org/works?filter=cites:W4210794429', 'counts_by_year': [{'year': 2024, 'cited_by_count': 8}, {'year': 2023, 'cited_by_count': 2}, {'year': 2022, 'cited_by_count': 4}], 'updated_date': '2024-09-09T11:11:38.576525', 'created_date': '2022-02-08'}
Publication Information

Basic Information

Access and Citation

AI Researcher Chatbot

Primary Location

Authors

Topics

Keywords

Related Works