Title: A tasks reordering model to reduce transfers overhead on GPUs
Abstract: The compute capabilities of current GPUs allow exploiting concurrency when several independent tasks are launched simultaneously. These tasks are typically composed of data transfer commands and kernel computation commands. In this paper we develop a run-time approach to optimize the concurrency between data transfers and kernel computations in a multithreaded scenario where each CPU thread sends tasks to the GPU. Our solution is based on a temporal execution model for concurrent tasks that is able to establish the task execution order that minimizes the total execution time, including data transfers. Moreover, we have developed a heuristic to select the best order, which improves on the execution time achieved by the hardware scheduler of current NVIDIA cards. Under real workloads, our approach obtains performance improvements of up to 19% with respect to execution using multiple hardware queues managed by Hyper-Q.
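The effect of task order on total execution time can be illustrated with a simplified simulation. The sketch below is not the paper's actual model; it assumes a GPU with one compute engine and independent H2D/D2H copy engines, each serving commands in FIFO order, and finds the best order by brute force over permutations (feasible only for small task counts; the paper instead proposes a heuristic):

```python
from itertools import permutations

def makespan(tasks):
    """Simulate total time for tasks issued in the given order.

    Each task is a tuple (h2d, kernel, d2h) of command durations.
    Assumes one H2D copy engine, one compute engine, and one D2H
    copy engine, each processing its command queue in FIFO order,
    so transfers of later tasks overlap kernels of earlier ones.
    """
    h2d_free = k_free = d2h_free = 0.0
    for h2d, kernel, d2h in tasks:
        # H2D transfer waits only for the copy engine to be free.
        h2d_end = h2d_free + h2d
        h2d_free = h2d_end
        # Kernel waits for its input transfer and the compute engine.
        k_end = max(h2d_end, k_free) + kernel
        k_free = k_end
        # D2H transfer waits for the kernel and the D2H engine.
        d2h_end = max(k_end, d2h_free) + d2h
        d2h_free = d2h_end
    return d2h_free

def best_order(tasks):
    """Exhaustively search for the order minimizing the makespan."""
    return min(permutations(tasks), key=makespan)

# Two tasks: issue order changes the total time from 7 to 6,
# because starting with the short-transfer/long-kernel task
# hides the other task's transfer behind the kernel.
tasks = [(2, 1, 1), (1, 3, 1)]
print(makespan(tasks))              # order as given
print(makespan(best_order(tasks)))  # best order found
```

Even in this toy setting the gap between a poor order and the best one is visible, which is the opportunity the paper's reordering heuristic exploits at run time without exhaustive search.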
Publication Year: 2017
Publication Date: 2017-06-29
Language: en
Type: article
Indexed In: ['crossref']
Access and Citation
Cited By Count: 6