Performance Measurement on Scale-Up and Scale-Out Hadoop with Remote and Local File Systems

Zhuozhao Li; Haiying Shen
{'id': 'https://openalex.org/W2571822082', 'doi': 'https://doi.org/10.1109/cloud.2016.0067', 'title': 'Performance Measurement on Scale-Up and Scale-Out Hadoop with Remote and Local File Systems', 'display_name': 'Performance Measurement on Scale-Up and Scale-Out Hadoop with Remote and Local File Systems', 'publication_year': 2016, 'publication_date': '2016-06-01', 'ids': {'openalex': 'https://openalex.org/W2571822082', 'doi': 'https://doi.org/10.1109/cloud.2016.0067', 'mag': '2571822082'}, 'language': 'en', 'primary_location': {'is_oa': False, 'landing_page_url': 'https://doi.org/10.1109/cloud.2016.0067', 'pdf_url': None, 'source': None, 'license': None, 'license_id': None, 'version': None, 'is_accepted': False, 'is_published': False}, 'type': 'article', 'type_crossref': 'proceedings-article', 'indexed_in': ['crossref'], 'open_access': {'is_oa': False, 'oa_status': 'closed', 'oa_url': None, 'any_repository_has_fulltext': False}, 'authorships': [{'author_position': 'first', 'author': {'id': 'https://openalex.org/A5087300252', 'display_name': 'Zhuozhao Li', 'orcid': 'https://orcid.org/0000-0003-1903-6428'}, 'institutions': [{'id': 'https://openalex.org/I8078737', 'display_name': 'Clemson University', 'ror': 'https://ror.org/037s24f05', 'country_code': 'US', 'type': 'education', 'lineage': ['https://openalex.org/I8078737']}], 'countries': ['US'], 'is_corresponding': False, 'raw_author_name': 'Zhuozhao Li', 'raw_affiliation_strings': ['Department of Electrical and Computer Engineering, Clemson University, Clemson, SC'], 'affiliations': [{'raw_affiliation_string': 'Department of Electrical and Computer Engineering, Clemson University, Clemson, SC', 'institution_ids': ['https://openalex.org/I8078737']}]}, {'author_position': 'last', 'author': {'id': 'https://openalex.org/A5064217355', 'display_name': 'Haiying Shen', 'orcid': 'https://orcid.org/0000-0002-7681-6255'}, 'institutions': [{'id': 'https://openalex.org/I8078737', 'display_name': 'Clemson University', 'ror': 'https://ror.org/037s24f05', 'country_code': 'US', 'type': 'education', 'lineage': ['https://openalex.org/I8078737']}], 'countries': ['US'], 'is_corresponding': False, 'raw_author_name': 'Haiying Shen', 'raw_affiliation_strings': ['Department of Electrical and Computer Engineering, Clemson University, Clemson, SC'], 'affiliations': [{'raw_affiliation_string': 'Department of Electrical and Computer Engineering, Clemson University, Clemson, SC', 'institution_ids': ['https://openalex.org/I8078737']}]}], 'institution_assertions': [], 'countries_distinct_count': 1, 'institutions_distinct_count': 1, 'corresponding_author_ids': [], 'corresponding_institution_ids': [], 'apc_list': None, 'apc_paid': None, 'fwci': 1.829, 'has_fulltext': True, 'fulltext_origin': 'ngrams', 'cited_by_count': 9, 'citation_normalized_percentile': {'value': 0.836477, 'is_in_top_1_percent': False, 'is_in_top_10_percent': False}, 'cited_by_percentile_year': {'min': 86, 'max': 87}, 'biblio': {'volume': None, 'issue': None, 'first_page': None, 'last_page': None}, 'is_retracted': False, 'is_paratext': False, 'primary_topic': {'id': 'https://openalex.org/T10101', 'display_name': 'Cloud Computing and Big Data Technologies', 'score': 0.9999, 'subfield': {'id': 'https://openalex.org/subfields/1710', 'display_name': 'Information Systems'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, 'topics': [{'id': 'https://openalex.org/T10101', 'display_name': 'Cloud Computing and Big Data Technologies', 'score': 0.9999, 'subfield': {'id': 'https://openalex.org/subfields/1710', 'display_name': 'Information Systems'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, {'id': 'https://openalex.org/T11181', 'display_name': 'Distributed Storage Systems and Network Coding', 'score': 0.9985, 'subfield': {'id': 'https://openalex.org/subfields/1705', 'display_name': 'Computer Networks and Communications'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, {'id': 'https://openalex.org/T11478', 'display_name': 'Content-Centric Networking for Information Delivery', 'score': 0.9968, 'subfield': {'id': 'https://openalex.org/subfields/1705', 'display_name': 'Computer Networks and Communications'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}], 'keywords': [{'id': 'https://openalex.org/keywords/hadoop', 'display_name': 'Hadoop', 'score': 0.574021}, {'id': 'https://openalex.org/keywords/mapreduce', 'display_name': 'MapReduce', 'score': 0.562165}, {'id': 'https://openalex.org/keywords/scalability', 'display_name': 'Scalability', 'score': 0.522157}, {'id': 'https://openalex.org/keywords/parallel-computing', 'display_name': 'Parallel Computing', 'score': 0.513789}], 'concepts': [{'id': 'https://openalex.org/C41008148', 'wikidata': 'https://www.wikidata.org/wiki/Q21198', 'display_name': 'Computer science', 'level': 0, 'score': 0.7242869}, {'id': 'https://openalex.org/C2778755073', 'wikidata': 'https://www.wikidata.org/wiki/Q10858537', 'display_name': 'Scale (ratio)', 'level': 2, 'score': 0.7091116}, {'id': 'https://openalex.org/C111919701', 'wikidata': 'https://www.wikidata.org/wiki/Q9135', 'display_name': 'Operating system', 'level': 1, 'score': 0.49080953}, {'id': 'https://openalex.org/C77088390', 'wikidata': 'https://www.wikidata.org/wiki/Q8513', 'display_name': 'Database', 'level': 1, 'score': 0.39301687}, {'id': 'https://openalex.org/C205649164', 'wikidata': 'https://www.wikidata.org/wiki/Q1071', 'display_name': 'Geography', 'level': 0, 'score': 0.05236727}, {'id': 'https://openalex.org/C58640448', 'wikidata': 'https://www.wikidata.org/wiki/Q42515', 'display_name': 'Cartography', 'level': 1, 'score': 0.0}], 'mesh': [], 'locations_count': 1, 'locations': [{'is_oa': False, 'landing_page_url': 'https://doi.org/10.1109/cloud.2016.0067', 'pdf_url': None, 'source': None, 'license': None, 'license_id': None, 'version': None, 'is_accepted': False, 'is_published': False}], 'best_oa_location': None, 'sustainable_development_goals': [{'score': 0.57, 'id': 'https://metadata.un.org/sdg/8', 'display_name': 'Decent work and economic growth'}], 'grants': [], 'datasets': [], 'versions': [], 'referenced_works_count': 12, 'referenced_works': ['https://openalex.org/W1575777113', 'https://openalex.org/W1676944998', 'https://openalex.org/W1977110635', 'https://openalex.org/W2067005380', 'https://openalex.org/W2096125134', 'https://openalex.org/W2104993419', 'https://openalex.org/W2124256298', 'https://openalex.org/W2141249441', 'https://openalex.org/W2150478767', 'https://openalex.org/W2163291889', 'https://openalex.org/W2173213060', 'https://openalex.org/W2293423600'], 'related_works': ['https://openalex.org/W4391913857', 'https://openalex.org/W2748952813', 'https://openalex.org/W2530322880', 'https://openalex.org/W2478288626', 'https://openalex.org/W2390279801', 'https://openalex.org/W2382290278', 'https://openalex.org/W2376932109', 'https://openalex.org/W2358668433', 'https://openalex.org/W2350741829', 'https://openalex.org/W2001405890'], 'abstract_inverted_index': {'MapReduce': [0, 24], 'is': [1, 49, 56, 168], 'a': [2, 41, 52, 111, 127, 172], 'popular': [3], 'computing': [4], 'model': [5], 'for': [6, 51, 174], 'parallel': [7], 'data': [8, 97, 144], 'processing': [9], 'on': [10, 83, 118, 162], 'large-scale': [11], 'datasets,': [12], 'which': [13, 55], 'can': [14, 36, 99], 'vary': [15], 'from': [16, 101], 'gigabytes': [17], 'to': [18, 39, 170, 176, 181], 'terabytes': [19], 'and': [20, 67, 72, 120, 126, 136, 151, 195], 'petabytes.': [21], 'Though': [22], 'Hadoop': [23, 27, 69], 'normally': [25], 'uses': [26], 'Distributed': [28], 'File': [29], 'System': [30], '(HDFS)': [31], 'local': [32, 73, 196], 'file': [33, 43, 74, 129, 149], 'system,': [34], 'it': [35], 'be': [37], 'configured': [38, 123], 'use': [40], 'remote': [42, 71, 128, 194], 'system.': [44], 'Then,': [45], 'an': [46], 'interesting': [47], 'question': [48], 'raised:': [50], 'given': [53], 'application,': [54], 'the': [57, 62, 102, 146, 152, 157, 163, 178, 189], 'best': [58, 179], 'running': [59], 'platform': [60, 180], 'among': [61], 'different': [63, 85, 93, 103, 116, 139, 160, 164, 183, 186], 'combinations': [64], 'of': [65, 87, 115, 148, 154, 159], 'scale-up': [66, 119], 'scale-out': [68], 'with': [70, 92, 124, 185], 'systems.': [75], 'However,': [76], 'there': [77], 'has': [78], 'been': [79], 'no': [80], 'previous': [81], 'research': [82], 'how': [84, 138], 'types': [86], 'applications': [88, 117, 161, 184], '(e.g.,': [89, 95, 142], 'CPU-intensive,': [90], 'data-intensive)': [91], 'characteristics': [94, 141, 187], 'input': [96, 143], 'size)': [98], 'benefit': [100], 'platforms.': [104, 165], 'Thus,': [105], 'in': [106, 188], 'this': [107], 'paper,': [108], 'we': [109], 'conduct': [110], 'comprehensive': [112], 'performance': [113, 158], 'measurement': [114], 'scaleout': [121], 'clusters': [122], 'HDFS': [125], 'system': [130], '(i.e.,': [131], 'OFS),': [132], 'respectively.': [133], 'We': [134], 'identify': [135], 'study': [137, 167], 'job': [140], 'size,': [145], 'number': [147], 'reads/writes,': [150], 'amount': [153], 'computations)': [155], 'affect': [156], 'This': [166], 'expected': [169], 'provide': [171], 'guidance': [173], 'users': [175], 'choose': [177], 'run': [182], 'environment': [190], 'that': [191], 'provides': [192], 'both': [193], 'storage,': [197], 'such': [198], 'as': [199], 'HPC': [200], 'cluster.': [201]}, 'cited_by_api_url': 'https://api.openalex.org/works?filter=cites:W2571822082', 'counts_by_year': [{'year': 2022, 'cited_by_count': 1}, {'year': 2020, 'cited_by_count': 1}, {'year': 2019, 'cited_by_count': 2}, {'year': 2018, 'cited_by_count': 3}, {'year': 2017, 'cited_by_count': 1}, {'year': 2016, 'cited_by_count': 1}], 'updated_date': '2024-09-15T08:52:16.563597', 'created_date': '2017-01-26'}
Publication Information

Basic Information

Access and Citation

AI Researcher Chatbot

Primary Location

Authors

Topics

Keywords

Related Works