Content based file type detection algorithms

Name: Work Video:
Duration: 3 min 30 s
Melinda McDaniel; Mohammad Heydari
{'id': 'https://openalex.org/W2071484336', 'doi': 'https://doi.org/10.1109/hicss.2003.1174905', 'title': 'Content based file type detection algorithms', 'display_name': 'Content based file type detection algorithms', 'publication_year': 2003, 'publication_date': '2003-01-01', 'ids': {'openalex': 'https://openalex.org/W2071484336', 'doi': 'https://doi.org/10.1109/hicss.2003.1174905', 'mag': '2071484336'}, 'language': 'en', 'primary_location': {'is_oa': False, 'landing_page_url': 'https://doi.org/10.1109/hicss.2003.1174905', 'pdf_url': None, 'source': None, 'license': None, 'license_id': None, 'version': None, 'is_accepted': False, 'is_published': False}, 'type': 'article', 'type_crossref': 'proceedings-article', 'indexed_in': ['crossref'], 'open_access': {'is_oa': False, 'oa_status': 'closed', 'oa_url': None, 'any_repository_has_fulltext': False}, 'authorships': [{'author_position': 'first', 'author': {'id': 'https://openalex.org/A5025751395', 'display_name': 'Melinda McDaniel', 'orcid': 'https://orcid.org/0000-0002-1652-3389'}, 'institutions': [{'id': 'https://openalex.org/I11883440', 'display_name': 'James Madison University', 'ror': 'https://ror.org/028pmsz77', 'country_code': 'US', 'type': 'education', 'lineage': ['https://openalex.org/I11883440']}], 'countries': ['US'], 'is_corresponding': False, 'raw_author_name': 'M. McDaniel', 'raw_affiliation_strings': ['[Department of Computer Science, James Madison University, Harrisonburg, VA, USA]'], 'affiliations': [{'raw_affiliation_string': '[Department of Computer Science, James Madison University, Harrisonburg, VA, USA]', 'institution_ids': ['https://openalex.org/I11883440']}]}, {'author_position': 'last', 'author': {'id': 'https://openalex.org/A5074418418', 'display_name': 'Mohammad Heydari', 'orcid': 'https://orcid.org/0000-0001-6764-4394'}, 'institutions': [{'id': 'https://openalex.org/I11883440', 'display_name': 'James Madison University', 'ror': 'https://ror.org/028pmsz77', 'country_code': 'US', 'type': 'education', 'lineage': ['https://openalex.org/I11883440']}], 'countries': ['US'], 'is_corresponding': False, 'raw_author_name': 'M.H. Heydari', 'raw_affiliation_strings': ['[Department of Computer Science, James Madison University, Harrisonburg, VA, USA]'], 'affiliations': [{'raw_affiliation_string': '[Department of Computer Science, James Madison University, Harrisonburg, VA, USA]', 'institution_ids': ['https://openalex.org/I11883440']}]}], 'institution_assertions': [], 'countries_distinct_count': 1, 'institutions_distinct_count': 1, 'corresponding_author_ids': [], 'corresponding_institution_ids': [], 'apc_list': None, 'apc_paid': None, 'fwci': 1.666, 'has_fulltext': True, 'fulltext_origin': 'ngrams', 'cited_by_count': 149, 'citation_normalized_percentile': {'value': 0.954163, 'is_in_top_1_percent': False, 'is_in_top_10_percent': True}, 'cited_by_percentile_year': {'min': 97, 'max': 98}, 'biblio': {'volume': None, 'issue': None, 'first_page': None, 'last_page': None}, 'is_retracted': False, 'is_paratext': False, 'primary_topic': {'id': 'https://openalex.org/T12034', 'display_name': 'Digital and Cyber Forensics', 'score': 0.9993, 'subfield': {'id': 'https://openalex.org/subfields/1710', 'display_name': 'Information Systems'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, 'topics': [{'id': 'https://openalex.org/T12034', 'display_name': 'Digital and Cyber Forensics', 'score': 0.9993, 'subfield': {'id': 'https://openalex.org/subfields/1710', 'display_name': 'Information Systems'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, {'id': 'https://openalex.org/T11241', 'display_name': 'Advanced Malware Detection Techniques', 'score': 0.9987, 'subfield': {'id': 'https://openalex.org/subfields/1711', 'display_name': 'Signal Processing'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, {'id': 'https://openalex.org/T12357', 'display_name': 'Digital Media Forensic Detection', 'score': 0.987, 'subfield': {'id': 'https://openalex.org/subfields/1707', 'display_name': 'Computer Vision and Pattern Recognition'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}], 'keywords': [{'id': 'https://openalex.org/keywords/torrent-file', 'display_name': 'Torrent file', 'score': 0.6165377}, {'id': 'https://openalex.org/keywords/file-format', 'display_name': 'File format', 'score': 0.5672171}, {'id': 'https://openalex.org/keywords/identification', 'display_name': 'Identification', 'score': 0.41495}], 'concepts': [{'id': 'https://openalex.org/C41008148', 'wikidata': 'https://www.wikidata.org/wiki/Q21198', 'display_name': 'Computer science', 'level': 0, 'score': 0.8383404}, {'id': 'https://openalex.org/C26656859', 'wikidata': 'https://www.wikidata.org/wiki/Q4089244', 'display_name': 'File system fragmentation', 'level': 4, 'score': 0.6615194}, {'id': 'https://openalex.org/C21729314', 'wikidata': 'https://www.wikidata.org/wiki/Q7896858', 'display_name': 'Unix file types', 'level': 4, 'score': 0.64242613}, {'id': 'https://openalex.org/C11413529', 'wikidata': 'https://www.wikidata.org/wiki/Q8366', 'display_name': 'Algorithm', 'level': 1, 'score': 0.6379782}, {'id': 'https://openalex.org/C166807848', 'wikidata': 'https://www.wikidata.org/wiki/Q3445065', 'display_name': 'Versioning file system', 'level': 4, 'score': 0.62793696}, {'id': 'https://openalex.org/C40350719', 'wikidata': 'https://www.wikidata.org/wiki/Q2451637', 'display_name': 'Torrent file', 'level': 4, 'score': 0.6165377}, {'id': 'https://openalex.org/C97250363', 'wikidata': 'https://www.wikidata.org/wiki/Q235557', 'display_name': 'File format', 'level': 2, 'score': 0.5672171}, {'id': 'https://openalex.org/C43364308', 'wikidata': 'https://www.wikidata.org/wiki/Q8799', 'display_name': 'Byte', 'level': 2, 'score': 0.54898566}, {'id': 'https://openalex.org/C95637964', 'wikidata': 'https://www.wikidata.org/wiki/Q82753', 'display_name': 'Computer file', 'level': 2, 'score': 0.5410468}, {'id': 'https://openalex.org/C180500224', 'wikidata': 'https://www.wikidata.org/wiki/Q1412592', 'display_name': 'File Control Block', 'level': 4, 'score': 0.4895526}, {'id': 'https://openalex.org/C111919701', 'wikidata': 'https://www.wikidata.org/wiki/Q9135', 'display_name': 'Operating system', 'level': 1, 'score': 0.46112177}, {'id': 'https://openalex.org/C77088390', 'wikidata': 'https://www.wikidata.org/wiki/Q8513', 'display_name': 'Database', 'level': 1, 'score': 0.42318633}, {'id': 'https://openalex.org/C171730128', 'wikidata': 'https://www.wikidata.org/wiki/Q5227290', 'display_name': 'Data file', 'level': 2, 'score': 0.41615838}, {'id': 'https://openalex.org/C116834253', 'wikidata': 'https://www.wikidata.org/wiki/Q2039217', 'display_name': 'Identification (biology)', 'level': 2, 'score': 0.41495}, {'id': 'https://openalex.org/C124101348', 'wikidata': 'https://www.wikidata.org/wiki/Q172491', 'display_name': 'Data mining', 'level': 1, 'score': 0.37341738}, {'id': 'https://openalex.org/C193769178', 'wikidata': 'https://www.wikidata.org/wiki/Q14679', 'display_name': 'Device file', 'level': 3, 'score': 0.36186892}, {'id': 'https://openalex.org/C13674803', 'wikidata': 'https://www.wikidata.org/wiki/Q7627301', 'display_name': 'Stub file', 'level': 3, 'score': 0.26999968}, {'id': 'https://openalex.org/C59822182', 'wikidata': 'https://www.wikidata.org/wiki/Q441', 'display_name': 'Botany', 'level': 1, 'score': 0.0}, {'id': 'https://openalex.org/C86803240', 'wikidata': 'https://www.wikidata.org/wiki/Q420', 'display_name': 'Biology', 'level': 0, 'score': 0.0}], 'mesh': [], 'locations_count': 1, 'locations': [{'is_oa': False, 'landing_page_url': 'https://doi.org/10.1109/hicss.2003.1174905', 'pdf_url': None, 'source': None, 'license': None, 'license_id': None, 'version': None, 'is_accepted': False, 'is_published': False}], 'best_oa_location': None, 'sustainable_development_goals': [{'score': 0.57, 'display_name': 'Peace, justice, and strong institutions', 'id': 'https://metadata.un.org/sdg/16'}], 'grants': [], 'datasets': [], 'versions': [], 'referenced_works_count': 2, 'referenced_works': ['https://openalex.org/W3195259938', 'https://openalex.org/W329998640'], 'related_works': ['https://openalex.org/W4313226861', 'https://openalex.org/W4233195927', 'https://openalex.org/W3199587876', 'https://openalex.org/W2559378645', 'https://openalex.org/W2342627122', 'https://openalex.org/W2092235977', 'https://openalex.org/W2038900077', 'https://openalex.org/W2023412278', 'https://openalex.org/W1990666357', 'https://openalex.org/W1572832495'], 'abstract_inverted_index': {'Identifying': [0], 'the': [1, 28, 63, 67, 108, 155, 169], 'true': [2, 68], 'type': [3, 17, 69, 163], 'of': [4, 15, 36, 50, 57, 70, 110, 141, 157, 177, 180], 'a': [5, 10, 55], 'computer': [6, 142], 'file': [7, 16, 21, 33, 51, 100, 181], 'can': [8], 'be': [9, 129], 'difficult': [11], 'problem.': [12], 'Previous': [13], 'methods': [14, 38], 'recognition': [18], 'include': [19], 'fixed': [20, 23], 'extensions,': [22], '"magic': [24], 'numbers"': [25], 'stored': [26], 'with': [27, 81], 'files,': [29, 60], 'and': [30, 99, 175], 'proprietary': [31], 'descriptive': [32], 'wrappers.': [34], 'All': [35], 'these': [37, 111], 'have': [39], 'significant': [40], 'limitations.': [41], 'This': [42], 'paper': [43], 'proposes': [44], 'algorithms': [45, 89, 127], 'for': [46, 159, 172], 'automatically': [47], 'generating': [48], '"fingerprints"': [49], 'types': [52, 156], 'based': [53, 73, 90], 'on': [54, 74], 'set': [56], 'known': [58], 'input': [59], 'then': [61], 'using': [62], 'fingerprints': [64], 'to': [65, 106, 118, 153, 168], 'recognize': [66], 'unknown': [71], 'files': [72, 158, 178], 'their': [75], 'content,': [76], 'rather': [77], 'than': [78], 'metadata': [79], 'associated': [80], 'them.': [82], 'Recognition': [83], 'is': [84, 165], 'performed': [85], 'by': [86, 131], 'three': [87], 'different': [88], 'on:': [91], 'byte': [92, 95], 'frequency': [93, 96], 'analysis,': [94, 98], 'cross-correlation': [97], 'header/trailer': [101], 'analysis.': [102], 'Tests': [103], 'were': [104], 'run': [105], 'measure': [107], 'accuracy': [109, 114], 'algorithms.': [112], 'The': [113], 'varied': [115], 'from': [116], '23%': [117], '96%': [119], 'depending': [120], 'upon': [121], 'which': [122], 'algorithm': [123], 'was': [124], 'used.': [125], 'These': [126], 'could': [128], 'used': [130], 'virus': [132], 'scanning': [133], 'packages,': [134], 'firewalls,': [135], 'intrusion': [136], 'detection': [137, 164], 'systems,': [138], 'forensic': [139], 'analyses': [140], 'hard': [143], 'drives,': [144], 'Web': [145], 'browsers,': [146], 'or': [147], 'any': [148], 'other': [149], 'program': [150], 'that': [151], 'needs': [152], 'identify': [154], 'proper': [160], 'operation.': [161], 'File': [162], 'also': [166], 'important': [167], 'operating': [170], 'systems': [171], 'correct': [173], 'identification': [174], 'handling': [176], 'regardless': [179], 'extension.': [182]}, 'cited_by_api_url': 'https://api.openalex.org/works?filter=cites:W2071484336', 'counts_by_year': [{'year': 2024, 'cited_by_count': 3}, {'year': 2023, 'cited_by_count': 6}, {'year': 2022, 'cited_by_count': 6}, {'year': 2021, 'cited_by_count': 10}, {'year': 2020, 'cited_by_count': 12}, {'year': 2019, 'cited_by_count': 8}, {'year': 2018, 'cited_by_count': 9}, {'year': 2016, 'cited_by_count': 9}, {'year': 2015, 'cited_by_count': 9}, {'year': 2014, 'cited_by_count': 11}, {'year': 2013, 'cited_by_count': 9}, {'year': 2012, 'cited_by_count': 7}], 'updated_date': '2025-01-10T23:38:28.745917', 'created_date': '2016-06-24'}
Publication Information

Basic Information

Access and Citation

AI Researcher Chatbot

Primary Location

Authors

Topics

Keywords

Related Works