Title: Analyzing Big Environmental Audio with Frequency-Preserving Autoencoders
Abstract: Continuous audio recordings play an increasingly important role in conservation and biodiversity monitoring; however, listening to these recordings is often infeasible, as they can span thousands of hours. Automating their analysis with machine learning is therefore in high demand, but these algorithms require a feature representation. Several methods for generating feature representations of such data have been developed, using techniques such as domain-specific features and deep learning. However, domain-specific features are unlikely to be an ideal representation of the data, and deep learning methods often require extensively labeled data. In this paper, we propose a method for generating a frequency-preserving autoencoder-based feature representation for unlabeled ecological audio. We evaluate multiple frequency-preserving autoencoder-based feature representations on a hierarchical clustering sample task, comparing them to a basic autoencoder feature representation, MFCCs, and spectral acoustic indices. Experimental results show that some of these non-square autoencoder architectures compare well with the existing feature representations. This novel method offers ecologists a fast, general way to generate a feature representation of their audio that does not require extensively labeled data.
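The core idea behind a frequency-preserving (non-square) architecture can be illustrated with a minimal sketch: downsampling a spectrogram along the time axis only, so the frequency resolution survives into the encoded representation. This is a hypothetical illustration in NumPy, not the paper's actual autoencoder; the array sizes and pooling factor are assumptions chosen for the example.

```python
import numpy as np

# Hypothetical spectrogram: 256 frequency bins x 512 time frames.
spec = np.random.rand(256, 512)

def pool_time_only(x, k=4):
    """Non-square pooling: average over the time axis only,
    leaving the frequency axis untouched."""
    f, t = x.shape
    # Trim any trailing frames that don't fill a window of k,
    # then average each group of k consecutive frames.
    return x[:, : (t // k) * k].reshape(f, t // k, k).mean(axis=2)

encoded = pool_time_only(spec)
print(encoded.shape)  # all 256 frequency bins preserved
```

A square pooling window would instead halve (or quarter) the frequency axis at every layer, discarding exactly the spectral structure that distinguishes many environmental sounds; pooling only in time keeps that structure available to the encoder.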