Title: Committee-Based Sampling For Training Probabilistic Classifiers
Abstract: In many real-world learning tasks, it is expensive to acquire a sufficient number of labeled examples for training. This paper proposes a general method for efficiently training probabilistic classifiers, by selecting for training only the more informative examples in a stream of unlabeled examples. The method, committee-based sampling, evaluates the informativeness of an example by measuring the degree of disagreement between several model variants. These variants (the committee) are drawn randomly from a probability distribution conditioned on the training set selected so far (Monte-Carlo sampling). The method is particularly attractive because it evaluates the expected information gain from a training example implicitly, making the method both easy to implement and generally applicable. We further show how to apply committee-based sampling for training Hidden Markov Model classifiers, which are commonly used for complex classification tasks. The method was implemented and tested for the task of tagging words in natural language sentences with parts-of-speech. Experimental evaluation of committee-based sampling versus standard sequential training showed a substantial improvement in training efficiency.
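To make the abstract's selection loop concrete, the sketch below illustrates committee-based sampling for a simple multinomial Naive Bayes classifier: committee members are drawn by sampling model parameters from Dirichlet posteriors conditioned on the labeled data seen so far, and an incoming example is queried for a label only if the committee's votes disagree sufficiently (measured here by vote entropy). All function names, the choice of classifier, and the threshold are illustrative assumptions, not taken from the paper, which applies the idea to Hidden Markov Model taggers.

```python
# Minimal sketch of committee-based (selective) sampling, assuming a
# multinomial Naive Bayes classifier. Names and thresholds are hypothetical.
import math
import random
from collections import defaultdict

def sample_committee_member(labeled, n_classes, n_features, alpha=1.0):
    """Draw one model variant: class priors and per-class feature distributions
    sampled from Dirichlet posteriors conditioned on the current training set."""
    class_counts = [alpha] * n_classes
    feat_counts = [[alpha] * n_features for _ in range(n_classes)]
    for x, y in labeled:                 # x is a list of feature indices, y a class index
        class_counts[y] += 1
        for f in x:
            feat_counts[y][f] += 1
    # Sample a Dirichlet vector by normalizing independent Gamma draws.
    priors = [random.gammavariate(c, 1.0) for c in class_counts]
    s = sum(priors)
    priors = [p / s for p in priors]
    likelihoods = []
    for y in range(n_classes):
        g = [random.gammavariate(c, 1.0) for c in feat_counts[y]]
        t = sum(g)
        likelihoods.append([v / t for v in g])
    return priors, likelihoods

def classify(model, x, n_classes):
    """Return the most probable class for example x under one sampled model."""
    priors, likelihoods = model
    scores = []
    for y in range(n_classes):
        score = math.log(priors[y])
        for f in x:
            score += math.log(likelihoods[y][f])
        scores.append(score)
    return max(range(n_classes), key=lambda y: scores[y])

def vote_entropy(votes):
    """Disagreement measure: entropy of the committee's vote distribution."""
    counts = defaultdict(int)
    for v in votes:
        counts[v] += 1
    k = len(votes)
    return -sum((c / k) * math.log(c / k) for c in counts.values())

def select_for_labeling(x, labeled, n_classes, n_features,
                        committee_size=5, threshold=0.5):
    """Query a label for x only if committee disagreement is high enough."""
    committee = [sample_committee_member(labeled, n_classes, n_features)
                 for _ in range(committee_size)]
    votes = [classify(m, x, n_classes) for m in committee]
    return vote_entropy(votes) >= threshold
```

In a streaming setting, `select_for_labeling` would be called on each unlabeled example in turn; examples on which the sampled model variants agree are skipped, so labeling effort concentrates on the examples expected to carry the most information.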
Publication Year: 1995
Publication Date: 1995-01-01
Language: en
Type: book-chapter
Indexed In: Crossref
Access and Citation
Cited By Count: 488