Context-Aware Captions from Context-Agnostic Supervision

Name: Work Video:
Duration: 3 min 30 s
Ramakrishna Vedantam; Samy Bengio; Kevin Murphy; Devi Parikh; Gal Chechik
{'id': 'https://openalex.org/W2574790321', 'doi': 'https://doi.org/10.1109/cvpr.2017.120', 'title': 'Context-Aware Captions from Context-Agnostic Supervision', 'display_name': 'Context-Aware Captions from Context-Agnostic Supervision', 'publication_year': 2017, 'publication_date': '2017-07-01', 'ids': {'openalex': 'https://openalex.org/W2574790321', 'doi': 'https://doi.org/10.1109/cvpr.2017.120', 'mag': '2574790321'}, 'language': 'en', 'primary_location': {'is_oa': False, 'landing_page_url': 'https://doi.org/10.1109/cvpr.2017.120', 'pdf_url': None, 'source': None, 'license': None, 'license_id': None, 'version': None, 'is_accepted': False, 'is_published': False}, 'type': 'article', 'type_crossref': 'proceedings-article', 'indexed_in': ['crossref'], 'open_access': {'is_oa': True, 'oa_status': 'green', 'oa_url': 'https://arxiv.org/pdf/1701.02870', 'any_repository_has_fulltext': True}, 'authorships': [{'author_position': 'first', 'author': {'id': 'https://openalex.org/A5041797310', 'display_name': 'Ramakrishna Vedantam', 'orcid': None}, 'institutions': [{'id': 'https://openalex.org/I859038795', 'display_name': 'Virginia Tech', 'ror': 'https://ror.org/02smfhw86', 'country_code': 'US', 'type': 'education', 'lineage': ['https://openalex.org/I859038795']}], 'countries': ['US'], 'is_corresponding': False, 'raw_author_name': 'Ramakrishna Vedantam', 'raw_affiliation_strings': ['Virginia Tech'], 'affiliations': [{'raw_affiliation_string': 'Virginia Tech', 'institution_ids': ['https://openalex.org/I859038795']}]}, {'author_position': 'middle', 'author': {'id': 'https://openalex.org/A5017529415', 'display_name': 'Samy Bengio', 'orcid': None}, 'institutions': [{'id': 'https://openalex.org/I1291425158', 'display_name': 'Google (United States)', 'ror': 'https://ror.org/00njsd438', 'country_code': 'US', 'type': 'company', 'lineage': ['https://openalex.org/I1291425158', 'https://openalex.org/I4210128969']}], 'countries': ['US'], 'is_corresponding': False, 'raw_author_name': 'Samy Bengio', 'raw_affiliation_strings': ['Google'], 'affiliations': [{'raw_affiliation_string': 'Google', 'institution_ids': ['https://openalex.org/I1291425158']}]}, {'author_position': 'middle', 'author': {'id': 'https://openalex.org/A5002713363', 'display_name': 'Kevin Murphy', 'orcid': 'https://orcid.org/0000-0001-5196-7236'}, 'institutions': [{'id': 'https://openalex.org/I1291425158', 'display_name': 'Google (United States)', 'ror': 'https://ror.org/00njsd438', 'country_code': 'US', 'type': 'company', 'lineage': ['https://openalex.org/I1291425158', 'https://openalex.org/I4210128969']}], 'countries': ['US'], 'is_corresponding': False, 'raw_author_name': 'Kevin Murphy', 'raw_affiliation_strings': ['Google'], 'affiliations': [{'raw_affiliation_string': 'Google', 'institution_ids': ['https://openalex.org/I1291425158']}]}, {'author_position': 'middle', 'author': {'id': 'https://openalex.org/A5050342343', 'display_name': 'Devi Parikh', 'orcid': None}, 'institutions': [{'id': 'https://openalex.org/I130701444', 'display_name': 'Georgia Institute of Technology', 'ror': 'https://ror.org/01zkghx44', 'country_code': 'US', 'type': 'education', 'lineage': ['https://openalex.org/I130701444']}], 'countries': ['US'], 'is_corresponding': False, 'raw_author_name': 'Devi Parikh', 'raw_affiliation_strings': ['Georgia Institute of Technology'], 'affiliations': [{'raw_affiliation_string': 'Georgia Institute of Technology', 'institution_ids': ['https://openalex.org/I130701444']}]}, {'author_position': 'last', 'author': {'id': 'https://openalex.org/A5045719865', 'display_name': 'Gal Chechik', 'orcid': 'https://orcid.org/0000-0001-9164-5303'}, 'institutions': [{'id': 'https://openalex.org/I1291425158', 'display_name': 'Google (United States)', 'ror': 'https://ror.org/00njsd438', 'country_code': 'US', 'type': 'company', 'lineage': ['https://openalex.org/I1291425158', 'https://openalex.org/I4210128969']}], 'countries': ['US'], 'is_corresponding': False, 'raw_author_name': 'Gal Chechik', 'raw_affiliation_strings': ['Google'], 'affiliations': [{'raw_affiliation_string': 'Google', 'institution_ids': ['https://openalex.org/I1291425158']}]}], 'institution_assertions': [], 'countries_distinct_count': 1, 'institutions_distinct_count': 3, 'corresponding_author_ids': [], 'corresponding_institution_ids': [], 'apc_list': None, 'apc_paid': None, 'fwci': 4.164, 'has_fulltext': True, 'fulltext_origin': 'ngrams', 'cited_by_count': 112, 'citation_normalized_percentile': {'value': 0.999862, 'is_in_top_1_percent': True, 'is_in_top_10_percent': True}, 'cited_by_percentile_year': {'min': 98, 'max': 99}, 'biblio': {'volume': None, 'issue': None, 'first_page': '1070', 'last_page': '1079'}, 'is_retracted': False, 'is_paratext': False, 'primary_topic': {'id': 'https://openalex.org/T11714', 'display_name': 'Multimodal Machine Learning Applications', 'score': 1.0, 'subfield': {'id': 'https://openalex.org/subfields/1707', 'display_name': 'Computer Vision and Pattern Recognition'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, 'topics': [{'id': 'https://openalex.org/T11714', 'display_name': 'Multimodal Machine Learning Applications', 'score': 1.0, 'subfield': {'id': 'https://openalex.org/subfields/1707', 'display_name': 'Computer Vision and Pattern Recognition'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, {'id': 'https://openalex.org/T10812', 'display_name': 'Human Pose and Action Recognition', 'score': 0.9959, 'subfield': {'id': 'https://openalex.org/subfields/1707', 'display_name': 'Computer Vision and Pattern Recognition'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, {'id': 'https://openalex.org/T11307', 'display_name': 'Domain Adaptation and Few-Shot Learning', 'score': 0.9918, 'subfield': {'id': 'https://openalex.org/subfields/1702', 'display_name': 'Artificial Intelligence'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}], 'keywords': [{'id': 'https://openalex.org/keywords/closed-captioning', 'display_name': 'Closed captioning', 'score': 0.90928733}, {'id': 'https://openalex.org/keywords/discriminative-model', 'display_name': 'Discriminative model', 'score': 0.9067759}, {'id': 'https://openalex.org/keywords/context-model', 'display_name': 'Context model', 'score': 0.44135132}, {'id': 'https://openalex.org/keywords/ground-truth', 'display_name': 'Ground truth', 'score': 0.41160044}], 'concepts': [{'id': 'https://openalex.org/C157657479', 'wikidata': 'https://www.wikidata.org/wiki/Q2367247', 'display_name': 'Closed captioning', 'level': 3, 'score': 0.90928733}, {'id': 'https://openalex.org/C97931131', 'wikidata': 'https://www.wikidata.org/wiki/Q5282087', 'display_name': 'Discriminative model', 'level': 2, 'score': 0.9067759}, {'id': 'https://openalex.org/C41008148', 'wikidata': 'https://www.wikidata.org/wiki/Q21198', 'display_name': 'Computer science', 'level': 0, 'score': 0.7949625}, {'id': 'https://openalex.org/C154945302', 'wikidata': 'https://www.wikidata.org/wiki/Q11660', 'display_name': 'Artificial intelligence', 'level': 1, 'score': 0.70626223}, {'id': 'https://openalex.org/C2779343474', 'wikidata': 'https://www.wikidata.org/wiki/Q3109175', 'display_name': 'Context (archaeology)', 'level': 2, 'score': 0.61576474}, {'id': 'https://openalex.org/C2776214188', 'wikidata': 'https://www.wikidata.org/wiki/Q408386', 'display_name': 'Inference', 'level': 2, 'score': 0.6154845}, {'id': 'https://openalex.org/C204321447', 'wikidata': 'https://www.wikidata.org/wiki/Q30642', 'display_name': 'Natural language processing', 'level': 1, 'score': 0.6044295}, {'id': 'https://openalex.org/C26517878', 'wikidata': 'https://www.wikidata.org/wiki/Q228039', 'display_name': 'Key (lock)', 'level': 2, 'score': 0.47115457}, {'id': 'https://openalex.org/C183322885', 'wikidata': 'https://www.wikidata.org/wiki/Q17007702', 'display_name': 'Context model', 'level': 3, 'score': 0.44135132}, {'id': 'https://openalex.org/C2778738651', 'wikidata': 'https://www.wikidata.org/wiki/Q16546687', 'display_name': 'Novelty', 'level': 2, 'score': 0.43859708}, {'id': 'https://openalex.org/C146849305', 'wikidata': 'https://www.wikidata.org/wiki/Q370766', 'display_name': 'Ground truth', 'level': 2, 'score': 0.41160044}, {'id': 'https://openalex.org/C115961682', 'wikidata': 'https://www.wikidata.org/wiki/Q860623', 'display_name': 'Image (mathematics)', 'level': 2, 'score': 0.40155625}, {'id': 'https://openalex.org/C153180895', 'wikidata': 'https://www.wikidata.org/wiki/Q7148389', 'display_name': 'Pattern recognition (psychology)', 'level': 2, 'score': 0.3667304}, {'id': 'https://openalex.org/C15744967', 'wikidata': 'https://www.wikidata.org/wiki/Q9418', 'display_name': 'Psychology', 'level': 0, 'score': 0.09674224}, {'id': 'https://openalex.org/C151730666', 'wikidata': 'https://www.wikidata.org/wiki/Q7205', 'display_name': 'Paleontology', 'level': 1, 'score': 0.0}, {'id': 'https://openalex.org/C77805123', 'wikidata': 'https://www.wikidata.org/wiki/Q161272', 'display_name': 'Social psychology', 'level': 1, 'score': 0.0}, {'id': 'https://openalex.org/C38652104', 'wikidata': 'https://www.wikidata.org/wiki/Q3510521', 'display_name': 'Computer security', 'level': 1, 'score': 0.0}, {'id': 'https://openalex.org/C2781238097', 'wikidata': 'https://www.wikidata.org/wiki/Q175026', 'display_name': 'Object (grammar)', 'level': 2, 'score': 0.0}, {'id': 'https://openalex.org/C86803240', 'wikidata': 'https://www.wikidata.org/wiki/Q420', 'display_name': 'Biology', 'level': 0, 'score': 0.0}], 'mesh': [], 'locations_count': 2, 'locations': [{'is_oa': False, 'landing_page_url': 'https://doi.org/10.1109/cvpr.2017.120', 'pdf_url': None, 'source': None, 'license': None, 'license_id': None, 'version': None, 'is_accepted': False, 'is_published': False}, {'is_oa': True, 'landing_page_url': 'https://arxiv.org/abs/1701.02870', 'pdf_url': 'https://arxiv.org/pdf/1701.02870', 'source': {'id': 'https://openalex.org/S4306400194', 'display_name': 'arXiv (Cornell University)', 'issn_l': None, 'issn': None, 'is_oa': True, 'is_in_doaj': False, 'is_core': False, 'host_organization': 'https://openalex.org/I205783295', 'host_organization_name': 'Cornell University', 'host_organization_lineage': ['https://openalex.org/I205783295'], 'host_organization_lineage_names': ['Cornell University'], 'type': 'repository'}, 'license': None, 'license_id': None, 'version': 'submittedVersion', 'is_accepted': False, 'is_published': False}], 'best_oa_location': {'is_oa': True, 'landing_page_url': 'https://arxiv.org/abs/1701.02870', 'pdf_url': 'https://arxiv.org/pdf/1701.02870', 'source': {'id': 'https://openalex.org/S4306400194', 'display_name': 'arXiv (Cornell University)', 'issn_l': None, 'issn': None, 'is_oa': True, 'is_in_doaj': False, 'is_core': False, 'host_organization': 'https://openalex.org/I205783295', 'host_organization_name': 'Cornell University', 'host_organization_lineage': ['https://openalex.org/I205783295'], 'host_organization_lineage_names': ['Cornell University'], 'type': 'repository'}, 'license': None, 'license_id': None, 'version': 'submittedVersion', 'is_accepted': False, 'is_published': False}, 'sustainable_development_goals': [{'id': 'https://metadata.un.org/sdg/10', 'display_name': 'Reduced inequalities', 'score': 0.75}], 'grants': [], 'datasets': [], 'versions': [], 'referenced_works_count': 49, 'referenced_works': ['https://openalex.org/W1514535095', 'https://openalex.org/W1686810756', 'https://openalex.org/W1706899115', 'https://openalex.org/W1797268635', 'https://openalex.org/W1861492603', 'https://openalex.org/W1895577753', 'https://openalex.org/W1905882502', 'https://openalex.org/W1931639407', 'https://openalex.org/W1933349210', 'https://openalex.org/W1947481528', 'https://openalex.org/W1956340063', 'https://openalex.org/W1977195311', 'https://openalex.org/W1983927101', 'https://openalex.org/W1996418862', 'https://openalex.org/W2064675550', 'https://openalex.org/W2094937109', 'https://openalex.org/W2112055291', 'https://openalex.org/W2120176017', 'https://openalex.org/W2123301721', 'https://openalex.org/W2133564696', 'https://openalex.org/W2144968237', 'https://openalex.org/W2151498684', 'https://openalex.org/W2202226326', 'https://openalex.org/W2251512949', 'https://openalex.org/W2252033417', 'https://openalex.org/W2264742718', 'https://openalex.org/W2293700449', 'https://openalex.org/W2332488709', 'https://openalex.org/W2398118205', 'https://openalex.org/W2496066288', 'https://openalex.org/W2530647954', 'https://openalex.org/W2574790321', 'https://openalex.org/W2602275733', 'https://openalex.org/W2603266952', 'https://openalex.org/W2949555952', 'https://openalex.org/W2950472486', 'https://openalex.org/W2951183276', 'https://openalex.org/W2962749469', 'https://openalex.org/W2962835968', 'https://openalex.org/W2963109634', 'https://openalex.org/W2963206148', 'https://openalex.org/W2963881016', 'https://openalex.org/W2964183327', 'https://openalex.org/W2964193163', 'https://openalex.org/W2964284374', 'https://openalex.org/W2964308564', 'https://openalex.org/W4295846245', 'https://openalex.org/W4298392976', 'https://openalex.org/W48884151'], 'related_works': ['https://openalex.org/W4388893791', 'https://openalex.org/W4290852288', 'https://openalex.org/W4283207562', 'https://openalex.org/W4210416330', 'https://openalex.org/W3088136942', 'https://openalex.org/W2963177403', 'https://openalex.org/W2949362007', 'https://openalex.org/W2775506363', 'https://openalex.org/W2330246314', 'https://openalex.org/W2145850538'], 'abstract_inverted_index': {'We': [0, 91, 121], 'introduce': [1], 'an': [2, 32, 104], 'inference': [3, 76], 'technique': [4, 95], 'to': [5, 73, 96, 101, 113, 127, 133], 'produce': [6], 'discriminative': [7, 124, 145, 154], 'context-aware': [8], 'image': [9, 33, 105, 125, 155], 'captions': [10, 41], '(captions': [11, 26], 'that': [12, 27, 51, 59, 69, 81, 130, 158], 'describe': [13, 28, 102], 'differences': [14], 'between': [15], 'images': [16, 39, 138], 'or': [17, 31], 'visual': [18], 'concepts)': [19], 'using': [20], 'only': [21], 'generic': [22], 'context-agnostic': [23, 83], 'training': [24], 'data': [25], 'a': [29, 57, 78, 85, 97, 107], 'concept': [30], 'in': [34, 56, 139], 'isolation).': [35], 'For': [36], 'example,': [37], 'given': [38], 'and': [40, 45, 84, 150, 164], 'of': [42, 117, 135], 'siamese': [43, 54], 'cat': [44, 55], 'tiger': [46, 63], 'cat,': [47], 'we': [48, 70], 'generate': [49, 128], 'language': [50, 79, 129], 'describes': [52], 'the': [53, 118, 140], 'way': [58], 'distinguishes': [60, 88], 'it': [61], 'from': [62], 'cat.': [64], 'Our': [65], 'key': [66], 'novelty': [67], 'is': [68, 82], 'show': [71], 'how': [72], 'do': [74], 'joint': [75], 'over': [77], 'model': [80], 'listener': [86], 'which': [87], 'closely-related': [89, 115], 'concepts.': [90], 'first': [92], 'apply': [93], 'our': [94, 159], 'justification': [98, 149], 'task,': [99], 'namely': [100], 'why': [103], 'contains': [106], 'particular': [108], 'fine-grained': [109], 'category': [110, 116], 'as': [111], 'opposed': [112], 'another': [114], 'CUB-200-2011': [119], 'dataset.': [120, 142], 'then': [122], 'study': [123], 'captioning': [126, 156], 'uniquely': [131], 'refers': [132], 'one': [134], 'two': [136], 'semantically-similar': [137], 'COCO': [141], 'Evaluations': [143], 'with': [144], 'ground': [146], 'truth': [147], 'for': [148, 153, 167], 'human': [151], 'studies': [152], 'reveal': [157], 'approach': [160], 'outperforms': [161], 'baseline': [162], 'generative': [163], 'speaker-listener': [165], 'approaches': [166], 'discrimination.': [168]}, 'cited_by_api_url': 'https://api.openalex.org/works?filter=cites:W2574790321', 'counts_by_year': [{'year': 2024, 'cited_by_count': 2}, {'year': 2023, 'cited_by_count': 14}, {'year': 2022, 'cited_by_count': 10}, {'year': 2021, 'cited_by_count': 17}, {'year': 2020, 'cited_by_count': 22}, {'year': 2019, 'cited_by_count': 32}, {'year': 2018, 'cited_by_count': 12}, {'year': 2017, 'cited_by_count': 3}], 'updated_date': '2025-01-03T20:53:05.212141', 'created_date': '2017-01-26'}
Publication Information

Basic Information

Access and Citation

AI Researcher Chatbot

Primary Location

Authors

Topics

Keywords

Related Works