Title: Consolidated actor–critic model for partially-observable Markov decision processes
Abstract: A method is presented for consolidating the traditionally separate actor and critic neural networks used in temporal-difference learning for partially observable Markov decision processes (POMDPs). Simulation results on a five-state POMDP problem support the claim that the consolidated model achieves higher performance while reducing computational and storage requirements to approximately half those of the traditional two-network approach.
Publication Year: 2008
Publication Date: 2008-01-01
Language: en
Type: article
Indexed In: ['crossref']
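The consolidation the abstract describes can be illustrated with a single network whose shared hidden layer feeds both a policy (actor) head and a value (critic) head, so the two roles share one set of hidden-layer weights instead of maintaining two full networks. The sketch below is an assumption-laden illustration of that general idea, not the paper's actual architecture; all sizes, names, and the numpy implementation are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: a 5-state POMDP observed through a feature vector.
N_OBS, N_HIDDEN, N_ACTIONS = 5, 16, 2

# Shared hidden layer: one weight matrix serves both actor and critic,
# which is the source of the storage/computation savings versus keeping
# two separate networks with their own hidden layers.
W_h = rng.normal(0.0, 0.1, (N_OBS, N_HIDDEN))
W_pi = rng.normal(0.0, 0.1, (N_HIDDEN, N_ACTIONS))  # actor head: policy logits
w_v = rng.normal(0.0, 0.1, N_HIDDEN)                # critic head: state value

def forward(obs):
    """One pass through the consolidated network.

    Returns the shared hidden activation, the action probabilities
    (softmax over the actor head), and the scalar value estimate
    (critic head), all computed from the same hidden layer.
    """
    h = np.tanh(obs @ W_h)                 # shared representation
    logits = h @ W_pi
    probs = np.exp(logits - logits.max())  # numerically stable softmax
    probs /= probs.sum()
    value = float(h @ w_v)
    return h, probs, value

def td_error(reward, value, next_value, gamma=0.9):
    """Standard TD(0) error; in actor-critic methods this single scalar
    drives updates to both the policy and the value heads."""
    return reward + gamma * next_value - value
```

Because both heads read the same hidden activation `h`, a forward pass costs one hidden-layer computation rather than two, consistent with the abstract's claim of roughly halved computational and storage requirements.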