On the First Passage $g$-Mean-Variance Optimality for Discounted Continuous-Time Markov Decision Processes

Xianping Guo; Xiangxiang Huang; Yi Zhang
{'id': 'https://openalex.org/W2182209928', 'doi': 'https://doi.org/10.1137/140968872', 'title': 'On the First Passage $g$-Mean-Variance Optimality for Discounted Continuous-Time Markov Decision Processes', 'display_name': 'On the First Passage $g$-Mean-Variance Optimality for Discounted Continuous-Time Markov Decision Processes', 'publication_year': 2015, 'publication_date': '2015-01-01', 'ids': {'openalex': 'https://openalex.org/W2182209928', 'doi': 'https://doi.org/10.1137/140968872', 'mag': '2182209928'}, 'language': 'en', 'primary_location': {'is_oa': False, 'landing_page_url': 'https://doi.org/10.1137/140968872', 'pdf_url': None, 'source': {'id': 'https://openalex.org/S897311980', 'display_name': 'SIAM Journal on Control and Optimization', 'issn_l': '0363-0129', 'issn': ['0363-0129', '1095-7138'], 'is_oa': False, 'is_in_doaj': False, 'is_core': True, 'host_organization': 'https://openalex.org/P4310320508', 'host_organization_name': 'Society for Industrial and Applied Mathematics', 'host_organization_lineage': ['https://openalex.org/P4310320508'], 'host_organization_lineage_names': ['Society for Industrial and Applied Mathematics'], 'type': 'journal'}, 'license': None, 'license_id': None, 'version': None, 'is_accepted': False, 'is_published': False}, 'type': 'article', 'type_crossref': 'journal-article', 'indexed_in': ['crossref'], 'open_access': {'is_oa': False, 'oa_status': 'closed', 'oa_url': None, 'any_repository_has_fulltext': False}, 'authorships': [{'author_position': 'first', 'author': {'id': 'https://openalex.org/A5101505539', 'display_name': 'Xianping Guo', 'orcid': 'https://orcid.org/0000-0001-6954-5947'}, 'institutions': [], 'countries': [], 'is_corresponding': False, 'raw_author_name': 'Xianping Guo', 'raw_affiliation_strings': [], 'affiliations': []}, {'author_position': 'middle', 'author': {'id': 'https://openalex.org/A5084123655', 'display_name': 'Xiangxiang Huang', 'orcid': 'https://orcid.org/0000-0002-5608-3218'}, 'institutions': [], 'countries': [], 'is_corresponding': False, 'raw_author_name': 'Xiangxiang Huang', 'raw_affiliation_strings': [], 'affiliations': []}, {'author_position': 'last', 'author': {'id': 'https://openalex.org/A5036373241', 'display_name': 'Yi Zhang', 'orcid': 'https://orcid.org/0000-0002-3200-6306'}, 'institutions': [], 'countries': [], 'is_corresponding': False, 'raw_author_name': 'Yi Zhang', 'raw_affiliation_strings': [], 'affiliations': []}], 'institution_assertions': [], 'countries_distinct_count': 0, 'institutions_distinct_count': 0, 'corresponding_author_ids': [], 'corresponding_institution_ids': [], 'apc_list': None, 'apc_paid': None, 'fwci': 1.336, 'has_fulltext': True, 'fulltext_origin': 'ngrams', 'cited_by_count': 9, 'citation_normalized_percentile': {'value': 0.802004, 'is_in_top_1_percent': False, 'is_in_top_10_percent': False}, 'cited_by_percentile_year': {'min': 85, 'max': 86}, 'biblio': {'volume': '53', 'issue': '3', 'first_page': '1406', 'last_page': '1424'}, 'is_retracted': False, 'is_paratext': False, 'primary_topic': {'id': 'https://openalex.org/T10462', 'display_name': 'Reinforcement Learning Algorithms', 'score': 0.9965, 'subfield': {'id': 'https://openalex.org/subfields/1702', 'display_name': 'Artificial Intelligence'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, 'topics': [{'id': 'https://openalex.org/T10462', 'display_name': 'Reinforcement Learning Algorithms', 'score': 0.9965, 'subfield': {'id': 'https://openalex.org/subfields/1702', 'display_name': 'Artificial Intelligence'}, 'field': {'id': 'https://openalex.org/fields/17', 'display_name': 'Computer Science'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, {'id': 'https://openalex.org/T10791', 'display_name': 'Model Predictive Control in Industrial Processes', 'score': 0.9951, 'subfield': {'id': 'https://openalex.org/subfields/2207', 'display_name': 'Control and Systems Engineering'}, 'field': {'id': 'https://openalex.org/fields/22', 'display_name': 'Engineering'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}, {'id': 'https://openalex.org/T11236', 'display_name': 'System Identification Techniques', 'score': 0.9811, 'subfield': {'id': 'https://openalex.org/subfields/2207', 'display_name': 'Control and Systems Engineering'}, 'field': {'id': 'https://openalex.org/fields/22', 'display_name': 'Engineering'}, 'domain': {'id': 'https://openalex.org/domains/3', 'display_name': 'Physical Sciences'}}], 'keywords': [{'id': 'https://openalex.org/keywords/multivariable-systems', 'display_name': 'Multivariable Systems', 'score': 0.523019}, {'id': 'https://openalex.org/keywords/optimization', 'display_name': 'Optimization', 'score': 0.514128}, {'id': 'https://openalex.org/keywords/parameter-estimation', 'display_name': 'Parameter Estimation', 'score': 0.509041}, {'id': 'https://openalex.org/keywords/real-time-optimization', 'display_name': 'Real-time Optimization', 'score': 0.501104}, {'id': 'https://openalex.org/keywords/constraint-handling', 'display_name': 'Constraint Handling', 'score': 0.500756}, {'id': 'https://openalex.org/keywords/value', 'display_name': 'Value (mathematics)', 'score': 0.5002401}], 'concepts': [{'id': 'https://openalex.org/C33923547', 'wikidata': 'https://www.wikidata.org/wiki/Q395', 'display_name': 'Mathematics', 'level': 0, 'score': 0.8403252}, {'id': 'https://openalex.org/C106189395', 'wikidata': 'https://www.wikidata.org/wiki/Q176789', 'display_name': 'Markov decision process', 'level': 3, 'score': 0.77228785}, {'id': 'https://openalex.org/C196083921', 'wikidata': 'https://www.wikidata.org/wiki/Q7915758', 'display_name': 'Variance (accounting)', 'level': 2, 'score': 0.7210768}, {'id': 'https://openalex.org/C204241405', 'wikidata': 'https://www.wikidata.org/wiki/Q461499', 'display_name': 'Transformation (genetics)', 'level': 3, 'score': 0.57495}, {'id': 'https://openalex.org/C14646407', 'wikidata': 'https://www.wikidata.org/wiki/Q1430750', 'display_name': 'Bellman equation', 'level': 2, 'score': 0.55598617}, {'id': 'https://openalex.org/C177264268', 'wikidata': 'https://www.wikidata.org/wiki/Q1514741', 'display_name': 'Set (abstract data type)', 'level': 2, 'score': 0.5477608}, {'id': 'https://openalex.org/C2776291640', 'wikidata': 'https://www.wikidata.org/wiki/Q2912517', 'display_name': 'Value (mathematics)', 'level': 2, 'score': 0.5002401}, {'id': 'https://openalex.org/C126255220', 'wikidata': 'https://www.wikidata.org/wiki/Q141495', 'display_name': 'Mathematical optimization', 'level': 1, 'score': 0.47240418}, {'id': 'https://openalex.org/C98763669', 'wikidata': 'https://www.wikidata.org/wiki/Q176645', 'display_name': 'Markov chain', 'level': 2, 'score': 0.43330202}, {'id': 'https://openalex.org/C14036430', 'wikidata': 'https://www.wikidata.org/wiki/Q3736076', 'display_name': 'Function (biology)', 'level': 2, 'score': 0.41167387}, {'id': 'https://openalex.org/C28826006', 'wikidata': 'https://www.wikidata.org/wiki/Q33521', 'display_name': 'Applied mathematics', 'level': 1, 'score': 0.3913471}, {'id': 'https://openalex.org/C159886148', 'wikidata': 'https://www.wikidata.org/wiki/Q176645', 'display_name': 'Markov process', 'level': 2, 'score': 0.37345198}, {'id': 'https://openalex.org/C144237770', 'wikidata': 'https://www.wikidata.org/wiki/Q747534', 'display_name': 'Mathematical economics', 'level': 1, 'score': 0.32774836}, {'id': 'https://openalex.org/C105795698', 'wikidata': 'https://www.wikidata.org/wiki/Q12483', 'display_name': 'Statistics', 'level': 1, 'score': 0.29149115}, {'id': 'https://openalex.org/C41008148', 'wikidata': 'https://www.wikidata.org/wiki/Q21198', 'display_name': 'Computer science', 'level': 0, 'score': 0.10789022}, {'id': 'https://openalex.org/C162324750', 'wikidata': 'https://www.wikidata.org/wiki/Q8134', 'display_name': 'Economics', 'level': 0, 'score': 0.076539814}, {'id': 'https://openalex.org/C55493867', 'wikidata': 'https://www.wikidata.org/wiki/Q7094', 'display_name': 'Biochemistry', 'level': 1, 'score': 0.0}, {'id': 'https://openalex.org/C185592680', 'wikidata': 'https://www.wikidata.org/wiki/Q2329', 'display_name': 'Chemistry', 'level': 0, 'score': 0.0}, {'id': 'https://openalex.org/C121955636', 'wikidata': 'https://www.wikidata.org/wiki/Q4116214', 'display_name': 'Accounting', 'level': 1, 'score': 0.0}, {'id': 'https://openalex.org/C78458016', 'wikidata': 'https://www.wikidata.org/wiki/Q840400', 'display_name': 'Evolutionary biology', 'level': 1, 'score': 0.0}, {'id': 'https://openalex.org/C86803240', 'wikidata': 'https://www.wikidata.org/wiki/Q420', 'display_name': 'Biology', 'level': 0, 'score': 0.0}, {'id': 'https://openalex.org/C199360897', 'wikidata': 'https://www.wikidata.org/wiki/Q9143', 'display_name': 'Programming language', 'level': 1, 'score': 0.0}, {'id': 'https://openalex.org/C104317684', 'wikidata': 'https://www.wikidata.org/wiki/Q7187', 'display_name': 'Gene', 'level': 2, 'score': 0.0}], 'mesh': [], 'locations_count': 1, 'locations': [{'is_oa': False, 'landing_page_url': 'https://doi.org/10.1137/140968872', 'pdf_url': None, 'source': {'id': 'https://openalex.org/S897311980', 'display_name': 'SIAM Journal on Control and Optimization', 'issn_l': '0363-0129', 'issn': ['0363-0129', '1095-7138'], 'is_oa': False, 'is_in_doaj': False, 'is_core': True, 'host_organization': 'https://openalex.org/P4310320508', 'host_organization_name': 'Society for Industrial and Applied Mathematics', 'host_organization_lineage': ['https://openalex.org/P4310320508'], 'host_organization_lineage_names': ['Society for Industrial and Applied Mathematics'], 'type': 'journal'}, 'license': None, 'license_id': None, 'version': None, 'is_accepted': False, 'is_published': False}], 'best_oa_location': None, 'sustainable_development_goals': [{'display_name': 'Peace, justice, and strong institutions', 'score': 0.78, 'id': 'https://metadata.un.org/sdg/16'}], 'grants': [], 'datasets': [], 'versions': [], 'referenced_works_count': 24, 'referenced_works': ['https://openalex.org/W1006554781', 'https://openalex.org/W1578233523', 'https://openalex.org/W1967477050', 'https://openalex.org/W1976658031', 'https://openalex.org/W2010559195', 'https://openalex.org/W2010654234', 'https://openalex.org/W2014323878', 'https://openalex.org/W2023432628', 'https://openalex.org/W2029631728', 'https://openalex.org/W2045668042', 'https://openalex.org/W2049032169', 'https://openalex.org/W2062076630', 'https://openalex.org/W2065030872', 'https://openalex.org/W2079291782', 'https://openalex.org/W2082197891', 'https://openalex.org/W2133991852', 'https://openalex.org/W2150606382', 'https://openalex.org/W2161297030', 'https://openalex.org/W2162378322', 'https://openalex.org/W2164154633', 'https://openalex.org/W2334782222', 'https://openalex.org/W3101717792', 'https://openalex.org/W657312522', 'https://openalex.org/W945806452'], 'related_works': ['https://openalex.org/W4308702637', 'https://openalex.org/W2903299703', 'https://openalex.org/W2808418668', 'https://openalex.org/W2386410636', 'https://openalex.org/W2379651310', 'https://openalex.org/W2156021013', 'https://openalex.org/W2152670157', 'https://openalex.org/W2062076630', 'https://openalex.org/W2016648086', 'https://openalex.org/W176737593'], 'abstract_inverted_index': {'This': [0], 'paper': [1], 'considers': [2], 'the': [3, 34, 48, 52, 66, 75, 80, 90, 99, 116], 'discounted': [4], 'continuous-time': [5, 53], 'Markov': [6], 'decision': [7], 'processes': [8], '(MDPs)': [9], 'in': [10], 'Borel': [11], 'spaces': [12], 'and': [13, 27, 87, 93, 103], 'with': [14], 'unbounded': [15], 'transition': [16], 'rates.': [17], 'The': [18], 'discount': [19], 'factors': [20], 'are': [21, 111], 'allowed': [22], 'to': [23, 47, 55, 114], 'depend': [24], 'on': [25, 33], 'states': [26], 'actions.': [28], 'Main': [29], 'attention': [30], 'is': [31], 'concentrated': [32], 'set': [35, 81], '$F_g$': [36, 82], 'of': [37, 51, 68, 118], 'stationary': [38], 'policies': [39], 'attaining': [40], 'a': [41, 69, 84, 104], 'given': [42], 'mean': [43], 'performance': [44], '$g$': [45], 'up': [46], 'first': [49, 76], 'passage': [50, 77], 'MDP': [54], 'an': [56], 'arbitrarily': [57], 'fixed': [58], 'target': [59], 'set.': [60], 'Under': [61], 'suitable': [62], 'conditions,': [63], 'we': [64], 'prove': [65], 'existence': [67], '$g$-mean-variance': [70, 105], 'optimal': [71, 106], 'policy': [72, 94], 'that': [73], 'minimizes': [74], 'variance': [78], 'over': [79], 'using': [83], 'transformation': [85], 'technique,': [86], 'also': [88], 'give': [89], 'value': [91, 101], 'iteration': [92, 95], 'algorithms': [96], 'for': [97], 'computing': [98], '$g$-variance': [100], 'function': [102], 'policy,': [107], 'respectively.': [108], 'Two': [109], 'examples': [110], 'analytically': [112], 'solved': [113], 'demonstrate': [115], 'application': [117], 'our': [119], 'results.': [120]}, 'cited_by_api_url': 'https://api.openalex.org/works?filter=cites:W2182209928', 'counts_by_year': [{'year': 2023, 'cited_by_count': 2}, {'year': 2019, 'cited_by_count': 3}, {'year': 2018, 'cited_by_count': 2}, {'year': 2017, 'cited_by_count': 2}], 'updated_date': '2024-09-16T03:39:32.110467', 'created_date': '2016-06-24'}
Publication Information

Basic Information

Access and Citation

AI Researcher Chatbot

Primary Location

Authors

Topics

Keywords

Related Works