Title: A Multi-Agent Deep Reinforcement Learning Framework for VWAP Strategy Optimization
Abstract: As a classical optimal trade execution algorithm, the Volume Weighted Average Price (VWAP) strategy is favored by brokers. However, because it is schedule-based, it cannot perform well in a dynamic stock market, so optimizing the traditional VWAP strategy via reinforcement learning is worth investigating. Most existing reinforcement learning-based execution strategies formulate trading volumes or trading prices separately, ignoring the cooperation between the two. To address this issue, we propose a Multi-Agent Deep Q-Network (MADQN) trading framework that optimizes the traditional VWAP strategy, dynamically adapts to the complex stock market, and simultaneously formulates trading prices and volumes at each transaction period. Specifically, we design two types of agents: 1) a volume-driven agent that determines trading volumes at each transaction period, and 2) a price-driven agent that decides trading prices. We model the stock market environment in which the agents participate as a fully cooperative stochastic game: the volume-driven agent and the price-driven agent take joint actions to interact with the environment and then update their networks respectively. We use nine months of level-2 data for eight stocks from different sectors on the Shanghai Stock Exchange as experimental data. Experimental results demonstrate that our MADQN trading framework outperforms baselines on several evaluation metrics.
Publication Year: 2022
Publication Date: 2022-07-18
Language: en
Type: article
Indexed In: Crossref