2021.01.05
Overview
A training system and a training method for reinforcement learning are disclosed. The training system includes a first computer device and a second computer device, and the computing power of the second computer device is greater than that of the first computer device. The first computer device stores a reinforcement learning model, receives input data, and inputs the input data into the reinforcement learning model to generate a first output result. The second computer device stores a supervised learning model, receives the input data from the first computer device, inputs the input data into the supervised learning model to generate a second output result, and transmits the second output result to the first computer device. The first computer device further generates feedback data according to the first output result and the second output result, and trains the reinforcement learning model according to the feedback data.
Category
Invention
Patented
110100312
Invention No. I775265
Filing Date
2021.01.05
Expiration Date
2041.01.04
Notification
2023.11.10
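
Illustrative sketch
The training flow described in the Overview can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the class names, the parity labeling rule, the tabular value update, and the +1/-1 feedback rule are all hypothetical, and the two devices are simulated in a single process instead of communicating over a network.

```python
import random


class SupervisedModel:
    """Stand-in for the supervised model on the second (more powerful) device."""

    def predict(self, x: int) -> int:
        # Hypothetical labeling rule (assumption): the parity of the input.
        return x % 2


class ReinforcementModel:
    """Stand-in for the reinforcement learning model on the first device."""

    def __init__(self, n_actions: int = 2, lr: float = 0.5,
                 epsilon: float = 0.1, seed: int = 0) -> None:
        self.q = {}  # maps each input to a list of per-action value estimates
        self.n_actions = n_actions
        self.lr = lr
        self.epsilon = epsilon
        self.rng = random.Random(seed)

    def act(self, x: int, explore: bool = True) -> int:
        values = self.q.setdefault(x, [0.0] * self.n_actions)
        # Epsilon-greedy: occasionally try a random action while training.
        if explore and self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)
        return max(range(self.n_actions), key=lambda a: values[a])

    def update(self, x: int, action: int, reward: float) -> None:
        values = self.q.setdefault(x, [0.0] * self.n_actions)
        # Move the chosen action's estimate toward the observed feedback.
        values[action] += self.lr * (reward - values[action])


def make_feedback(first_output: int, second_output: int) -> float:
    """Hypothetical feedback rule: reward agreement with the second output."""
    return 1.0 if first_output == second_output else -1.0


def train(steps: int = 2000, seed: int = 0):
    rng = random.Random(seed)
    first_device_model = ReinforcementModel(seed=seed)
    second_device_model = SupervisedModel()
    for _ in range(steps):
        x = rng.randrange(10)                        # input data
        first_output = first_device_model.act(x)     # first output result
        # In the described system, x is sent to the second device and the
        # second output result is transmitted back; here it is a local call.
        second_output = second_device_model.predict(x)
        feedback = make_feedback(first_output, second_output)
        first_device_model.update(x, first_output, feedback)
    return first_device_model, second_device_model
```

Calling `train()` returns both models; comparing `act(x, explore=False)` on the first model against `predict(x)` on the second shows how closely the reinforcement learning model has come to agree with the supervised one on the inputs it saw.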