详情 - 项目招募 - 大学生创新创业训练计划智能管理平台

基本情况

所属批次:

第二十七期“上海交通大学大学生创新实践计划”

选题名称:

Representation Learning for Deep Reinforcement Learning

选题类型:

创新训练项目

所属一级学科:

工学

所属二级学科:

电子信息类

项目研究类:

创意类

所属学院:

密西根学院

选题发起人:

PAUL AN-LIN WENG

选题发起人角色:

指导教师

选题发起人联系方式:

登录状态下查看

指导教师承担科研课题情况:

Reinforcement learning (RL) is a general model for adaptive control (e.g., autonomous driving, intelligent tutoring or robotics). In such a setting, an agent learns by interacting with an environment by trial and error. Recently, the combination of deep learning and reinforcement learning (called deep RL) has proved to be extremely powerful. Using such techniques, an agent can learn to play video games from visual inputs or the game of go at a superhuman level. Currently, research on RL and deep RL has become very active in the machine learning community, mainly because of the potential of this approach. Ongoing research work notably focuses on making those techniques more practical and efficient such that they could be applied to more diverse domains.

指导教师对本项目的支持情况:

This proposed project is the continuation of an exploratory project started as a collaboration with Huawei. The goal in that collaboration was to combine reinforcement learning methods and reasoning techniques to learn decision-making policies under the form of first-order logic programs. This research was conducted under the assumption that the input of the deep reinforcement was already available in the logic form. In this proposed project, we aim to learn object-centric representations from visual inputs such that the deep reinforcement learning agents can learn from high-level and more compact representations. The potential benefits are as follows: faster deep reinforcement training, more robust solutions, or interpretable policies.

选题信息:

A PhD student in my team has already started some work in that direction and designed a method based on visual transformers (paper under review). The undergraduate students joining this project will collaborate with my PhD student to help further improve the current method and perform more experiments.

The expected work would be as follows (in quarters):

- Q1: Learn the basics of deep reinforcement learning and deep learning; start performing some experiments with the current code written by my PhD student

- Q2: Improve the current method to make it more generic and extract object-centric information from images; perform further experiments with new method

- Q3: Analyze experiments data and tune the new method

- Q4: Demonstrate the effectiveness of the final method on various domains; write a research paper describing it

选题成员

已经选择选题成员数量:

1

指导教师

序号	教师姓名	电子邮箱	所属学院
1	PAUL AN-LIN WENG	登录状态下查看	密西根学院	第一指导教师

大学生创新创业训练计划智能管理平台

创新创业管理系统

详情

Representation Learning for Deep Reinforcement Learning

基本情况

选题成员

指导教师

选题附件