Important links
NeurIPS website, Zoom, and Rocket.Chat
Submit your questions for our panel session here
Abstract
Advances in learning-based methods for perception, decision making, and control continue to open up new possibilities for deployment on physical robot platforms. A recent example is the considerable rate of progress in representation learning, which eases the application of supervised and reinforcement learning to domains with image-based data. However, the development and evaluation of algorithmic progress are often constrained to simulation and rigid datasets, leading to overfitting to specific characteristics of these limited domains.
Experiments on physical platforms benefit from the complexity and variety of real-world data, both for the generality of evaluation and the richness of training data. While direct contact with the real world provides a grounding for algorithmic performance, deployment also introduces its own challenges for experimentation and reproducibility: environments, tasks, and platforms have to be standardized, relevant, and broadly accessible. Finding suitable compromises, by improving the realism of datasets and simulators while addressing the limits of real-world experiments, will be important to ensure that research insights survive the test of time.
The goal of the workshop is to discuss the challenges of machine learning research in the context of physical systems. This discussion involves the presentation of current methods and of the experience gained during algorithm deployment on real-world platforms. Moreover, the workshop aims to further strengthen the ties between the robotics and machine learning communities by discussing how their respective recent directions result in new challenges, requirements, and opportunities for future research.
Rather than merely focusing on applications of machine learning in robotics, as in the previous, successful iterations of the workshop, the new interdisciplinary panel will foster discussion on how real-world applications such as robotics can trigger impactful directions for the development of machine learning, and vice versa. To further this discussion, we aim to improve interaction and communication across a diverse set of scientists at various stages of their careers. Instead of the common trade-off between attracting a wider audience with well-known speakers and enabling early-stage researchers to voice their opinions, we encourage each of our senior presenters to share their presentation with a PhD student or postdoc from their lab. We also ask all our presenters, invited and contributed, to add a “dirty laundry” slide describing the limitations and shortcomings of their work. We expect this will aid further discussion in the poster and panel sessions, in addition to helping junior researchers avoid similar roadblocks along their path.
Scope of contributions:
- Challenges in real-world application/deployment of machine learning.
- Understanding, quantifying, and bridging the simulation to reality gap.
- Reward specification/learning and safety.
- Reproducibility, reliability, and robustness.
- Data-efficiency via transfer/multitask/meta learning.
- Self-supervised/semi-supervised/representation learning.
- Online/active learning for calibration, system identification, and adapting to a changing dynamics model due to wear and other sources of covariate shift.
- Domain adaptation including but not limited to: Sim-to-Real, Real-to-Sim, across multiple robotic platforms or environments.
- Multi-task learning.
- Standardized frameworks for physical, real-world evaluation of machine learning algorithms.
Important dates
- Submission deadline: 09 October 2020 (Anywhere on Earth)
- Notification: 23 October 2020 (Anywhere on Earth)
- Poster due: 24 November 2020 (Anywhere on Earth)
- Camera-ready due: 04 December 2020 (Anywhere on Earth)
- Workshop: 11 December 2020
Invited Speakers
- Dorsa Sadigh and Erdem Biyik (Stanford University)
Walking the Boundary of Learning and Interaction: There have been significant advances in the field of robot learning in the past decade. However, many challenges still remain when considering how robot learning can advance interactive agents such as robots that collaborate with humans. This includes autonomous vehicles that interact with human-driven vehicles or pedestrians, service robots collaborating with their users at home over short or long periods of time, or assistive robots helping patients with disabilities. This introduces an opportunity for developing new robot learning algorithms that can help advance interactive autonomy. In this talk, we will discuss a formalism for human-robot interaction built upon ideas from representation learning. Specifically, we will first discuss the notion of latent strategies: low-dimensional representations sufficient for capturing non-stationary interactions. We will then talk about the challenges of learning such representations when interacting with humans, and how we can develop data-efficient techniques that enable actively learning computational models of human behavior from demonstrations and preferences.
- Pete Florence and Daniel Seita (Google Research, Mountain View)
Object- and Action-Centric Representational Robot Learning: In this talk we’ll discuss different views on representations for robot learning, in particular towards the goal of precise, generalizable vision-based manipulation skills that are sample-efficient and scalable to train. Object-centric representations, on the one hand, can enable using rich additional sources of learning, and can enable various efficient downstream behaviors. Action-centric representations, on the other hand, can learn high-level planning, and do not have to explicitly instantiate objectness. As case studies we’ll look at two recent papers in these two areas.
- Carolina Parada (Google Research, Mountain View)
State of Robotics @ Google: Robotics@Google’s mission is to make robots useful in the real world through machine learning. We are excited about a new model for robotics, designed for generalization across diverse environments and instructions. This model is focused on scalable data-driven learning, which is task-agnostic, leverages simulation, learns from past experience, and can be quickly adapted to work in the real-world through limited interactions. In this talk, we’ll share some of our recent work in this direction in both manipulation and locomotion applications.
- Jemin Hwangbo and JooWoong Byun (Korea Advanced Institute of Science and Technology)
Learning-based Control of a Legged Robot: Legged robots pose one of the greatest challenges in robotics. Dynamic and agile maneuvers of animals cannot be imitated by existing methods that are crafted by humans. A compelling alternative is reinforcement learning, which requires minimal craftsmanship and promotes the natural evolution of a control policy. However, so far, reinforcement learning research for legged robots has mainly been limited to simulation, and only a few comparatively simple examples have been deployed on real systems. The primary reason is that training with real robots, particularly with dynamically balancing systems, is complicated and expensive. Recent algorithmic improvements have made simulation both cheaper and more accurate. Leveraging such tools to obtain control policies is thus a seemingly promising direction. However, a few simulation-related issues have to be addressed before utilizing them in practice. The biggest obstacle is the so-called reality gap: discrepancies between the simulated and the real system. Hand-crafted models often fail to achieve a reasonable accuracy due to the complexities of the actuation systems of existing robots. This talk will focus on how such obstacles can be overcome. The main approaches are twofold: a fast and accurate algorithm for solving contact dynamics, and a data-driven simulation-augmentation method using deep learning. These methods are applied to the ANYmal robot, a sophisticated medium-dog-sized quadrupedal system. Using policies trained in simulation, the quadrupedal machine achieves locomotion skills that go beyond what had been achieved with prior methods: ANYmal is capable of precisely and energy-efficiently following high-level body velocity commands, running faster than ever before, and recovering from falling even in complex configurations.
- Fabio Ramos and Anthony Tompkins (University of Sydney and NVIDIA)
RL with Sim2Real in the Loop/Online Domain Adaptation for Mapping: We will give two talks describing recent developments by the group. First, we will present a Bayesian solution to the problem of estimating posterior distributions over simulation parameters given real data. The uncertainty captured in the posterior can significantly improve the performance of reinforcement learning algorithms trained in simulation but deployed in the real world. We will also show that sequentially combining posterior parameter estimation and policy updates leads to further improvements in the convergence rate. In the second part, we will address mapping as an online classification problem. We will show that optimal transport can be a valuable theoretical framework for quickly transforming geometric information obtained in a real or simulated environment into a secondary domain, leveraging prior information in an elegant and efficient manner.
Panelists
- Peter Stone (UT Austin)
- Jeannette Bohg (Stanford University)
- Dorsa Sadigh (Stanford University)
- Pete Florence (Google Research, Mountain View)
- Carolina Parada (Google Research, Mountain View)
- Jemin Hwangbo (Korea Advanced Institute of Science and Technology)
- Fabio Ramos (University of Sydney and NVIDIA)
Organizers
- Masha Itkina (Stanford University)
- Alex Bewley (Google Research, Zurich)
- Igor Gilitschenski (Massachusetts Institute of Technology)
- Julien Perez (Naver Labs Europe)
- Ransalu Senanayake (Stanford University)
- Markus Wulfmeier (Google DeepMind, London)
- Roberto Calandra (Facebook AI Research)
- Vincent Vanhoucke (Google Research, Mountain View)
Schedule
In Pacific Time (San Francisco Time)
07:30 - 07:45 | Introduction |
07:45 - 08:30 | Invited talk 1 - “Walking the Boundary of Learning and Interaction” - Dorsa Sadigh and Erdem Biyik |
08:30 - 08:45 | Contributed talk 1 - “Accelerating Reinforcement Learning with Learned Skill Priors” (Best Paper Runner-Up) - Karl Pertsch |
08:45 - 09:45 | Poster session 1 |
09:45 - 10:30 | Invited talk 2 - “Object- and Action-Centric Representational Robot Learning” - Pete Florence and Daniel Seita |
10:30 - 11:15 | Invited talk 3 - “State of Robotics @ Google” - Carolina Parada |
11:15 - 15:00 | Break |
15:00 - 16:00 | Panel discussion - Pete Florence, Dorsa Sadigh, Carolina Parada, Jeannette Bohg, Peter Stone, and Fabio Ramos |
16:00 - 16:45 | Invited talk 4 - “Learning-based Control of a Legged Robot” - Jemin Hwangbo and JooWoong Byun |
16:45 - 17:00 | Contributed talk 2 - “Multi-Robot Deep Reinforcement Learning via Hierarchically Integrated Models” (Best Paper) - Katie Kang |
17:00 - 17:30 | Break |
17:30 - 18:15 | Invited talk 5 - “RL with Sim2Real in the Loop/Online Domain Adaptation for Mapping” - Fabio Ramos and Anthony Tompkins |
18:15 - 19:15 | Poster session 2 |
19:15 - 19:30 | Closing |
Poster Session
- A1 (sessions 1 & 2): Making Hyper-parameters of Proximal Policy Optimization Robust to Time Discretization
  Homayoon Farrahi (University of Alberta), Rupam Mahmood (University of Alberta)
- A2 (sessions 1 & 2): Learning to solve multi-robot scheduling: mean-field inference theory for random GNN embedding and scalable auction with provable guarantee
  Hyunwook Kang (Texas A&M University), Seungwoo Schin (NCSoft), James R Morrison (KAIST), Jinkyoo Park (KAIST)
- A3 (sessions 1 & 2): Self-Supervised Policy Adaptation during Deployment
  Nicklas A Hansen (Technical University of Denmark), Rishabh Jangir (University of California San Diego), Yu Sun (UC Berkeley), Guillem Alenyà (IRI), Pieter Abbeel (UC Berkeley), Alexei A Efros (UC Berkeley), Lerrel Pinto (NYU/Berkeley), Xiaolong Wang (UCSD)
- A4 (sessions 1 & 2): Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments
  Jun Yamada (University of Southern California), Youngwoon Lee (University of Southern California), Gautam Salhotra (University of Southern California), Karl Pertsch (University of Southern California), Max Pflueger (University of Southern California), Gaurav Sukhatme (University of Southern California), Joseph J Lim (USC), Peter Englert (University of Southern California)
- A5 (sessions 1 & 2): SAFARI: Safe and Active Robot Imitation Learning with Imagination
  Norman Di Palo (Imperial College London), Edward Johns (Imperial College London)
- A6 (session 1): COG: Connecting New Skills to Past Experiences with Offline Reinforcement Learning
  Avi Singh (UC Berkeley), Albert Yu (UC Berkeley), Jonathan H Yang (UC Berkeley), Aviral Kumar (UC Berkeley), Jesse Zhang (UC Berkeley), Sergey Levine (UC Berkeley)
- A7 (sessions 1 & 2): Model-based Policy Search for Partially Measurable Systems
  Diego Romeres (MERL), Fabio Amadio (University of Padua), Alberto Dalla Libera (University of Padova), Ruggero Carli (University of Padova), Daniel Nikovski (University of Padova)
- A8 (session 1): State Representations in Robotics: Identifying Relevant Factors of Variation using Weak Supervision
  Constantinos Chamzas (Rice University), Martina Lippi (University of Salerno), Michael Welle (KTH Royal Institute of Technology), Anastasiia Varava (KTH Royal Institute of Technology), Alessandro Marino (University of Cassino and Southern Lazio), Lydia Kavraki (Rice University), Danica Kragic (KTH Royal Institute of Technology)
- B1 (sessions 1 & 2): Contextual Reinforcement Learning of Visuo-tactile Multi-fingered Grasping Policies
  Visak C V Kumar (Georgia Institute of Technology), Tucker Hermans (University of Utah), Dieter Fox (NVIDIA), Stan Birchfield (NVIDIA), Jonathan Tremblay (NVIDIA)
- B2 (session 1): Same Object, Different Grasps: Data and Semantic Knowledge for Task-Oriented Grasping
  Adithyavairavan Murali (Carnegie Mellon University Robotics Institute), Weiyu Liu (Georgia Institute of Technology), Kenneth Marino (Georgia Institute of Technology), Sonia Chernova (Georgia Institute of Technology), Abhinav Gupta (CMU/FAIR)
- B3 (sessions 1 & 2): Multi-Agent Active Search and Rescue
  Ramina Ghods (Carnegie Mellon University), Arundhati Banerjee (Carnegie Mellon University), William Durkin (Ohio State University), Jeff Schneider (CMU)
- B4 (sessions 1 & 2): Multi-Robot Deep Reinforcement Learning via Hierarchically Integrated Models
  Katie Kang (UC Berkeley), Gregory Kahn (UC Berkeley), Sergey Levine (University of California, Berkeley)
- B5 (sessions 1 & 2): Learning Visual-Locomotion Policies that Generalize to Diverse Environments
  Alejandro Escontrela (Google Brain), George Yu (Robotics at Google), Peng Xu (Google Inc), Atil Iscen (Google), Jie Tan (Google)
- B6 (sessions 1 & 2): Structure Policy Representation: Imposing Stability in arbitrarily conditioned dynamic systems
  Julen Urain (TU Darmstadt)
- B7 (session 1): Safe Sequential Exploration and Exploitation
  Thomas J Lew (Stanford University), Apoorva Sharma (Stanford University), James Harrison (Stanford University), Marco Pavone (Stanford University)
- B8 (sessions 1 & 2): Batch Exploration with Examples for Scalable Robotic Reinforcement Learning
  Annie S Chen (Stanford University), HyunJi Nam (Stanford University), Suraj Nair (Stanford University), Chelsea Finn (Stanford)
- C1 (sessions 1 & 2): Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones
  Ashwin Balakrishna (UC Berkeley), Brijen Thananjeyan (UC Berkeley), Suraj Nair (Stanford University), Michael Luo (UC Berkeley), Krishnan Srinivasan (Stanford University), Minho Hwang (UC Berkeley), Julian Ibarz (Google), Chelsea Finn (Stanford), Ken Goldberg (UC Berkeley)
- C2 (sessions 1 & 2): Deep Affordance Foresight: Planning for What Can Be Done Next
  Danfei Xu (Stanford University), Ajay Mandlekar (Stanford University), Roberto Martín-Martín (Stanford University), Yuke Zhu (Stanford University), Li Fei-Fei (Stanford University)
- C3 (sessions 1 & 2): Accelerating Reinforcement Learning with Learned Skill Priors
  Karl Pertsch (University of Southern California), Youngwoon Lee (University of Southern California), Joseph J Lim (USC)
- C4 (sessions 1 & 2): TACTO: A Simulator for Learning Control from Touch Sensing
  Shaoxiong Wang (MIT), Mike Lambeta (Facebook), Po-Wei Chou (Facebook), Roberto Calandra (Facebook)
- C5 (sessions 1 & 2): Visual Imitation Made Easy
  Sarah M Young (UC Berkeley), Dhiraj P Gandhi (Carnegie Mellon University), Shubham Tulsiani (Facebook AI Research), Abhinav Gupta (CMU/FAIR), Pieter Abbeel (UC Berkeley), Lerrel Pinto (NYU/Berkeley)
- C6 (session 2): Parrot: Data-driven Behavioral Priors for Reinforcement Learning
  Avi Singh (UC Berkeley), Huihan Liu (UC Berkeley), Gaoyue Zhou (University of California, Berkeley), Albert Yu (UC Berkeley), Nick Rhinehart (UC Berkeley), Sergey Levine (UC Berkeley)
- C7 (sessions 1 & 2): Transformer-based Meta-Imitation Learning for Robotic Manipulation
  Julien Perez (Naver Labs Europe), Théo Cachet (Naver Labs Europe), Seungsu Kim (Naver Labs Europe)
- C8 (session 2): Efficient Exploration in Reinforcement Learning Leveraging Automated Planning
  Yohei Hayamizu (The University of Electro-Communications), Saeid Amiri (SUNY Binghamton), Kishan Chandan (Binghamton University), Keiki Takadama (The University of Electro-Communications), Shiqi Zhang (SUNY Binghamton)
- D1 (sessions 1 & 2): Robust Maximum Entropy Behavior Cloning
  Mostafa Hussein (University of New Hampshire), Marek Petrik (University of New Hampshire), Brendan Crowe (University of New Hampshire), Momotaz Begum (University of New Hampshire)
- D2 (sessions 1 & 2): IV-SLAM: Introspective Vision for Simultaneous Localization and Mapping
  Sadegh Rabiee (University of Texas at Austin), Joydeep Biswas (University of Texas at Austin)
- D3 (sessions 1 & 2): Blending MPC & Value Function Approximation for Efficient Reinforcement Learning
  Mohak Bhardwaj (University of Washington), Sanjiban Choudhury (University of Washington), Byron Boots (University of Washington)
- D4 (session 2): Learning Cross-Domain Correspondence for Control with Dynamics Cycle-Consistency
  Qiang Zhang (Shanghai Jiao Tong University), Tete Xiao (UC Berkeley), Alexei A Efros (UC Berkeley), Lerrel Pinto (NYU/Berkeley), Xiaolong Wang (UCSD)
- D5 (sessions 1 & 2): Differentiable SLAM-nets: Learning Task-Oriented SLAM for Visual Navigation
  Peter Karkus (National University of Singapore), Shaojun Cai (National University of Singapore), David Hsu (NUS)
- D6 (session 1): Learning from Simulation, Racing in Reality
  Eugenio Chisari (ETH Zurich), Alexander Liniger (ETH Zurich), Alisa Rupenyan (ETH Zurich), Luc Van Gool (ETH Zurich), John Lygeros (ETH Zurich)
- D7 (sessions 1 & 2): RMP2: A Differentiable Policy Class for Robotic Systems with Control-Theoretic Guarantees
  Anqi Li (University of Washington), Ching-An Cheng (Microsoft), Muhammad Asif Rana (Georgia Tech), Nathan Ratliff (NVIDIA), Byron Boots (University of Washington)
- D8 (sessions 1 & 2): AWAC: Accelerating Online Reinforcement Learning from Offline Datasets
  Ashvin V Nair (UC Berkeley), Murtaza Dalal (CMU), Abhishek Gupta (UC Berkeley), Sergey Levine (UC Berkeley)
Program Committee
We would like to thank the program committee for shaping the excellent technical program. In alphabetical order they are: Achin Jain, Adithyavairavan Murali, Akshara Rai, Alex Bewley, Ashvin Nair, Brian Ichter, Caterina Buizza, Coline Devin, Djalel Benbouzid, Dushyant Rao, Edward Johns, Jacob Varley, James Harrison, Jayesh Gupta, Jianwei Yang, Jie Tan, Johannes A. Stork, Jonathan Tompson, Karol Hausman, Kunal Menda, Marcin Andrychowicz, Marco Ewerton, Marko Bjelonic, Misha Denil, Nantas Nardelli, Nemanja Rakicevic, Octavio Antonio Villarreal Magaña, Panpan Cai, Peter Karkus, Raunak Bhattacharyya, Ruohan Wang, Sasha Salter, Siddharth Reddy, Spencer Richards, Takayuki Osa, Tomi Silander, Tuomas Haarnoja, Vikas Sindhwani, Walter Goodwin, Yevgen Chebotar, Yizhe Wu, Yunzhu Li
Manuscript Submission Instructions
Submissions should use the NeurIPS Workshop template available here and be 4 pages long (plus as many pages as necessary for references). The reviewing process will be double-blind, so please submit anonymously by using ‘\usepackage{neurips_wrl2020}’ in your main tex file.
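For concreteness, here is a minimal sketch of a main tex file for the anonymous review version. The title and body are placeholders, and we assume the workshop style file follows the usual NeurIPS template conventions:

    \documentclass{article}
    % Anonymous review version: without the [final] option, author names stay hidden.
    \usepackage{neurips_wrl2020}
    \title{Your Paper Title}
    \begin{document}
    \maketitle
    % Up to 4 pages of content, plus as many pages as needed for references.
    \end{document}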
Accepted papers and any supplementary material will be made available on the workshop website. However, this does not constitute an archival publication, and no formal workshop proceedings will be made available, meaning contributors are free to publish their work in archival journals or conferences.
Submissions can be made at https://cmt3.research.microsoft.com/NEURIPSWRL2020/.
Poster and Camera-Ready Submission Instructions
Poster deadline (Nov 24, 2020 AOE)
- You will be given access to gather.town to interact with the workshop attendees and present your work. We will provide more details about gather.town in early December. In the meantime, you are required to create a poster and upload it before Nov 24, 2020 AOE.
- The poster template is available at http://www.robot-learning.ml/2020/pptTemplate.pptx
- We collect two files to ensure that your poster displays at good quality on gather.town. The first file is a PDF (< 20 MB) and the second is a PNG (< 3 MB, with a resolution of at least 1000x560). The file names should be your paper ID, e.g., paperid.pdf and paperid.png.
- Both files should be uploaded by clicking Create Camera Ready Submission at https://cmt3.research.microsoft.com/NEURIPSWRL2020/ before Nov 24, 2020 AOE.
Camera-ready paper deadline (Dec 4, 2020 AOE)
- The camera-ready paper deadline is Dec 04, 2020 AOE.
- Please replace your old LaTeX style file with the new style file available at http://www.robot-learning.ml/2020/neurips_wrl2020.sty. We have fixed a glitch.
- Make sure to include your name and affiliations. This can be done by adding the “final” option, i.e., \usepackage[final]{neurips_wrl2020}, in your tex file (see the sketch after this list).
- The camera-ready version is still 4 pages + references.
- This file should be uploaded by clicking Create Camera Ready Submission at https://cmt3.research.microsoft.com/NEURIPSWRL2020/. This is the same space where you uploaded your two poster files.
- If you have supplementary material as a pdf, you can add it as extra pages to the camera-ready paper (after the main 4 pages and references) and submit everything as a single pdf.
- If you have videos, code, or any other non-pdf materials, unfortunately, we cannot accept them. However, you are encouraged to upload them separately to your own website, Google Drive, Dropbox, GitHub, YouTube, etc., and include the links in the paper.
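As a minimal sketch of the camera-ready switch, assuming the style file follows the standard NeurIPS option convention, the only preamble change from the anonymous version is the “final” option together with your (previously hidden) author block; the name, affiliation, and email below are placeholders:

    % Camera-ready version: the [final] option reveals author names and affiliations.
    \usepackage[final]{neurips_wrl2020}
    \author{Jane Doe \\ Example University \\ \texttt{jane.doe@example.edu}}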
FAQ
- Can supplementary material be added beyond the 4-page limit and are there any restrictions on it?
Yes, you may include additional supplementary material, but we ask that it be limited to a reasonable amount (max 10 pages in addition to the main submission) and that it follow the same NeurIPS format as the paper. References do not count towards the limit of 4 pages.
- Can a submission to this workshop be submitted to another NeurIPS workshop in parallel?
We discourage this, as it leads to more work for reviewers across multiple workshops. Our suggestion is to pick one workshop to submit to.
- Can a paper be submitted to the workshop that has already appeared at a previous conference with published proceedings?
We will not be accepting such submissions unless they have been adapted to contain significantly new results (where novelty is one of the qualities reviewers will be asked to evaluate). However, we will accept submissions that are under review at the time of submission to our workshop (i.e., before Oct 9). For instance, papers that have been submitted to the Conference on Robot Learning (CoRL) 2020 can be submitted to our workshop.
- My real-robot experiments are affected by Covid-19. Can I include simulation results instead?
If your paper requires conducting experiments on physical robots and access to the experimental platform is limited due to Covid-19 workplace access restrictions, you may, whenever possible, validate your methods through simulation instead.
Contacts
For any further questions, you can contact us at neuripswrl2020@robot-learning.ml
Sponsors
We are very thankful to our corporate sponsors, Naver Labs Europe and Google Brain, for enabling us to provide best paper awards and cover student registration fees.