Robotics Dataset Problems

Scale and Time Cost RT-1: 17 months, 13 robots, 130k Episodes over 700 tasks PaLM model has 780B tokens. Assuming 100 robots that produce one action token at 30 Hz for each robot, then the 100 ro...

Sep 7, 2024 Notes, Robotics, MachineLearning

General-Purpose Robot Roadmap

Good Enough Prior Goal: achieve baby-level of control of the body and is capable of doing fundamental instruction following. This could significantly narrow down the search space for the follo...

Sep 7, 2024 Notes, Robotics, MachineLearning

Embodied AI Paper Reviews

Imitation Learning BC-Z Gato Did not evaluate generalization to new tasks. Limited real-world data and evaluation. Robocat RT-1 It has its own action tokenization. It discretized eac...

Sep 7, 2024 Notes, Robotics, MachineLearning

Embodied AI Paper Reviews

Sep 6, 2024 Notes

Flop

Sep 5, 2024

Vectorization in Convolution Neural Networks

Sep 4, 2024 Notes, MachineLearning

System Design Template

Requirements Functional Requirements What do we want to achieve? Non-Functional Requirements Scale, qps Read heav...

Jul 21, 2024 Notes

Register Buffer

What does Pytorch register_buffer do? register_buffer saves the parameters in state_dict, but no gradients will be computed (hence no updates). model.parameters() will not return the ...

Jul 21, 2024 Notes

Reinforcement Learning Review Notes

It has been ~4 years since I read through Sutton’s Reinforcement Learning textbook. It is time to review it especially in this post-ChatGPT eura, and see there are any “温故而知新” moments. $k$-arm Ban...

Jul 12, 2024 Notes, MachineLearning

Deep Learning Training Tricks

Intro Here is a blog to summarize all the deep learning training tricks I encountered these days. AdamW Optimizer “W” means weight decay, which is the same concept of weight decay in L2 regulari...

Feb 17, 2024 Notes, MachineLearning

Robotics Dataset Problems

General-Purpose Robot Roadmap

Embodied AI Paper Reviews

Embodied AI Paper Reviews

Flop

Vectorization in Convolution Neural Networks

System Design Template

Register Buffer

Reinforcement Learning Review Notes

Deep Learning Training Tricks

Trending Tags