- [June 2025] -- Our new preprint on grounding VLMs for TAMP is out! See the project page and arXiv here.
- [April 2025] -- I graduated from Waseda University and started my master's degree at the University of Tokyo!
- [November 2024] -- I was invited as a speaker at Tokyo AI and gave an introductory talk on TAMP. See the speaker deck here.
- [May 2024] -- Our paper won the Best Video Award at the ICRA 2024 Cooking Robotics Workshop!
|
Research
My ultimate research goal is develop autonomous robots that can be incorporated in our daily lives. I envision robots that can perform long-horizon decision-making in complex environments, while continuously learn new skills throughout their lifetime. To this end, my current research explores foundation models for planning, data-efficient imitation learning, and multimodal representation learning.
|
|
Grounded Vision-Language Interpreter for Integrated Task and Motion Planning
Jeremy Siburian*, Keisuke Shirai*, Cristian C. Beltran-Hernandez*, Masashi Hamaya, Michael Görner, Atsushi Hashimoto
Preprint (In Submission)
Project Page / arXiv
TLDR: While recent advances in vision-language models (VLMs) have accelerated the development of language-guided robot planners, their black-box nature often lacks safety guarantees and interpretability crucial for real-world deployment. This paper proposes ViLaIn-TAMP, a hybrid planning framework for enabling verifiable, interpretable, and autonomous robot behaviors.
|
|
Integrated Task and Motion Planning for Real-World Cooking Tasks
Jeremy Siburian*, Cristian C. Beltran Hernandez*, Masashi Hamaya
SII 2025; ICRA 2024 Cooking Robotics Workshop
🎉 Best Video Award 🎉
OpenReview /
YouTube
TLDR: Our framework integrates PDDLStream, an existing TAMP framework, with the MoveIt Task Constructor, a multi-stage manipulation planner, to enhance multi-step motion planning for interdependent tasks. We augment our framework with various cooking-related skills, such as object fixturing, force-based tip detection, and slicing using Reinforcement Learning (RL).
|
|
Comparative Study of Robotic Slip Detection Algorithms using Distributed 3-Axis Tactile Sensing
Jeremy Siburian, Alexander Schmitz, Tito Pradhono Tomo, Sophon Somlor, Gang Yan, Satoshi Funabashi, Shigeki Sugano
42rd Annual Conference of the Robotics Society of Japan (RSJ) 2024; Undergraduate Thesis
Paper
TLDR: To be able to perform dexterous manipulation, slip detection is a crucial skill for robot hands. In this study, we perform a comparison of different slip detection algorithms using a distributed tri-axial tactile sensor. We evaluate different sets of tactile features to investigate the importance of distributed 3-axis measurements for slip recognition.
|
Outreach
I am a product of numerous people (family, friends, professors, mentors) who have helped and supported me along the way.
To pay it forward, please feel free to reach out about research, master's applications, life in Japan, or anything in general.
|
About Me Personally
• I was born and raised in Indonesia. If you are an Indonesian working on robotics and machine learning, let's connect!
• In my spare time, I enjoy exploring coffee shops in Tokyo, consuming anime/manga, and anything sci-fi/fantasy.
• When not locked in at my lab desk, I enjoy doing a whole lot of physical activities. I am "best" at basketball and currently pursuing snowboarding (though I still suck at it).
• My life has been deeply inspired by three things that I am overly invested in: Star Wars, Stephen Curry, and Haikyuu.
• Making it my mission to go to the US next year (2026) to watch Steph Curry play in person before he retires :)
|
This awesome template was adapted from Jon Barron!
|
|