About Menlo
Menlo Research is an Applied R&D lab building Asimov, an open-source humanoid robot platform, and the full software stack that powers it. Our mission is to make humanoid labor economically viable -- turning software into physical labor at scale. We build across the full stack: hardware architecture, locomotion, autonomy, simulation, and infrastructure. We move fast, ship to real robots, and open-source everything we can. If you want your work to matter beyond a paper or a demo, this is the place.
The Role
We are building the sensory substrate that lets Asimov understand its environment. As a Robotics Researcher in Perception and Vision, you will own the pipeline from raw sensor data through object detection, 3D scene understanding, and semantic representation -- producing the outputs that downstream planning and manipulation systems depend on. Your models run on the robot, in real time, in the real world. Closing the sim-to-real gap is not someone else's problem; it is core to this role.
What You Will Do
Design, train, and deploy perception systems for object detection, segmentation, depth estimation, and 3D scene reconstruction
Build multi-modal pipelines that fuse RGB, depth, and inertial data into robust real-time representations
Develop and scale vision models that transfer reliably from Uranus to physical hardware
Optimize inference pipelines for performance constraints on embedded compute
Work closely with navigation and manipulation teams to ensure perception outputs meet downstream requirements
Drive systematic evaluation on hardware and iterate on failure modes
Contribute to open-source releases of perception models and tooling
What You Will Bring
Deep foundations in computer vision, 3D geometry, and deep learning
Hands-on experience building and deploying perception systems on physical robots or real-time embedded platforms
Proficiency in Python and C++; strong experience with PyTorch or JAX
Track record taking perception models from research prototype to deployed inference
Experience with sensor fusion across camera, depth, and inertial modalities
Practical instincts for understanding why models break in the real world
Nice to Have
Experience with vision-language models, open-vocabulary detection, or embodied scene understanding
Familiarity with NeRF, Gaussian splatting, or differentiable rendering approaches
Prior work on manipulation or mobile robotics perception
Publications at CVPR, ICCV, ECCV, CoRL, or equivalent venues
Why Join Menlo
This is applied robotics research with real stakes -- your code runs on a physical humanoid. We open-source aggressively, so your contributions reach the broader community. You will work alongside researchers and engineers across the full stack, in a team that values shipping over presenting. Competitive compensation and equity.
A Note on AI
You don't need deep AI expertise for every role, but we do expect everyone at Menlo to be intellectually curious, drawn to tinkering and discovery, and excited to use AI as a real collaborator in their work. For some roles, AI fluency is a core requirement. When that's the case, we'll say so explicitly in the qualifications. People who thrive here don't treat AI as a novelty. They use it to think better, and make their work easier for others to build on.
Equal Opportunity and Accommodations
We hire talented people from a wide range of backgrounds. If you're excited about a role but don't meet every bullet, we still encourage you to apply. Menlo Research is an equal opportunity employer and does not discriminate on the basis of any legally protected characteristic. Menlo provides reasonable accommodations during the application process. If you need one, please let your recruiter know.