About the Team
Our team handles the collection, storage, and processing of large datasets from autonomous vehicles and delivery robots. This includes sensor data from cameras, lidars, radars, and other sensors. Scaling reliable storage and providing efficient compute tools is essential for teams working with this data, including ML, simulation, and algorithm development teams. Our data processing involves specialized algorithms similar to those running directly on autonomous vehicles.
About the Role
We are looking for a Technical Lead to define and build a scalable, user-friendly compute platform for Simulation and ML teams (Perception, Prediction, Planner). You will set the strategic vision for data processing infrastructure, ensuring its scalability, robustness, and alignment with team requirements. You'll make key decisions, prototype solutions, write code, improve frameworks, and provide clear documentation and examples to support adoption.
What You'll Do
- Closely collaborate with internal teams (Simulation and ML, including Perception, Prediction, and Planner) to deeply understand their requirements and use cases.
- Evaluate, integrate, and enhance open-source (Apache Spark, Ray, Apache Beam, Argo) and internal tools.
- Prototype, develop, and refine scalable compute solutions by writing code and validating them in practical scenarios.
- Lead and coordinate infrastructure engineering teams.
- Define and communicate the technical vision clearly, providing mentorship and creating documentation to support platform adoption.
What You'll Need
- Strong proficiency in Python (required); C++ experience is highly desirable.
- Hands-on experience with distributed computing frameworks (Apache Spark, Ray, Apache Beam, Apache Airflow, Argo Workflows, Kubernetes).
- Proven leadership in architecture design and development of data processing infrastructures.
- Excellent communication and collaboration skills.
Nice to Have
- Understanding of ML/AI and experience building ML pipelines.
- Experience with sensor data (camera, lidar, radar) from autonomous vehicles or robotics.
- Experience optimizing costs and performance of distributed computing systems.
Candidates are required to be authorized to work in the U.S. The employer is not offering relocation sponsorship, and remote work options are not available.