Senior Prompt and Benchmark Engineer, Evaluation of World Models
NVIDIA(1 month ago)
About this role
A position on NVIDIA’s Cosmos generative AI engineering team focused on evaluation of world foundation models for video, simulation, and physical environments. The role centers on designing domain-specific benchmarks and scalable evaluation methodologies to assess generative and understanding models. It collaborates closely with researchers, annotators, and domain experts to translate evaluation needs into standardized test cases.
Required Skills
- Prompt Engineering
- Benchmark Design
- Evaluation
- Multimodal Models
- Vision-Language Models
- Annotation Workflows
- Ensemble Methods
- Collaboration
- Model Analysis
- Communication
Qualifications
- BS
- MS
- Equivalent Background
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
Finance Expert - Quantitative Trading
xAI(21 days ago)
Research Scientist Intern, Embodied Foundation Models (Evaluation)
Wayve(4 months ago)
Research Intern – Video World Models
Tencent(12 days ago)
Machine Learning Engineer - Evaluation
Wayve(7 months ago)
Staff AI Researcher, Foundation Models
Verily(2 months ago)
Engineering Manager, Machine Learning, Model Evaluations and Data Curation (AI Foundations)
Netflix(4 months ago)