Neural Network Optimization Engineer
Recraft (2 months ago)
About this role
A Neural Network Optimization Engineer at Recraft is responsible for improving the performance of neural network inference workflows, reducing latency and increasing throughput. The role involves optimizing models with techniques such as quantization, using tools like TensorRT and Triton, and working closely with machine learning researchers to ensure efficient production deployments. The engineer also benchmarks and analyzes performance across hardware platforms and keeps up with advances in model optimization.
Required Skills
- Neural Network Optimization
- Inference Performance
- Latency Reduction
- Model Quantization
- Benchmarking
- Collaboration
- TensorRT
- Triton Language
- CUDA Programming
- Python
About Recraft
www.recraft.ai
Recraft is a design platform that uses AI to support designers, creatives, sellers, and teams. It features a top-ranked text-to-image model that generates photorealistic images and vector graphics, along with customizable styles and mockups. Recraft aims to streamline the creative process with tools that enhance artistic expression and productivity.