Data Engineer/Scientist for ML
Samsung Research America (8 days ago)
About this role
This role involves managing the full lifecycle of training data used in AI models, focusing on the curation, processing, and management of speech and text datasets for advanced ASR, TTS, and MT systems. The position requires designing data pipelines, extracting unstructured data, and collaborating with machine learning teams to optimize data quality for model training.
Required Skills
- Python
- Pandas
- NumPy
- Librosa
- Torchaudio
- SQL
- NoSQL
- Data Cleaning
- Data Pipelines
- NLP
About Samsung Research America
samsung.com
Samsung is a global technology company that designs and sells consumer electronics, mobile devices, TVs, home appliances, and related services. In the U.S. it offers Galaxy smartphones and tablets, QLED/OLED and smart TVs, laptops and monitors, plus connected home appliances and IoT integration via SmartThings. Samsung pairs hardware innovation (displays, camera and chipset technology) with software and services to deliver integrated experiences across devices. It sells direct-to-consumer through its website and retail partners while also supplying components and enterprise solutions to industry customers.