Machine Learning Engineer
Job Description
• Design and develop highly scalable, real-time systems using Hadoop ecosystem components (Iceberg, Spark, Ozone, Trino, Hive, Ranger, Kafka, Flink, and NiFi).
• Build robust data ingestion and transformation frameworks using Java, Spark, Python, and shell scripting for ingesting multimodal data (image, audio, video, unstructured documents) in both batch and real-time modes.
• Develop full-stack applications and internal engineering tools using Python, shell scripting, and modern web frameworks (e.g., Flask, React).
• Collaborate closely with data scientists to operationalize machine learning models using Cloudera Machine Learning (CML).
Mandatory skills
• Hadoop ecosystem (Spark, Hive, Kafka, Flink, NiFi, Iceberg, Trino)
• Java, Python, Spark (batch & real-time processing)
• Data ingestion & transformation frameworks
• Performance tuning on Hadoop platforms
• Shell scripting
• Real-time data processing systems
• ML model operationalization (CML / Spark ML)