Technical Skills
Proficiency in Python (e.g., Pandas, NumPy, FastAPI)
Domain Knowledge
Experience in designing experiments (e.g., A/B testing) and interpreting statistical significance
Ability to understand and interpret domain- specific business problems and translate them into data science solutions
Experience with one of the following ETL tools (Airflow, DBT, Azure Data Factory, AWS Glue, SSIS)
Experienced in report modeling and building dashboards with BI tools such as Power BI, Tableau, or Looker
Experience with one of the following cloud data platforms (e.g., Databricks, Microsoft Fabric, Amazon Redshift, Google BigQuery, Snowflake)
Experience with one of the following domains such as media, banking, finance, healthcare, retail, manufacturing, or insurance
Good foundation in machine learning concepts and deep learning (e.g., CNNs, RNNs, Transformers)
Solid knowledge and hands- on experience with building training/inferencing pipeline using machine learning algorithms such as classification, regression, clustering, time series forecasting, and anomaly detection using frameworks like Scikit- learn, PyTorch, and TensorFlow
Strong foundation in probability, statistics, optimization, and linear algebra
Ability to apply math- driven thinking to AI/ML model design
English Requirements
Math & Statistical Knowledge
Good communication skills in English, both written and verbal
Strong experience with SQL and working with relational and NoSQL databases
Familiarity with big data technologies (e.g., Spark, Databricks, Delta Lake, Kafka)
Soft Skills
Collaborative, agile mindset with a self- starter attitude
Strong problem- solving and critical thinking abilities
Ability to clearly communicate technical findings to non- technical stakeholders
Comfortable working in fast- paced, cross- functional teams
Nice to have
Awareness or working knowledge of LLMs (e.g., OpenAI models, Claude, LLaMA, Mistral)
Familiarity with GenAI use cases such as building AI agents, chatbots, MCP, ...
Ability to integrate GenAI APIs (OpenAI, Azure OpenAI, Hugging Face) into applications
Exposure to GenAI frameworks such as LangChain, Semantic Kernel, or Autogen
Experience with MLOps tools (e.g., MLflow, SageMaker, Azure ML, Vertex AI)
Understanding of prompt engineering, embeddings, and concepts like RAG, GraphRAG, Agentic AI