Middle - Senior
Full-time
Hanoi
Negotiable
🚀 About Oraichain Labs
Oraichain Labs is a deep-tech company at the frontier of AI and Blockchain integration. As the core contributor to Oraichain – the world’s first AI Layer 1 – we are building infrastructure, tools, and products that power AI-enhanced smart contracts, DeFi applications, and next-generation Web3 systems.
At Oraichain, you’ll work with a tight-knit team of engineers, researchers, and product builders to shape how intelligent decentralized systems operate at scale.
We’re looking for a hands-on Data Engineer with solid experience in building data pipelines, modeling complex datasets, and using modern tools like Airflow, Kafka, and dbt. You won’t need to worry about system architecture or infrastructure – your core focus is on making data clean, fast, and useful across Oraichain's AI and blockchain products.
This role is ideal for someone who thrives in a product-focused team, enjoys working with real-world data, and wants to see their work directly power smart AI applications and decentralized systems.
Responsibilities
- Design and build batch & streaming data pipelines for AI, LLM, RAG systems, and Web3 products.
- Manage ETL/ELT workflows using Apache Airflow.
- Ingest and transform real-time data streams using Apache Kafka.
- Build and maintain modular, testable dbt models with clear documentation.
- Work with Data Warehouses (e.g., Snowflake, BigQuery, or Redshift) to support analytics and product insights.
- Collaborate with AI and product teams to provide structured, reliable data for downstream systems.
- Write clean, efficient code in Python and optimized SQL for scalable data processing.
Requirements
- 3+ years of experience as a Data Engineer or in a similar role.
- Strong skills in SQL, data modeling, and building scalable data workflows.
- Practical experience with Airflow, Kafka, and dbt.
- Familiarity with PostgreSQL, MongoDB, or equivalent RDBMS/NoSQL solutions.
- Solid Python programming skills for building custom data tools and automation.
- Experience working with modern Data Warehouse platforms.
Nice to Have
- Experience with vector databases (e.g., FAISS, Qdrant, Weaviate) for AI/LLM systems.
- Understanding of LLM pipelines, embeddings, or retrieval-based architectures (RAG).
- Experience in fintech, DeFi, or blockchain data pipelines.
- Familiarity with data quality tools and version-controlled data transformation.
Apply via hr@orai.io or contact your partner here:
Ms. Linh Nguyen Thuy | Oraichain Talent Acquisition Department
Tel: (+84) 358 652 819 | E: linh.nt@orai.io | LinkedIn: Linhthuyng22
The Oraichain Labs team is your partner in this process and is here to answer any questions you have along the way.