Big Data Engineer

2 weeks ago


Philippines ActiveFence Full time

  • Intelligence
  • Philippines
  • Associate
  • Full-time

Description

We are seeking a Senior Big Data Engineer with a strong background in managing structured and unstructured data pipelines who thrives in a fast-paced, AI-focused environment. You will be instrumental in building and scaling our data lake architecture, which supports a system designed to fuel intelligent AI agents for data collection, labeling, and analytical reasoning. The role includes integrating vector databases and optimizing retrieval-augmented generation (RAG) workflows deployed on AWS Bedrock and other AI stacks.

Responsibilities

  • Design and implement scalable ingestion pipelines for structured/unstructured data using AWS and Databricks Unity Catalog.
  • Build and maintain high-throughput ETL/ELT pipelines with Apache Airflow and Databricks.
  • Architect and manage data modeling, storage, and indexing strategies in PostgreSQL and RDS, ensuring compatibility with AI retrieval systems.
  • Integrate and manage vector databases to support fast semantic and embedding-based search in RAG pipelines.
  • Collaborate with AI engineers to ensure seamless compatibility with LangGraph and LangSmith agent systems.
  • Implement robust data validation, lineage, and governance systems using Unity Catalog.
  • Optimize performance across distributed compute environments (Databricks, EC2).
  • Deploy and maintain Lambda-based microservices for scalable, real-time data ingestion and enrichment.

Requirements

  • 5+ years working with big data systems in production environments.
  • Proven expertise with Databricks, Unity Catalog, and Apache Spark.
  • Proficiency in Airflow, AWS stack (Lambda, EC2, RDS), and cloud-based data lake architectures.
  • Strong SQL and database design skills (PostgreSQL preferred).

Preferred Qualifications

  • Experience with AI agent pipelines or large-scale ML model support.
  • Working knowledge of vector databases (Chroma, Pinecone, FAISS).
  • Solid understanding of data lifecycle management in ML/AI contexts.
  • Bonus: Familiarity with LangGraph, LangSmith, LangChain, or similar agent orchestration tools.
  • Emphasis on data observability, security, and lineage tracking.
  • Hands-on experience with RAG architecture, including vector storage and semantic retrieval.
  • Exposure to AWS Bedrock and model deployment orchestration.

About ActiveFence

ActiveFence is the leading provider of security and safety solutions for online experiences, safeguarding more than 3 billion users, top foundation models, and the world’s largest enterprises and tech platforms every day.

As a trusted ally to major technology firms and Fortune 500 brands that build user-generated and GenAI products, ActiveFence empowers security, AI, and policy teams with low-latency Real-Time Guardrails and a continuous Red Teaming program that pressure-tests systems with adversarial prompts and emerging threat techniques. Powered by deep threat intelligence, unmatched harmful-content detection, and coverage of 117+ languages, ActiveFence enables organizations to deliver engaging and trustworthy experiences at global scale while operating safely and responsibly across all threat landscapes.
