
Big Data Engineer
2 weeks ago
- Intelligence
- Philippines
- Associate
- Full-time
Description
We are seeking a Senior Big Data Engineer with a strong background in managing structured and unstructured data pipelines who thrives in a fast-paced, AI-focused environment. You will be instrumental in building and scaling our data lake architecture, supporting a system designed to fuel intelligent AI agents for data collection, labeling, and analytical reasoning. This includes integrating vector databases and optimizing for retrieval-augmented generation (RAG) workflows deployed on AWS Bedrock and other AI stacks.
Responsibilities
- Design and implement scalable ingestion pipelines for structured/unstructured data using AWS and Databricks Unity Catalog.
- Build and maintain high-throughput ETL/ELT pipelines with Apache Airflow and Databricks.
- Architect and manage data modeling, storage, and indexing strategies in PostgreSQL and RDS, ensuring compatibility with AI retrieval systems.
- Integrate and manage vector databases to support fast semantic and embedding-based search in RAG pipelines.
- Collaborate with AI engineers to ensure seamless compatibility with LangGraph and LangSmith agent systems.
- Implement robust data validation, lineage, and governance systems using Unity Catalog.
- Optimize performance across distributed compute environments (Databricks, EC2).
- Deploy and maintain Lambda-based microservices for scalable, real-time data ingestion and enrichment.
Requirements
- 5+ years working with big data systems in production environments.
- Proven expertise with Databricks, Unity Catalog, and Apache Spark.
- Proficiency in Airflow, AWS stack (Lambda, EC2, RDS), and cloud-based data lake architectures.
- Strong SQL and database design skills (PostgreSQL preferred).
Preferred Qualifications
- Experience with AI agent pipelines or large-scale ML model support.
- Working knowledge of vector databases (Chroma, Pinecone, FAISS).
- Solid understanding of data lifecycle management in ML/AI contexts.
- Bonus: Familiarity with LangGraph, LangSmith, LangChain, or similar agent orchestration tools.
- Emphasis on data observability, security, and lineage tracking.
- Hands-on experience with RAG architecture, including vector storage and semantic retrieval.
- Exposure to AWS Bedrock and model deployment orchestration.
About ActiveFence
ActiveFence is the leading provider of security and safety solutions for online experiences, safeguarding more than 3 billion users, top foundation models, and the world’s largest enterprises and tech platforms every day.
As a trusted ally to major technology firms and Fortune 500 brands that build user-generated and GenAI products, ActiveFence empowers security, AI, and policy teams with low-latency Real-Time Guardrails and a continuous Red Teaming program that pressure-tests systems with adversarial prompts and emerging threat techniques. Powered by deep threat intelligence, unmatched harmful-content detection, and coverage of 117+ languages, ActiveFence enables organizations to deliver engaging and trustworthy experiences at global scale while operating safely and responsibly across all threat landscapes.
-
PH - Data Engineer
2 weeks ago
Philippines. Thinking Machines Data Science. Full time. Working at Thinking Machines: Thinking Machines is a technology consultancy building AI & data platforms to solve high-impact problems for our clients. Our vision is a future where data-driven decision-making is a norm and where AI is used to support humans in making excellent decisions. To do that, we create data cultures, one organization at a time. We’re...
-
Data Engineer
1 day ago
Metro Manila, Philippines. HRTX. Full time. Overview: A Data Engineer designs, builds, and maintains scalable data systems and pipelines to collect, store, and process large volumes of data from various sources. They ensure data quality, security, and governance, and collaborate with other teams to make data analytics-ready. Qualifications: At least 1-3 years experience with Data Engineering/Processing...
-
Data Engineer
24 hours ago
Bulacan, Philippines. Motolite. Full time. Job Responsibilities: Design, build, and maintain scalable data pipelines to collect, process, and store structured and unstructured data. Develop and manage ETL (Extract, Transform, and Load) processes to prepare data for analytics and machine learning models. Perform web scraping and data mining to extract data from various online sources and APIs....
-
Senior Data Engineer
22 hours ago
Philippines. The VITO Group. Full time. Key Responsibilities: Design, build, and maintain cloud-native data infrastructure for the MRV system. Develop robust data pipelines, warehouses, and integrations for geospatial, climate, community, and biodiversity datasets. Ensure scalable, secure, and standards-compliant data operations for carbon accounting and environmental reporting. Collaborate with...
-
Data Engineers
2 weeks ago
Metro Manila, Philippines. Samsung Southeast Asia & Oceania. Full time. Position Summary: We are looking for world-class server software engineers with Big Data Engineering and Data Analysis experience to join our technology...
-
Data Engineer
2 weeks ago
Philippines. Tyler Technologies, Inc. Full time. The Data Engineer role within the Central Operations department is responsible for developing, deploying, and maintaining data solutions across the Courts and Justice (C&J) division. Primary responsibilities will include querying databases, maintaining data transformation pipelines through ETL tools and systems into data warehouses, provide and support data...
-
Data Engineer
24 hours ago
Philippines. Questronix Corporation. Full time. Responsibilities: Analyze and organize raw data; build data systems and pipelines; evaluate business needs and objectives; combine raw information from different sources; explore ways to enhance data quality and reliability; identify opportunities for data acquisition; develop analytical tools and programs. Qualifications: Strong analytical and planning skills; Good...
-
Senior Data Engineer
2 weeks ago
Agusan del Norte, Philippines. Aickman and Greene. Full time. Design, build, and maintain the data infrastructure that powers the MRV system; Develop robust data pipelines, manage cloud-based data warehouses, and ensure seamless integration of geospatial, climate, community, and biodiversity datasets; Enable scalable, secure, and standards-compliant data operations that support carbon accounting, biodiversity...
-
Data Engineer
1 day ago
Metro Manila, Philippines. Tenet Global Business Center, Inc. Full time. Work Arrangement: Night shift work schedule; Hybrid work setup. Benefits: 15% Night differential; 20 Paid Time Off (PTO) per year; Annual Appraisal; Annual Incentive; Hybrid Work Arrangement; HMO with FREE dependents; Group life insurance. The Data Engineer is primarily responsible for participating in the design, development, and implementation of...