CAPCO
Job ID · 13978

Data Engineer · Bengaluru

CAPCO · CAPCO TECHNOLOGIES PRIVATE LIMITED

Explore more opportunities at CAPCO. See all open roles.

Job highlights

Designation Offered

Data Engineer

Job Role

Data Engineer

Department

AI Machine Learning and Data Analytics

Job Type

Fulltime

Salary

28.00Lacs

Experience

12–16 years

Job Location

Bengaluru

Education

Bachelors/Undergraduate Degree

Posted by

CAPCO

Posted On

5 Dec 2025

Valid until

3 Feb 2026

Skillset required

Data VisualisationData WarehouseMongoDBMySQLPower BIPythonSQLTableauData CleaningData Architecture

Job Description for Data Engineer

A role-focused description with responsibilities, expectations, and qualifications for this opening.

AI Machine Learning and Data Analytics

Deliver

  • Data products (To-Be): Channel Ops Warehouse (~30-day high-perf layer) and Channel Analytics Lake (7+ yrs). Expose status and statements APIs with clear SLAs.

  • Platform architecture: S3/Glue/Athena/Iceberg lakehouse, Redshift for BI/ops. QuickSight for PO/ops dashboards. Lambda/Step Functions for stream processing orchestration.

  • Streaming & ingest: Kafka (K4/K5/Confluent) and AWS MSK/Kinesis; connectors/CDC to DW/Lake. Partitioning, retention, replay, idempotency. EventBridge for AWS-native event routing.

  • Event contracts: Avro/Protobuf, Schema Registry, compatibility rules, versioning strategy.

  • As-Is → To-Be: Inventory APIs/File/SWIFT feeds and stores (Aurora Postgres, Kafka). Define migration waves, cutover runbooks.

  • Governance & quality: Data-as-a-product ownership, lineage, access controls, quality rules, retention.

  • Observability & FinOps: Grafana/Prometheus/CloudWatch for TPS, success rate, lag, spend per 1M events. Runbooks + actionable alerts.

  • Scale & resilience: Tens of millions of payments/day, multi-AZ/region patterns, pragmatic RPO/RTO.

  • Security: Data classification, KMS encryption, tokenization where needed, least-privilege IAM, immutable audit.

  • Hands-on build: Python/Scala/SQL; Spark/Glue; Step Functions/Lambda; IaC (Terraform); CI/CD (GitLab/Jenkins); automated tests.

Must-Have Skills

  • Streaming & EDA: Kafka (Confluent) and AWS MSK/Kinesis/Kinesis Firehose; outbox, ordering, replay, exactly/at-least-once semantics. EventBridge for event routing and filtering.

  • Schema management: Avro/Protobuf + Schema Registry (compatibility, subject strategy, evolution).

  • AWS data stack: S3/Glue/Athena, Redshift, Step Functions, Lambda; Iceberg-ready lakehouse patterns. Kinesis→S3→Glue streaming pipelines; Glue Streaming; DLQ patterns.

  • Payments & ISO 20022: PAIN/PACS/CAMT, lifecycle modeling, reconciliation/advices; API/File/SWIFT channel knowledge.

  • Governance: Data-mesh mindset; ownership, quality SLAs, access, retention, lineage.

  • Observability & FinOps: Build dashboards, alerts, and cost KPIs; troubleshoot lag/throughput at scale.

  • Delivery: Production code, performance profiling, code reviews, automated tests, secure by design.

  • Data Architecture Fundamentals (Must-Have)

  • Logical data modeling: Entity-relationship diagrams, normalization (1NF through Boyce-Codd/BCNF), denormalization trade-offs; identify functional dependencies and key anomalies.

  • Physical data modeling: Table design, partitioning strategies, indexes; SCD types; dimensional vs. transactional schemas; storage patterns for OLTP vs. analytics.

  • Normalization & design: Normalize to 3NF/BCNF for OLTP; understand when to denormalize for queries; trade-offs between 3NF, Data Vault, and star schemas.

  • CQRS (Command Query Responsibility Segregation): Separate read/write models; event sourcing and state reconstruction; eventual consistency patterns; when CQRS is justified vs. overkill.

  • Event-Driven Architecture (EDA): Event-first design; aggregate boundaries and invariants; publish/subscribe patterns; saga orchestration; idempotency and at-least-once delivery.

  • Bounded contexts & domain modeling: Core/supporting/generic subdomains; context maps (anti-corruption layers, shared kernel, conformist, published language); ubiquitous language.

  • Entities, value objects & repositories: Domain entity identity; immutability for value objects; repository abstraction over persistence; temporal/versioned records.

  • Domain events & contracts: Schema versioning (Avro/Protobuf); backward/forward compatibility; event replay; mapping domain events to Kafka topics and Aurora tables.

Nice-to-Have

  • QuickSight/Tableau; Redshift tuning; ksqlDB/Flink; Aurora Postgres internals.

  • Edge/API constraints (Apigee/API-GW), mTLS/webhook patterns.

About this opening

CAPCO is hiring a Data Engineer in the AI Machine Learning and Data Analytics team based in Bengaluru.

This role is fulltime, work from office (wfo), 12–16 years experience, up to ₹28 lakh per year—matched against UnoJobs' verified employer data.

Skills evaluated for this opening include Data Visualisation, Data Warehouse, MongoDB, MySQL, Power BI, Python. Apply directly through UnoJobs to keep your application visible to CAPCO without bouncing across multiple sites.

Role
Data Engineer
Department
AI Machine Learning and Data Analytics
Location
Bengaluru
Work mode
Work from office (WFO)
Experience
12–16 years
Compensation
up to ₹28 lakh per year

Explore similar searches

Use these category pages to compare nearby roles, cities, and salary benchmarks before you apply.

More at CAPCO

Every opening, one place. No noise—just what's hiring now.

See open roles