Pipeline run

79cc0c87-274e-478c-9d46-cddb2391610e

Pipeline LLM cost (USD)

API 1: $0.0031 API 2: $0.0002 API 3: $0.0000 Total: $0.0033

Client output enrichment

v2 Skill cluster · Nature of work · AI index · Tech stack maturity · Evidence · KRA description

role baseline loaded sources · ai_index: jd · nature_of_work: jd · tech_stack_maturity: jd

Nature of work · Data pipeline development

Build and run reliable ETL/ELT pipelines moving data into warehouses and real-time systems, monitor SLA performance, and work with analysts/scientists to shape internal data products and investigate data issues.

"Design, construct, and deploy highly efficient and reliable data pipelines"

Tech stack maturity

Mainstream Modern

AI index (0 = no AI use, 5 = totally AI-dependent · v2.1)

0.00 / 5

· Title match

· Has AI skill

· AI skill (primary)

· AI skill (secondary)

· On AI team

· Builds AI products

vocab breakdown (legacy)

Assistants (×1): —

Frameworks (×2): —

Models / concepts (×3): —

Evidence — skills matched in JD (5)

ETL ELT Data Warehouses Real-time systems Data models

Skill cluster (1 dimension groups, role-scoped)

Cross-cutting / unaligned

ETL ELT Data Warehouses Real-time systems Data models

Show KRA description ↓

• Design, construct, and deploy highly efficient and reliable data pipelines that seamlessly transfer data across various platforms, including Data Warehouses and real-time systems. • Develop deep expertise in these data pipelines and manage their Service Level Agreements (SLAs) to ensure optimal performance. • Collaborate with Data Analysts, Data Scientists and business stakeholders to create internal data products aimed at boosting operational efficiencies across the organization. • 2 to 4 years of experience including internship with a technology company. • Bachelors or Masters degree or equivalent experience in Information Technology or Computer Science. • High level understanding of data models and ETL/ELT fundamentals. • Structured thinking, and know how to slice and dice the data to investigate issues. • Communicate clearly, especially to technical audience effectively, both verbally and in writing.

Signals

Skill —

—

Alias data-engineer

1.00

KRA data-engineer

0.66

Post-classification

Centroidupdated · n=232

Alias collision log—

New-role queue—

New skills captured5

New KRA captured—

Captured for admin review

ETL primary ↔ Data Engineer pending

ELT primary ↔ Data Engineer pending

Data Warehouses primary ↔ Data Engineer pending

Real-time systems primary ↔ Data Engineer pending

Data models primary ↔ Data Engineer pending

Status: completed Created: 2026-05-27T14:53:47.492831Z Updated: 2026-06-12T17:06:58.977176Z API 3 duration: 2968 ms

Flow Current 3-step pipeline

1 POST /skills/extract-from-jd

2 POST /skills/extract-details

3 POST /skills/final-role-output

Role Chosen role & resolution

Data Engineer

CASE A

slug: data-engineer · id: 2 · source: db

Exact alias hit on data-engineer (1.0) — no other alias at this confidence; skill_top absent does not contradict

Resolution: in_db — role exists in library; skill↔dim and role↔dim links saved when applicable.

New skills

Skill↔dim saved

Role↔dim saved

Skipped

Job description

With Confluent, organisations can harness the full power of continuously flowing data to innovate and win in the modern digital world. We have a purpose that drives us to do better everyday – we're creating an entirely new category within data infrastructure - data streaming. This technology will allow every organisation to create experiences and use the power of data in ways that profoundly impact the way we all live. This impact is our purpose and drives us to do better every day.

One Confluent. One team. One Data Streaming Platform.

Data Connects Us.

About The Role

As a Data Engineer in the Data team you will take on big data challenges in an agile way. You will build data pipelines that enable data scientists, operation teams, and stakeholders across the wider business to make data accessible to the entire company. You will also build data models to deliver insightful analytics while ensuring the highest standard in data integrity. You are encouraged to think out of the box and utilize the latest technologies. Successful candidates will have strong technical capabilities, a can-do attitude, and are highly collaborative.

What You Will Do

What You Will Bring

• 2 to 4 years of experience including internship with a technology company.
• Bachelors or Masters degree or equivalent experience in Information Technology or Computer Science.
• High level understanding of data models and ETL/ELT fundamentals.
• Structured thinking, and know how to slice and dice the data to investigate issues.
• Communicate clearly, especially to technical audience effectively, both verbally and in writing.

Come As You Are

At Confluent, equality is a core tenet of our culture. We are committed to building an inclusive global team that represents a variety of backgrounds, perspectives, beliefs, and experiences. The more diverse we are, the richer our community and the broader our impact. Employment decisions are made on the basis of job-related criteria without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or any other classification protected by applicable law.

Click HERE to review our Candidate Privacy Notice which describes how and when Confluent, Inc., and its group companies, collects, uses, and shares certain personal information of California job applicants and prospective employees.

Skills from this JD

Each row merges API 1 extraction, API 2 library match / v3 orchestration (dimensions + locked dims), and API 3 persistence tags.

ETL Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)