← Back to history

Pipeline run

79cc0c87-274e-478c-9d46-cddb2391610e

Pipeline LLM cost (USD)
API 1: $0.0031 API 2: $0.0002 API 3: $0.0000 Total: $0.0033

Client output enrichment

v2 Skill cluster · Nature of work · AI index · Tech stack maturity · Evidence · KRA description
role baseline loaded sources · ai_index: jd · nature_of_work: jd · tech_stack_maturity: jd
Nature of work · Data pipeline development
Build and run reliable ETL/ELT pipelines moving data into warehouses and real-time systems, monitor SLA performance, and work with analysts/scientists to shape internal data products and investigate data issues.
"Design, construct, and deploy highly efficient and reliable data pipelines"
Tech stack maturity
Mainstream Modern
AI index (0 = no AI use, 5 = totally AI-dependent · v2.1)
0.00 / 5
· Title match
· Has AI skill
· AI skill (primary)
· AI skill (secondary)
· On AI team
· Builds AI products
vocab breakdown (legacy)
Assistants (×1):
Frameworks (×2):
Models / concepts (×3):
Evidence — skills matched in JD (5)
ETL ELT Data Warehouses Real-time systems Data models
Skill cluster (1 dimension groups, role-scoped)
Cross-cutting / unaligned
ETL ELT Data Warehouses Real-time systems Data models
Show KRA description ↓
• Design, construct, and deploy highly efficient and reliable data pipelines that seamlessly transfer data across various platforms, including Data Warehouses and real-time systems. • Develop deep expertise in these data pipelines and manage their Service Level Agreements (SLAs) to ensure optimal performance. • Collaborate with Data Analysts, Data Scientists and business stakeholders to create internal data products aimed at boosting operational efficiencies across the organization. • 2 to 4 years of experience including internship with a technology company. • Bachelors or Masters degree or equivalent experience in Information Technology or Computer Science. • High level understanding of data models and ETL/ELT fundamentals. • Structured thinking, and know how to slice and dice the data to investigate issues. • Communicate clearly, especially to technical audience effectively, both verbally and in writing.

Signals

Skill
Alias data-engineer
1.00
KRA data-engineer
0.66

Post-classification

Centroidupdated · n=232
Alias collision log
New-role queue
New skills captured5
New KRA captured

Captured for admin review

ETL primary Data Engineer pending
ELT primary Data Engineer pending
Data Warehouses primary Data Engineer pending
Real-time systems primary Data Engineer pending
Data models primary Data Engineer pending
Status: completed Created: 2026-05-27T14:53:47.492831Z Updated: 2026-06-12T17:06:58.977176Z API 3 duration: 2968 ms
Flow Current 3-step pipeline

1 POST /skills/extract-from-jd

2 POST /skills/extract-details

3 POST /skills/final-role-output

Role Chosen role & resolution

Data Engineer

CASE A

slug: data-engineer · id: 2 · source: db

Exact alias hit on data-engineer (1.0) — no other alias at this confidence; skill_top absent does not contradict

Resolution: in_db — role exists in library; skill↔dim and role↔dim links saved when applicable.

0
New skills
0
Skill↔dim saved
0
Role↔dim saved
0
Skipped

Job description

With Confluent, organisations can harness the full power of continuously flowing data to innovate and win in the modern digital world. We have a purpose that drives us to do better everyday – we're creating an entirely new category within data infrastructure - data streaming. This technology will allow every organisation to create experiences and use the power of data in ways that profoundly impact the way we all live. This impact is our purpose and drives us to do better every day.

One Confluent. One team. One Data Streaming Platform.

Data Connects Us.

About The Role

As a Data Engineer in the Data team you will take on big data challenges in an agile way. You will build data pipelines that enable data scientists, operation teams, and stakeholders across the wider business to make data accessible to the entire company. You will also build data models to deliver insightful analytics while ensuring the highest standard in data integrity. You are encouraged to think out of the box and utilize the latest technologies. Successful candidates will have strong technical capabilities, a can-do attitude, and are highly collaborative.

What You Will Do

• Design, construct, and deploy highly efficient and reliable data pipelines that seamlessly transfer data across various platforms, including Data Warehouses and real-time systems.
• Develop deep expertise in these data pipelines and manage their Service Level Agreements (SLAs) to ensure optimal performance.
• Collaborate with Data Analysts, Data Scientists and business stakeholders to create internal data products aimed at boosting operational efficiencies across the organization.


What You Will Bring

• 2 to 4 years of experience including internship with a technology company.
• Bachelors or Masters degree or equivalent experience in Information Technology or Computer Science.
• High level understanding of data models and ETL/ELT fundamentals.
• Structured thinking, and know how to slice and dice the data to investigate issues.
• Communicate clearly, especially to technical audience effectively, both verbally and in writing.


Come As You Are

At Confluent, equality is a core tenet of our culture. We are committed to building an inclusive global team that represents a variety of backgrounds, perspectives, beliefs, and experiences. The more diverse we are, the richer our community and the broader our impact. Employment decisions are made on the basis of job-related criteria without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or any other classification protected by applicable law.

Click HERE to review our Candidate Privacy Notice which describes how and when Confluent, Inc., and its group companies, collects, uses, and shares certain personal information of California job applicants and prospective employees.

Skills from this JD

Each row merges API 1 extraction, API 2 library match / v3 orchestration (dimensions + locked dims), and API 3 persistence tags.

ETL Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields
Category
Data Engineering Tools
Sub-category
general
Skill nature
PRACTICE
Volatility
MEDIUM
Typical lifespan
MULTI_YEAR
Version strategy
UNVERSIONED
ELT Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields
Category
Data Engineering Tools
Sub-category
general
Skill nature
PRACTICE
Volatility
MEDIUM
Typical lifespan
MULTI_YEAR
Version strategy
UNVERSIONED
Data Warehouses Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields
Category
Databases
Sub-category
general
Skill nature
CONCEPT
Volatility
MEDIUM
Typical lifespan
MULTI_YEAR
Version strategy
UNVERSIONED
Real-time systems Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields
Category
Architectural Concepts
Sub-category
general
Skill nature
CONCEPT
Volatility
MEDIUM
Typical lifespan
MULTI_YEAR
Version strategy
UNVERSIONED
Data models Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields
Category
Data Engineering Tools
Sub-category
general
Skill nature
CONCEPT
Volatility
MEDIUM
Typical lifespan
MULTI_YEAR
Version strategy
UNVERSIONED

Library artifacts (this run)

Kind Detail DB id
canonical_skill_proposed ETL | type=Data Engineering Tools subtype=general nature=PRACTICE lifespan=MULTI_YEAR
canonical_skill_proposed ELT | type=Data Engineering Tools subtype=general nature=PRACTICE lifespan=MULTI_YEAR
canonical_skill_proposed Data Warehouses | type=Databases subtype=general nature=CONCEPT lifespan=MULTI_YEAR
canonical_skill_proposed Real-time systems | type=Architectural Concepts subtype=general nature=CONCEPT lifespan=MULTI_YEAR
canonical_skill_proposed Data models | type=Data Engineering Tools subtype=general nature=CONCEPT lifespan=MULTI_YEAR
nano JD Parser — gpt-4.1-nano click to toggle
RoleData Engineer
CompanyConfluent
Experience2 to 4 years of experience including internship
DomainSoftware & SaaS Products
JD type pass
Show raw JSON
{
  "JD_type": "pass",
  "about_company": {
    "source_marker": {
      "first_5_words": "With Confluent, organisations can harness",
      "last_5_words": "One Data Streaming Platform."
    },
    "text": "With Confluent, organisations can harness the full power of continuously flowing data to innovate and win in the modern digital world. We have a purpose that drives us to do better everyday \u2013 we\u0027re creating an entirely new category within data infrastructure - data streaming. This technology will allow every organisation to create experiences and use the power of data in ways that profoundly impact the way we all live. This impact is our purpose and drives us to do better every day.\n\nOne Confluent. One team. One Data Streaming Platform.\n\nData Connects Us.",
    "word_count": 84
  },
  "certifications": [],
  "company_name": "Confluent",
  "ctc": null,
  "domain": {
    "primary": {
      "aliases": [
        "SaaS",
        "Data Streaming"
      ],
      "domain": "Software \u0026 SaaS Products"
    },
    "secondary": null
  },
  "education": [
    {
      "level": "Bachelor\u0027s",
      "qualification": "BTECH/BE/BSC - Information Technology (or related)",
      "raw": "Bachelors or Masters degree or equivalent experience in Information Technology or Computer Science.",
      "requirement": "required"
    }
  ],
  "experience": {
    "max": 4,
    "min": 2,
    "raw": "2 to 4 years of experience including internship"
  },
  "job_locations": [],
  "role": "Data Engineer",
  "role_aliases": [
    "Data Engineer",
    "Big Data Engineer",
    "ETL Engineer"
  ],
  "role_archetype": "Data",
  "roles_and_responsibilities": [
    {
      "bullet_count": 3,
      "heading": "What You Will Do",
      "heading_was_present": true,
      "source_marker": {
        "first_5_words": "\u2022 Design, construct, and deploy",
        "last_5_words": "boosting operational efficiencies across the organization."
      },
      "text": "\u2022 Design, construct, and deploy highly efficient and reliable data pipelines that seamlessly transfer data across various platforms, including Data Warehouses and real-time systems.\n\u2022 Develop deep expertise in these data pipelines and manage their Service Level Agreements (SLAs) to ensure optimal performance.\n\u2022 Collaborate with Data Analysts, Data Scientists and business stakeholders to create internal data products aimed at boosting operational efficiencies across the organization.",
      "word_count": 54
    },
    {
      "bullet_count": 5,
      "heading": "What You Will Bring",
      "heading_was_present": true,
      "source_marker": {
        "first_5_words": "\u2022 2 to 4 years of",
        "last_5_words": "effectively, both verbally and in writing."
      },
      "text": "\u2022 2 to 4 years of experience including internship with a technology company.\n\u2022 Bachelors or Masters degree or equivalent experience in Information Technology or Computer Science.\n\u2022 High level understanding of data models and ETL/ELT fundamentals.\n\u2022 Structured thinking, and know how to slice and dice the data to investigate issues.\n\u2022 Communicate clearly, especially to technical audience effectively, both verbally and in writing.",
      "word_count": 66
    }
  ],
  "urls": []
}
API 1 — extract-from-jd click to toggle
{
  "final_skills": [
    {
      "is_primary": true,
      "skill_name": "ETL"
    },
    {
      "is_primary": true,
      "skill_name": "ELT"
    },
    {
      "is_primary": true,
      "skill_name": "Data Warehouses"
    },
    {
      "is_primary": true,
      "skill_name": "Real-time systems"
    },
    {
      "is_primary": true,
      "skill_name": "Data models"
    }
  ],
  "jd_role": {
    "display_name": "Data Engineer",
    "rationale": null,
    "role_aliases": [
      "Data Engineer",
      "Big Data Engineer",
      "ETL Engineer"
    ],
    "role_archetype": "Data",
    "slug": ""
  },
  "nano_parsed": {
    "JD_type": "pass",
    "about_company": {
      "source_marker": {
        "first_5_words": "With Confluent, organisations can harness",
        "last_5_words": "One Data Streaming Platform."
      },
      "text": "With Confluent, organisations can harness the full power of continuously flowing data to innovate and win in the modern digital world. We have a purpose that drives us to do better everyday \u2013 we\u0027re creating an entirely new category within data infrastructure - data streaming. This technology will allow every organisation to create experiences and use the power of data in ways that profoundly impact the way we all live. This impact is our purpose and drives us to do better every day.\n\nOne Confluent. One team. One Data Streaming Platform.\n\nData Connects Us.",
      "word_count": 84
    },
    "certifications": [],
    "company_name": "Confluent",
    "ctc": null,
    "domain": {
      "primary": {
        "aliases": [
          "SaaS",
          "Data Streaming"
        ],
        "domain": "Software \u0026 SaaS Products"
      },
      "secondary": null
    },
    "education": [
      {
        "level": "Bachelor\u0027s",
        "qualification": "BTECH/BE/BSC - Information Technology (or related)",
        "raw": "Bachelors or Masters degree or equivalent experience in Information Technology or Computer Science.",
        "requirement": "required"
      }
    ],
    "experience": {
      "max": 4,
      "min": 2,
      "raw": "2 to 4 years of experience including internship"
    },
    "job_locations": [],
    "role": "Data Engineer",
    "role_aliases": [
      "Data Engineer",
      "Big Data Engineer",
      "ETL Engineer"
    ],
    "role_archetype": "Data",
    "roles_and_responsibilities": [
      {
        "bullet_count": 3,
        "heading": "What You Will Do",
        "heading_was_present": true,
        "source_marker": {
          "first_5_words": "\u2022 Design, construct, and deploy",
          "last_5_words": "boosting operational efficiencies across the organization."
        },
        "text": "\u2022 Design, construct, and deploy highly efficient and reliable data pipelines that seamlessly transfer data across various platforms, including Data Warehouses and real-time systems.\n\u2022 Develop deep expertise in these data pipelines and manage their Service Level Agreements (SLAs) to ensure optimal performance.\n\u2022 Collaborate with Data Analysts, Data Scientists and business stakeholders to create internal data products aimed at boosting operational efficiencies across the organization.",
        "word_count": 54
      },
      {
        "bullet_count": 5,
        "heading": "What You Will Bring",
        "heading_was_present": true,
        "source_marker": {
          "first_5_words": "\u2022 2 to 4 years of",
          "last_5_words": "effectively, both verbally and in writing."
        },
        "text": "\u2022 2 to 4 years of experience including internship with a technology company.\n\u2022 Bachelors or Masters degree or equivalent experience in Information Technology or Computer Science.\n\u2022 High level understanding of data models and ETL/ELT fundamentals.\n\u2022 Structured thinking, and know how to slice and dice the data to investigate issues.\n\u2022 Communicate clearly, especially to technical audience effectively, both verbally and in writing.",
        "word_count": 66
      }
    ],
    "urls": []
  },
  "rejected": false,
  "rejection_reason": null,
  "run_id": "79cc0c87-274e-478c-9d46-cddb2391610e",
  "stage3_signals": {
    "alias_found": true,
    "alias_match_roles": [
      {
        "display_name": "Data Engineer",
        "kra_matches": null,
        "matched_count": null,
        "matched_skills": null,
        "role_id": 2,
        "score": 1.0,
        "slug": "data-engineer",
        "total_count": null
      }
    ],
    "kra_match_roles": [
      {
        "display_name": "Data Engineer",
        "kra_matches": [
          {
            "kra_text": "Works with data analysts, data scientists, and business stakeholders to define data models, ingestion schedules, and data delivery requirements.",
            "sentence": "Collaborate with Data Analysts, Data Scientists and business stakeholders to create internal data products aimed at boosting operational efficiencies across the organization.",
            "similarity": 0.6956
          },
          {
            "kra_text": "Builds data ingestion pipelines to collect data from transactional databases, third-party APIs, event streams, and file sources into centralized data platforms.",
            "sentence": "Design, construct, and deploy highly efficient and reliable data pipelines that seamlessly transfer data across various platforms, including Data Warehouses and real-time systems.",
            "similarity": 0.666
          },
          {
            "kra_text": "Monitors pipeline health, SLA breach alerts, and job failure notifications, and performs root cause analysis for data pipeline incidents.",
            "sentence": "Develop deep expertise in these data pipelines and manage their Service Level Agreements (SLAs) to ensure optimal performance.",
            "similarity": 0.619
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 2,
        "score": 0.6602,
        "slug": "data-engineer",
        "total_count": null
      },
      {
        "display_name": "DevOps Engineer",
        "kra_matches": [
          {
            "kra_text": "Monitors CI/CD pipeline reliability, identifies bottlenecks in delivery workflows, and improves deployment frequency, lead time, and failure recovery rate.",
            "sentence": "Develop deep expertise in these data pipelines and manage their Service Level Agreements (SLAs) to ensure optimal performance.",
            "similarity": 0.524
          },
          {
            "kra_text": "Monitors CI/CD pipeline reliability, identifies bottlenecks in delivery workflows, and improves deployment frequency, lead time, and failure recovery rate.",
            "sentence": "Design, construct, and deploy highly efficient and reliable data pipelines that seamlessly transfer data across various platforms, including Data Warehouses and real-time systems.",
            "similarity": 0.4589
          },
          {
            "kra_text": "Collaborates with development teams to improve build processes, reduce deployment friction, containerize applications, and adopt DevOps best practices.",
            "sentence": "Collaborate with Data Analysts, Data Scientists and business stakeholders to create internal data products aimed at boosting operational efficiencies across the organization.",
            "similarity": 0.4448
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 10,
        "score": 0.4759,
        "slug": "devops-engineer",
        "total_count": null
      },
      {
        "display_name": "ML Engineer",
        "kra_matches": [
          {
            "kra_text": "Prepares, cleans, and transforms training datasets, manages feature stores, and builds feature engineering pipelines for model training.",
            "sentence": "Design, construct, and deploy highly efficient and reliable data pipelines that seamlessly transfer data across various platforms, including Data Warehouses and real-time systems.",
            "similarity": 0.4844
          },
          {
            "kra_text": "Prepares, cleans, and transforms training datasets, manages feature stores, and builds feature engineering pipelines for model training.",
            "sentence": "Develop deep expertise in these data pipelines and manage their Service Level Agreements (SLAs) to ensure optimal performance.",
            "similarity": 0.4656
          },
          {
            "kra_text": "Prepares, cleans, and transforms training datasets, manages feature stores, and builds feature engineering pipelines for model training.",
            "sentence": "High level understanding of data models and ETL/ELT fundamentals.",
            "similarity": 0.4508
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 3,
        "score": 0.467,
        "slug": "ml-engineer",
        "total_count": null
      },
      {
        "display_name": "Java Backend Developer",
        "kra_matches": [
          {
            "kra_text": "persistence and data modeling",
            "sentence": "High level understanding of data models and ETL/ELT fundamentals.",
            "similarity": 0.5006
          },
          {
            "kra_text": "persistence and data modeling",
            "sentence": "Design, construct, and deploy highly efficient and reliable data pipelines that seamlessly transfer data across various platforms, including Data Warehouses and real-time systems.",
            "similarity": 0.4312
          },
          {
            "kra_text": "persistence and data modeling",
            "sentence": "Structured thinking, and know how to slice and dice the data to investigate issues.",
            "similarity": 0.4208
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 79,
        "score": 0.4509,
        "slug": "java-backend-developer",
        "total_count": null
      },
      {
        "display_name": "MLOps Engineer",
        "kra_matches": [
          {
            "kra_text": "Automates ML platform operations including scheduled retraining triggers, pipeline orchestration, evaluation workflows, and alerting configuration.",
            "sentence": "Develop deep expertise in these data pipelines and manage their Service Level Agreements (SLAs) to ensure optimal performance.",
            "similarity": 0.4778
          },
          {
            "kra_text": "Automates ML platform operations including scheduled retraining triggers, pipeline orchestration, evaluation workflows, and alerting configuration.",
            "sentence": "Design, construct, and deploy highly efficient and reliable data pipelines that seamlessly transfer data across various platforms, including Data Warehouses and real-time systems.",
            "similarity": 0.4576
          },
          {
            "kra_text": "Validates model performance benchmarks, data schema contracts, and system integration health before signing off on production release readiness.",
            "sentence": "High level understanding of data models and ETL/ELT fundamentals.",
            "similarity": 0.3909
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 16,
        "score": 0.4421,
        "slug": "ml-ops-engineer",
        "total_count": null
      }
    ],
    "skill_match_roles": []
  },
  "stage4_decision": {
    "alias_collision_detected": false,
    "case": "A",
    "chosen_role": {
      "display_name": "Data Engineer",
      "kra_matches": null,
      "matched_count": null,
      "matched_skills": null,
      "role_id": 2,
      "score": 1.0,
      "slug": "data-engineer",
      "total_count": null
    },
    "confidence": 1.0,
    "is_new_role": false,
    "llm2_fired": false,
    "llm2_reasoning": null,
    "matched_dimensions": [],
    "matched_kras": [],
    "matched_skills": [],
    "new_role_display_name": null,
    "new_role_slug": null,
    "queued": false,
    "reasoning": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
    "sub_role": null
  },
  "stage5_updates": {
    "centroid_n_after": 232,
    "centroid_updated": true,
    "collision_log_id": null,
    "new_kra_attached": null,
    "new_skills_attached": [
      {
        "is_primary": true,
        "queue_id": 11597,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "ETL",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 11598,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "ELT",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 11599,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Data Warehouses",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 11600,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Real-time systems",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 11601,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Data models",
        "status": "pending"
      }
    ],
    "queue_entry_id": null,
    "v3_pipeline_triggered": false,
    "v3_role_slug": null,
    "v3_run_id": null
  }
}
API 2 — extract-details
{
  "alias_matches": [],
  "candidate_roles": [],
  "chosen_role": {
    "display_name": "Data Engineer",
    "id": 2,
    "rationale": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
    "role_archetype": null,
    "slug": "data-engineer",
    "source": "db"
  },
  "dimensions": [],
  "input_final_skills": [
    "ETL",
    "ELT",
    "Data Warehouses",
    "Real-time systems",
    "Data models"
  ],
  "input_llm_skills": [
    "ETL",
    "ELT",
    "Data Warehouses",
    "Real-time systems",
    "Data models"
  ],
  "new_aliases_persisted": 0,
  "run_id": "79cc0c87-274e-478c-9d46-cddb2391610e",
  "skills_detail": [
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "ETL",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Data Engineering Tools",
          "skill_nature": "PRACTICE",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "etl",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "ELT",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Data Engineering Tools",
          "skill_nature": "PRACTICE",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "elt",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Data Warehouses",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Databases",
          "skill_nature": "CONCEPT",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "data-warehouses",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Real-time systems",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Architectural Concepts",
          "skill_nature": "CONCEPT",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "real-time-systems",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Data models",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Data Engineering Tools",
          "skill_nature": "CONCEPT",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "data-models",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    }
  ],
  "unmatched_skills": [
    "ETL",
    "ELT",
    "Data Warehouses",
    "Real-time systems",
    "Data models"
  ]
}
API 3 — final-role-output
{
  "chosen_role": {
    "display_name": "Data Engineer",
    "id": 2,
    "rationale": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
    "role_archetype": null,
    "slug": "data-engineer",
    "source": "db"
  },
  "chosen_role_resolution": "in_db",
  "final_input_skills": [
    {
      "skill": "ETL",
      "tag": "new"
    },
    {
      "skill": "ELT",
      "tag": "new"
    },
    {
      "skill": "Data Warehouses",
      "tag": "new"
    },
    {
      "skill": "Real-time systems",
      "tag": "new"
    },
    {
      "skill": "Data models",
      "tag": "new"
    }
  ],
  "llm_cost_api1_usd": null,
  "llm_cost_api2_usd": null,
  "llm_cost_api3_usd": null,
  "llm_cost_total_usd": null,
  "persistence": {
    "items": [],
    "new_skills_created": 0,
    "role_dimension_saved": 0,
    "skill_dimension_saved": 0,
    "skipped": 0
  },
  "planner_output": null,
  "run_id": "79cc0c87-274e-478c-9d46-cddb2391610e"
}

LLM Calls

Every model call made for this run, in pipeline order. Click a card to see the model's response.

Loading…