Pipeline run

fbe0218e-4418-46a3-be37-7d2bfc5e421a

Pipeline LLM cost (USD)

API 1: $0.0095 API 2: $0.0007 API 3: $0.0000 Total: $0.0103

Client output enrichment

v2 Skill cluster · Nature of work · AI index · Tech stack maturity · Evidence · KRA description

Nature of work · Data Engineering / ETL

Builds, troubleshoots, and maintains Statistica workflows and data pipelines, including requirements/design, testing, documentation, deployments, and upgrades. Also supports incident/change/release management and works across Oracle, Windows, Linux shell scripting, and big-data tools like Spark/Hive/Kafka.

"Assemble large and complex data sets that meet functional / non-functional business requirements Building and optimizing data pipelines, architectures, and data sets."

Tech stack maturity

Mainstream Modern

The skill set centers on widely adopted big-data and streaming technologies like Spark, Hive, Kafka, and NoSQL, which are common in modern enterprise platforms but not inherently cloud-native or bleeding-edge.

AI index (0 = no AI use, 5 = totally AI-dependent · v2.1)

0.00 / 5

· Title match

· Has AI skill

· AI skill (primary)

· AI skill (secondary)

· On AI team

· Builds AI products

vocab breakdown (legacy)

Assistants (×1): —

Frameworks (×2): —

Models / concepts (×3): —

Evidence — skills matched in JD (25)

Statistica Oracle Windows MapReduce Hive Spark Oozie Kafka NoSQL Unix Linux Shell scripting Distributed computing Massive parallel processing OSI PI MES SAP LIMS ITSM Incident Management Problem Management Change Management Release Management Entity-relationship modeling KPI

Skill cluster (2 dimension groups, role-scoped)

Asynchronous Messaging and Event Streaming

Kafka

Cross-cutting / unaligned

Statistica Oracle Windows MapReduce Hive Spark Oozie NoSQL Unix Linux Shell scripting Distributed computing Massive parallel processing OSI PI MES SAP LIMS ITSM Incident Management Problem Management Change Management Release Management Entity-relationship modeling KPI

Show KRA description ↓

• Requirements: Min. 4-8 years of experience in STATISTICA Application. • Responsible for Design, develop, test, document and deliver the Business requirements related to Statistica Workflows. • Knowledgeable in Statistica Workspaces, Data Configurations, Analysis Configurations, Oracle Databases & Windows architecture. • Responsible for troubleshooting different existing STATISTICA Workflows. • Assemble large and complex data sets that meet functional / non-functional business requirements Building and optimizing data pipelines, architectures, and data sets. • Experience working with Big data Ecosystem like (MapReduce, Hive, Spark, Oozie, Kafka and Any NoSQL databases). • Experience in Unix/Linux shell scripting. • Experience working with large data sets in distributed computing to perform Massive parallel processing which Contributes to build analytics pipeline Responsible for Application deployment, upgrade, Enhancements, Hotfix implementations as and when required. • Responsible for delivery of various ITSM services like Incident Management, problem Management, Change and Release management etc. • Experienced in new implementation and upgrade projects from requirement gathering stage to qualification and final delivery Understanding of various Data interfaces OSI PI, MES, SAP, LIMS system. • Identify, design and implement internal process improvements. • Autonomous review and clarification of business requirements before implementation, creation, and documentation of technical design/entity-relationship including alignment with stakeholders • Deliver & Maintain Services adhering to KPI. • Able to work on shift basis. • Good business acumen with a developer mindset.

Signals

Skill data-engineer

0.14

Alias —

—

KRA data-engineer

0.61

Post-classification

Centroidupdated · n=1

Alias collision log—

New-role queue—

New skills captured21

New KRA capturedyes

Captured for admin review

Statistica primary ↔ ITSM Engineer pending

Oracle primary ↔ ITSM Engineer pending

Windows primary ↔ ITSM Engineer pending

MapReduce primary ↔ ITSM Engineer pending

Oozie primary ↔ ITSM Engineer pending

Unix primary ↔ ITSM Engineer pending

Linux primary ↔ ITSM Engineer pending

Shell scripting primary ↔ ITSM Engineer pending

Distributed computing primary ↔ ITSM Engineer pending

Massive parallel processing primary ↔ ITSM Engineer pending

OSI PI ↔ ITSM Engineer pending

MES ↔ ITSM Engineer pending

SAP ↔ ITSM Engineer pending

LIMS ↔ ITSM Engineer pending

ITSM ↔ ITSM Engineer pending

Incident Management ↔ ITSM Engineer pending

Problem Management ↔ ITSM Engineer pending

Change Management ↔ ITSM Engineer pending

Release Management ↔ ITSM Engineer pending

Entity-relationship modeling ↔ ITSM Engineer pending

KPI ↔ ITSM Engineer pending

R&R fragment (sim 0.00) ↔ ITSM Engineer pending

Status: completed Created: 2026-05-27T14:54:59.171676Z Updated: 2026-06-12T17:08:35.074126Z API 3 duration: 5594 ms

Flow Current 3-step pipeline

1 POST /skills/extract-from-jd

2 POST /skills/extract-details

3 POST /skills/final-role-output

Role Chosen role & resolution

ITSM Engineer

domain · ITSM & Automation CASE DOMAIN

slug: itsm-engineer · id: 206 · source: db

Domain=ITSM & Automation; The JD is centered on ITSM service delivery, incident/problem/change/release management, and maintaining services to KPI, which best matches an ITSM Engineer despite the Statistica technical focus.

Matched skills

STATISTICA ApplicationStatistica WorkflowsStatistica WorkspacesData ConfigurationsAnalysis ConfigurationsOracle DatabasesWindows architectureMapReduceHiveSparkOozieKafkaUnix/Linux shell scriptingNoSQL databasesOSI PI

Matched dimensions

ITSM service deliveryWorkflow troubleshootingData pipeline engineeringBig data and distributed computingApplication deployment and upgradesRequirements analysis and technical designProcess improvementIndustrial data interface integration

Matched KRAs

Design, develop, test, document and deliver the Business requirementsTroubleshooting different existing STATISTICA WorkflowsBuilding and optimizing data pipelines, architectures, and data setsResponsible for Application deployment, upgrade, Enhancements, Hotfix implementationsDelivery of various ITSM services like Incident Managementproblem Management, Change and Release managementIdentify, design and implement internal process improvementsDeliver & Maintain Services adhering to KPI

Resolution: in_db — role exists in library; skill↔dim and role↔dim links saved when applicable.

New skills

Skill↔dim saved

Role↔dim saved

Skipped

Job description

YASH Technologies is a leading technology integrator specializing in helping clients reimagine operating models, enhance competitiveness, optimize costs, foster exceptional stakeholder experiences, and drive business transformation.

At YASH, we’re a cluster of the brightest stars working with cutting-edge technologies. Our purpose is anchored in a single truth – bringing real positive changes in an increasingly virtual world and it drives us beyond generational gaps and disruptions of the future.

We are looking forward to hire Statistica Professionals in the following areas :

Job Description

Our Digital Service Line is currently looking for industry-leading seasoned "Statistica Technical Expert" with hands-on experience. The shortlisted candidate should have the ability to analyze technical needs and work with the customers to develop project scope of work documents and Project Plans.

The responsibilities are primarily technical, although there is a strong element of functional understanding of the business process.

Job Title: Statistica Technical Expert

• Requirements: Min. 4-8 years of experience in STATISTICA Application.
• Responsible for Design, develop, test, document and deliver the Business requirements related to Statistica Workflows.
• Knowledgeable in Statistica Workspaces, Data Configurations, Analysis Configurations, Oracle Databases & Windows architecture.
• Responsible for troubleshooting different existing STATISTICA Workflows.
• Assemble large and complex data sets that meet functional / non-functional business requirements Building and optimizing data pipelines, architectures, and data sets.
• Experience working with Big data Ecosystem like (MapReduce, Hive, Spark, Oozie, Kafka and Any NoSQL databases).
• Experience in Unix/Linux shell scripting.
• Experience working with large data sets in distributed computing to perform Massive parallel processing which Contributes to build analytics pipeline Responsible for Application deployment, upgrade, Enhancements, Hotfix implementations as and when required.
• Responsible for delivery of various ITSM services like Incident Management, problem Management, Change and Release management etc.
• Experienced in new implementation and upgrade projects from requirement gathering stage to qualification and final delivery Understanding of various Data interfaces OSI PI, MES, SAP, LIMS system.
• Identify, design and implement internal process improvements.
• Autonomous review and clarification of business requirements before implementation, creation, and documentation of technical design/entity-relationship including alignment with stakeholders
• Deliver & Maintain Services adhering to KPI.
• Able to work on shift basis.
• Good business acumen with a developer mindset.

At YASH, you are empowered to create a career that will take you to where you want to go while working in an inclusive team environment. We leverage career-oriented skilling models and optimize our collective intelligence aided with technology for continuous learning, unlearning, and relearning at a rapid pace and scale.

Our Hyperlearning workplace is grounded upon four principles

• Flexible work arrangements, Free spirit, and emotional positivity
• Agile self-determination, trust, transparency, and open collaboration
• All Support needed for the realization of business goals,
• Stable employment with a great atmosphere and ethical corporate culture

Skills from this JD

Each row merges API 1 extraction, API 2 library match / v3 orchestration (dimensions + locked dims), and API 3 persistence tags.

Statistica Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Data Engineering Tools
Sub-category: general
Skill nature: TOOL
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

Oracle Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Databases
Sub-category: general
Skill nature: TOOL
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

Windows Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Operating Systems
Sub-category: general
Skill nature: PLATFORM
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

MapReduce Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Concepts
Sub-category: general
Skill nature: CONCEPT
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

Hive Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Hive id=2754 · hive

Aliases — catalog

Hive (CANONICAL) primary

Context tags (catalog)

Apache Apache Hive Bucketing ETL HQL Hive Metastore Hive SerDe HiveQL MapReduce SQL SQL-on-Hadoop big data bucketing columnar storage data lakes data warehousing integration metadata partitioning schema evolution

Stored enrichment (catalog DB)

Category: Datastore
Sub-category: Local Key Value Store
Vendor: Apache Software Foundation
License: apache_2
Year introduced: 2010
Confidence: 0.90
Version strategy: NOT_APPLICABLE

Maturity reasoning: Hive appears in Flutter/mobile JDs and package docs, but JD volume is far below SQLite/Realm and it’s mainly used for local key-value storage in Flutter apps.

Skill profile (library / DB)

Skill nature: TOOL
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 3
Sub-category id: 2242
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Local Persistence and Offline Behavior Catalog dimension db id 85

Library dimension (catalog)

Roles linked in library: Android Developer, Flutter Developer, Hybrid Mobile Developer, Native Mobile Developer, React Native Developer, iOS Developer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Local Persistence and Offline Behavior local-persistence-and-offline-behavior	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Spark Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Apache Spark id=1350 · apache-spark

Aliases — catalog

Apache Spark (CANONICAL)
apache spark 3 (VERSION)
spark (VERSION)
spark 3 (VERSION)
spark 3.x (VERSION)
spark3 (VERSION)

Context tags (catalog)

Apache Kafka Cluster Manager DAGScheduler Data Lake DataFrame ETL Hadoop MLlib Machine Learning PySpark RDD Scala Spark SQL Spark Streaming SparkSession

Stored enrichment (catalog DB)

Category: Framework
Sub-category: Distributed Data Processing Framework
Vendor: Apache Software Foundation
License: apache_2
Year introduced: 2010
Confidence: 0.94
Version strategy: SEPARATE_ENTITY
Version tag: 3.x

Maturity reasoning: Apache Spark appears in many data engineering JDs and remains a standard for distributed ETL/ELT; its GitHub and vendor ecosystem activity stay strong, with Databricks and cloud platforms still promoting it.

Skill profile (library / DB)

Skill nature: FRAMEWORK
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 5
Sub-category id: 1021
Extractable: True
Also category: False

Dimensions (API 2 worklist)

ETL and ELT Tooling Catalog dimension db id 24

Library dimension (catalog)

Roles linked in library: Data Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
ETL and ELT Tooling etl-and-elt-tooling	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Oozie Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Data Engineering Tools
Sub-category: general
Skill nature: TOOL
Volatility: FAST
Typical lifespan: SHORT_LIVED
Version strategy: VERSIONED

Kafka Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Kafka id=36 · kafka

Aliases — catalog

Kafka (CANONICAL) primary

Context tags (catalog)

Apache Flink Apache Kafka Apache Pulsar Apache Spark Avro KSQL Kafka API Kafka Connect Kafka Streams ZooKeeper Zookeeper backpressure brokers consumer consumer group consumer groups event sourcing event-driven architecture exactly-once semantics fault tolerance high throughput log compaction message broker message queue microservices offsets partition partitioning partitions producer producer API real-time analytics real-time data replication schema registry stream processing topic topic partitioning topics

Stored enrichment (catalog DB)

Category: Datastore
Sub-category: Event Stream Store
Vendor: Confluent
License: apache_2
Year introduced: 2011
Confidence: 0.90
Version strategy: NOT_APPLICABLE

Maturity reasoning: Kafka appears in many production JDs for event streaming and data pipelines, and remains a standard platform in cloud/vendor offerings (e.g., Confluent, AWS MSK), indicating broad hiring demand.

Skill profile (library / DB)

Skill nature: TOOL
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 3
Sub-category id: 3533
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Asynchronous Messaging and Event Streaming Catalog dimension db id 297

Library dimension (catalog)

Roles linked in library: .NET Backend Developer, Go Backend Developer, Kotlin Backend Developer, Node.js Backend Developer, Scala Backend Developer
Messaging and Background Jobs Catalog dimension db id 291

Library dimension (catalog)

Roles linked in library: PHP Backend Developer, Python Backend Developer, Ruby Backend Developer
Messaging and Event Streaming Catalog dimension db id 8

Library dimension (catalog)

Roles linked in library: Backend Developer, Data Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Asynchronous Messaging and Event Streaming asynchronous-messaging-and-event-streaming	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Messaging and Background Jobs messaging-and-background-jobs	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Messaging and Event Streaming messaging-and-event-streaming	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

NoSQL Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: NoSQL id=1346 · nosql

Aliases — catalog

NoSQL (CANONICAL)

Context tags (catalog)

CAP theorem Cassandra DynamoDB MongoDB Redis column-family data modeling document store eventual consistency graph database horizontal scaling key-value store query language schema-less sharding

Stored enrichment (catalog DB)

Category: Concept
Sub-category: Database Paradigm
Confidence: 0.93
Version strategy: NOT_APPLICABLE

Maturity reasoning: NoSQL is broadly listed in job descriptions across backend/data roles, with MongoDB, DynamoDB, and Cassandra appearing as common market signals; it remains a hiring-pipeline staple rather than a niche or sunset tech.

Skill profile (library / DB)

Skill nature: CONCEPT
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 2
Sub-category id: 1019
Extractable: True
Also category: False

Dimensions (API 2 worklist)

NoSQL Databases Catalog dimension db id 19

Library dimension (catalog)

Roles linked in library: Backend Developer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
NoSQL Databases nosql-databases	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Unix Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Operating Systems
Sub-category: general
Skill nature: PLATFORM
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

Linux Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Operating Systems
Sub-category: general
Skill nature: PLATFORM
Volatility: STABLE
Typical lifespan: EVERGREEN
Version strategy: UNVERSIONED

Shell scripting Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Programming Languages
Sub-category: general
Skill nature: LANGUAGE
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

Distributed computing Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Concepts
Sub-category: general
Skill nature: CONCEPT
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

Massive parallel processing Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Concepts
Sub-category: general
Skill nature: CONCEPT
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

OSI PI Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Data Engineering Tools
Sub-category: general
Skill nature: TOOL
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

MES Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Applications
Sub-category: general
Skill nature: TOOL
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

SAP Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Applications
Sub-category: general
Skill nature: PLATFORM
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

LIMS Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Applications
Sub-category: general
Skill nature: PLATFORM
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

ITSM Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Concepts
Sub-category: general
Skill nature: CONCEPT
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

Incident Management Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Practices
Sub-category: general
Skill nature: PRACTICE
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

Problem Management Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Practices
Sub-category: general
Skill nature: PRACTICE
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

Change Management Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Practices
Sub-category: general
Skill nature: PRACTICE
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

Release Management Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Practices
Sub-category: general
Skill nature: PRACTICE
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

Entity-relationship modeling Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Concepts
Sub-category: general
Skill nature: CONCEPT
Volatility: STABLE
Typical lifespan: EVERGREEN
Version strategy: UNVERSIONED

KPI Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Concepts
Sub-category: general
Skill nature: CONCEPT
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

All API 3 persistence rows

Same grid as the skill-extractor “Persistence items” table: one row per (skill × dimension) work item.

Skill	Tag	Dimension	Skill↔dim	Role↔dim	Outcome
Hive	in_db	Local Persistence and Offline Behavior local-persistence-and-offline-behavior	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Spark	in_db	ETL and ELT Tooling etl-and-elt-tooling	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Kafka	in_db	Asynchronous Messaging and Event Streaming asynchronous-messaging-and-event-streaming	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Kafka	in_db	Messaging and Background Jobs messaging-and-background-jobs	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Kafka	in_db	Messaging and Event Streaming messaging-and-event-streaming	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
NoSQL	in_db	NoSQL Databases nosql-databases	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Library artifacts (this run)

Kind	Detail	DB id
canonical_skill_proposed	Statistica \| type=Data Engineering Tools subtype=general nature=TOOL lifespan=MULTI_YEAR
canonical_skill_proposed	Oracle \| type=Databases subtype=general nature=TOOL lifespan=MULTI_YEAR
canonical_skill_proposed	Windows \| type=Operating Systems subtype=general nature=PLATFORM lifespan=MULTI_YEAR
canonical_skill_proposed	MapReduce \| type=Concepts subtype=general nature=CONCEPT lifespan=MULTI_YEAR
canonical_skill_proposed	Oozie \| type=Data Engineering Tools subtype=general nature=TOOL lifespan=SHORT_LIVED
canonical_skill_proposed	Unix \| type=Operating Systems subtype=general nature=PLATFORM lifespan=MULTI_YEAR
canonical_skill_proposed	Linux \| type=Operating Systems subtype=general nature=PLATFORM lifespan=EVERGREEN
canonical_skill_proposed	Shell scripting \| type=Programming Languages subtype=general nature=LANGUAGE lifespan=MULTI_YEAR
canonical_skill_proposed	Distributed computing \| type=Concepts subtype=general nature=CONCEPT lifespan=MULTI_YEAR
canonical_skill_proposed	Massive parallel processing \| type=Concepts subtype=general nature=CONCEPT lifespan=MULTI_YEAR
canonical_skill_proposed	OSI PI \| type=Data Engineering Tools subtype=general nature=TOOL lifespan=MULTI_YEAR
canonical_skill_proposed	MES \| type=Applications subtype=general nature=TOOL lifespan=MULTI_YEAR
canonical_skill_proposed	SAP \| type=Applications subtype=general nature=PLATFORM lifespan=MULTI_YEAR
canonical_skill_proposed	LIMS \| type=Applications subtype=general nature=PLATFORM lifespan=MULTI_YEAR
canonical_skill_proposed	ITSM \| type=Concepts subtype=general nature=CONCEPT lifespan=MULTI_YEAR
canonical_skill_proposed	Incident Management \| type=Practices subtype=general nature=PRACTICE lifespan=MULTI_YEAR
canonical_skill_proposed	Problem Management \| type=Practices subtype=general nature=PRACTICE lifespan=MULTI_YEAR
canonical_skill_proposed	Change Management \| type=Practices subtype=general nature=PRACTICE lifespan=MULTI_YEAR
canonical_skill_proposed	Release Management \| type=Practices subtype=general nature=PRACTICE lifespan=MULTI_YEAR
canonical_skill_proposed	Entity-relationship modeling \| type=Concepts subtype=general nature=CONCEPT lifespan=EVERGREEN
canonical_skill_proposed	KPI \| type=Concepts subtype=general nature=CONCEPT lifespan=MULTI_YEAR

nano JD Parser — gpt-4.1-nano click to toggle

RoleStatistica Technical Expert

CompanyYASH Technologies

ExperienceMin. 4-8 years of experience in STATISTICA Application.

DomainIT Services & Consulting

JD type pass

Show raw JSON

{
  "JD_type": "pass",
  "about_company": {
    "source_marker": {
      "first_5_words": "YASH Technologies is a leading",
      "last_5_words": "and drive business transformation."
    },
    "text": "YASH Technologies is a leading technology integrator specializing in helping clients reimagine operating models, enhance competitiveness, optimize costs, foster exceptional stakeholder experiences, and drive business transformation.",
    "word_count": 40
  },
  "certifications": [],
  "company_name": "YASH Technologies",
  "ctc": null,
  "domain": {
    "primary": {
      "aliases": [
        "Technology Integration",
        "Business Transformation"
      ],
      "domain": "IT Services \u0026 Consulting"
    },
    "secondary": null
  },
  "education": [],
  "experience": {
    "max": 8,
    "min": 4,
    "raw": "Min. 4-8 years of experience in STATISTICA Application."
  },
  "job_locations": [],
  "role": "Statistica Technical Expert",
  "role_aliases": [
    "Statistica Expert",
    "Statistica Consultant",
    "Statistica Developer"
  ],
  "role_archetype": "Engineering",
  "roles_and_responsibilities": [
    {
      "bullet_count": 13,
      "heading": "Responsibilities",
      "heading_was_present": true,
      "source_marker": {
        "first_5_words": "\u2022 Requirements: Min. 4-8 years",
        "last_5_words": "with a developer mindset."
      },
      "text": "\u2022 Requirements: Min. 4-8 years of experience in STATISTICA Application.\n\u2022 Responsible for Design, develop, test, document and deliver the Business requirements related to Statistica Workflows.\n\u2022 Knowledgeable in Statistica Workspaces, Data Configurations, Analysis Configurations, Oracle Databases \u0026 Windows architecture.\n\u2022 Responsible for troubleshooting different existing STATISTICA Workflows.\n\u2022 Assemble large and complex data sets that meet functional / non-functional business requirements Building and optimizing data pipelines, architectures, and data sets.\n\u2022 Experience working with Big data Ecosystem like (MapReduce, Hive, Spark, Oozie, Kafka and Any NoSQL databases).\n\u2022 Experience in Unix/Linux shell scripting.\n\u2022 Experience working with large data sets in distributed computing to perform Massive parallel processing which Contributes to build analytics pipeline Responsible for Application deployment, upgrade, Enhancements, Hotfix implementations as and when required.\n\u2022 Responsible for delivery of various ITSM services like Incident Management, problem Management, Change and Release management etc.\n\u2022 Experienced in new implementation and upgrade projects from requirement gathering stage to qualification and final delivery Understanding of various Data interfaces OSI PI, MES, SAP, LIMS system.\n\u2022 Identify, design and implement internal process improvements.\n\u2022 Autonomous review and clarification of business requirements before implementation, creation, and documentation of technical design/entity-relationship including alignment with stakeholders\n\u2022 Deliver \u0026 Maintain Services adhering to KPI.\n\u2022 Able to work on shift basis.\n\u2022 Good business acumen with a developer mindset.",
      "word_count": 284
    }
  ],
  "urls": []
}

API 1 — extract-from-jd click to toggle

{
  "final_skills": [
    {
      "is_primary": true,
      "skill_name": "Statistica"
    },
    {
      "is_primary": true,
      "skill_name": "Oracle"
    },
    {
      "is_primary": true,
      "skill_name": "Windows"
    },
    {
      "is_primary": true,
      "skill_name": "MapReduce"
    },
    {
      "is_primary": true,
      "skill_name": "Hive"
    },
    {
      "is_primary": true,
      "skill_name": "Spark"
    },
    {
      "is_primary": true,
      "skill_name": "Oozie"
    },
    {
      "is_primary": true,
      "skill_name": "Kafka"
    },
    {
      "is_primary": true,
      "skill_name": "NoSQL"
    },
    {
      "is_primary": true,
      "skill_name": "Unix"
    },
    {
      "is_primary": true,
      "skill_name": "Linux"
    },
    {
      "is_primary": true,
      "skill_name": "Shell scripting"
    },
    {
      "is_primary": true,
      "skill_name": "Distributed computing"
    },
    {
      "is_primary": true,
      "skill_name": "Massive parallel processing"
    },
    {
      "is_primary": false,
      "skill_name": "OSI PI"
    },
    {
      "is_primary": false,
      "skill_name": "MES"
    },
    {
      "is_primary": false,
      "skill_name": "SAP"
    },
    {
      "is_primary": false,
      "skill_name": "LIMS"
    },
    {
      "is_primary": false,
      "skill_name": "ITSM"
    },
    {
      "is_primary": false,
      "skill_name": "Incident Management"
    },
    {
      "is_primary": false,
      "skill_name": "Problem Management"
    },
    {
      "is_primary": false,
      "skill_name": "Change Management"
    },
    {
      "is_primary": false,
      "skill_name": "Release Management"
    },
    {
      "is_primary": false,
      "skill_name": "Entity-relationship modeling"
    },
    {
      "is_primary": false,
      "skill_name": "KPI"
    }
  ],
  "jd_role": {
    "display_name": "Statistica Technical Expert",
    "rationale": null,
    "role_aliases": [
      "Statistica Expert",
      "Statistica Consultant",
      "Statistica Developer"
    ],
    "role_archetype": "Engineering",
    "slug": ""
  },
  "nano_parsed": {
    "JD_type": "pass",
    "about_company": {
      "source_marker": {
        "first_5_words": "YASH Technologies is a leading",
        "last_5_words": "and drive business transformation."
      },
      "text": "YASH Technologies is a leading technology integrator specializing in helping clients reimagine operating models, enhance competitiveness, optimize costs, foster exceptional stakeholder experiences, and drive business transformation.",
      "word_count": 40
    },
    "certifications": [],
    "company_name": "YASH Technologies",
    "ctc": null,
    "domain": {
      "primary": {
        "aliases": [
          "Technology Integration",
          "Business Transformation"
        ],
        "domain": "IT Services \u0026 Consulting"
      },
      "secondary": null
    },
    "education": [],
    "experience": {
      "max": 8,
      "min": 4,
      "raw": "Min. 4-8 years of experience in STATISTICA Application."
    },
    "job_locations": [],
    "role": "Statistica Technical Expert",
    "role_aliases": [
      "Statistica Expert",
      "Statistica Consultant",
      "Statistica Developer"
    ],
    "role_archetype": "Engineering",
    "roles_and_responsibilities": [
      {
        "bullet_count": 13,
        "heading": "Responsibilities",
        "heading_was_present": true,
        "source_marker": {
          "first_5_words": "\u2022 Requirements: Min. 4-8 years",
          "last_5_words": "with a developer mindset."
        },
        "text": "\u2022 Requirements: Min. 4-8 years of experience in STATISTICA Application.\n\u2022 Responsible for Design, develop, test, document and deliver the Business requirements related to Statistica Workflows.\n\u2022 Knowledgeable in Statistica Workspaces, Data Configurations, Analysis Configurations, Oracle Databases \u0026 Windows architecture.\n\u2022 Responsible for troubleshooting different existing STATISTICA Workflows.\n\u2022 Assemble large and complex data sets that meet functional / non-functional business requirements Building and optimizing data pipelines, architectures, and data sets.\n\u2022 Experience working with Big data Ecosystem like (MapReduce, Hive, Spark, Oozie, Kafka and Any NoSQL databases).\n\u2022 Experience in Unix/Linux shell scripting.\n\u2022 Experience working with large data sets in distributed computing to perform Massive parallel processing which Contributes to build analytics pipeline Responsible for Application deployment, upgrade, Enhancements, Hotfix implementations as and when required.\n\u2022 Responsible for delivery of various ITSM services like Incident Management, problem Management, Change and Release management etc.\n\u2022 Experienced in new implementation and upgrade projects from requirement gathering stage to qualification and final delivery Understanding of various Data interfaces OSI PI, MES, SAP, LIMS system.\n\u2022 Identify, design and implement internal process improvements.\n\u2022 Autonomous review and clarification of business requirements before implementation, creation, and documentation of technical design/entity-relationship including alignment with stakeholders\n\u2022 Deliver \u0026 Maintain Services adhering to KPI.\n\u2022 Able to work on shift basis.\n\u2022 Good business acumen with a developer mindset.",
        "word_count": 284
      }
    ],
    "urls": []
  },
  "rejected": false,
  "rejection_reason": null,
  "run_id": "fbe0218e-4418-46a3-be37-7d2bfc5e421a",
  "stage3_signals": {
    "alias_found": false,
    "alias_match_roles": [],
    "kra_match_roles": [
      {
        "display_name": "Data Engineer",
        "kra_matches": [
          {
            "kra_text": "Works with data analysts, data scientists, and business stakeholders to define data models, ingestion schedules, and data delivery requirements.",
            "sentence": "Assemble large and complex data sets that meet functional / non-functional business requirements Building and optimizing data pipelines, architectures, and data sets.",
            "similarity": 0.6438
          },
          {
            "kra_text": "Develops batch and real-time streaming data pipelines using Apache Spark, Apache Kafka, Apache Flink, or Airflow for data movement and processing at scale.",
            "sentence": "Experience working with Big data Ecosystem like (MapReduce, Hive, Spark, Oozie, Kafka and Any NoSQL databases).",
            "similarity": 0.6163
          },
          {
            "kra_text": "Develops batch and real-time streaming data pipelines using Apache Spark, Apache Kafka, Apache Flink, or Airflow for data movement and processing at scale.",
            "sentence": "Experience working with large data sets in distributed computing to perform Massive parallel processing which Contributes to build analytics pipeline Responsible for Application deployment, upgrade, Enhancements, Hotfix implementations as and when required.",
            "similarity": 0.569
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 2,
        "score": 0.6097,
        "slug": "data-engineer",
        "total_count": null
      },
      {
        "display_name": "ML Engineer",
        "kra_matches": [
          {
            "kra_text": "Prepares, cleans, and transforms training datasets, manages feature stores, and builds feature engineering pipelines for model training.",
            "sentence": "Assemble large and complex data sets that meet functional / non-functional business requirements Building and optimizing data pipelines, architectures, and data sets.",
            "similarity": 0.5491
          },
          {
            "kra_text": "Prepares, cleans, and transforms training datasets, manages feature stores, and builds feature engineering pipelines for model training.",
            "sentence": "Experience working with large data sets in distributed computing to perform Massive parallel processing which Contributes to build analytics pipeline Responsible for Application deployment, upgrade, Enhancements, Hotfix implementations as and when required.",
            "similarity": 0.4617
          },
          {
            "kra_text": "Translates product requirements into machine learning system specifications including feature definitions, model architecture choices, and success metric definitions.",
            "sentence": "Autonomous review and clarification of business requirements before implementation, creation, and documentation of technical design/entity-relationship including alignment with stakeholders",
            "similarity": 0.4572
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 3,
        "score": 0.4894,
        "slug": "ml-engineer",
        "total_count": null
      },
      {
        "display_name": "DevOps Engineer",
        "kra_matches": [
          {
            "kra_text": "Manages release management processes including environment promotion gates, deployment approval workflows, change management records, and rollback procedures.",
            "sentence": "Responsible for delivery of various ITSM services like Incident Management, problem Management, Change and Release management etc. \u2022 Experienced in new implementation and upgrade projects from requirement gathering stage to qualification and final delivery Understanding of various Data interfaces OSI PI, MES, SAP, LIMS system.",
            "similarity": 0.4974
          },
          {
            "kra_text": "Collaborates with development teams to improve build processes, reduce deployment friction, containerize applications, and adopt DevOps best practices.",
            "sentence": "Experience working with large data sets in distributed computing to perform Massive parallel processing which Contributes to build analytics pipeline Responsible for Application deployment, upgrade, Enhancements, Hotfix implementations as and when required.",
            "similarity": 0.4689
          },
          {
            "kra_text": "Collaborates with development teams to improve build processes, reduce deployment friction, containerize applications, and adopt DevOps best practices.",
            "sentence": "Identify, design and implement internal process improvements.",
            "similarity": 0.4541
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 10,
        "score": 0.4734,
        "slug": "devops-engineer",
        "total_count": null
      },
      {
        "display_name": "MLOps Engineer",
        "kra_matches": [
          {
            "kra_text": "Validates model performance benchmarks, data schema contracts, and system integration health before signing off on production release readiness.",
            "sentence": "Autonomous review and clarification of business requirements before implementation, creation, and documentation of technical design/entity-relationship including alignment with stakeholders",
            "similarity": 0.5073
          },
          {
            "kra_text": "Maintains ML platform runbooks, on-call escalation playbooks, and deployment procedure documentation for production operations teams.",
            "sentence": "Experience working with large data sets in distributed computing to perform Massive parallel processing which Contributes to build analytics pipeline Responsible for Application deployment, upgrade, Enhancements, Hotfix implementations as and when required.",
            "similarity": 0.4624
          },
          {
            "kra_text": "Validates model performance benchmarks, data schema contracts, and system integration health before signing off on production release readiness.",
            "sentence": "Assemble large and complex data sets that meet functional / non-functional business requirements Building and optimizing data pipelines, architectures, and data sets.",
            "similarity": 0.4392
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 16,
        "score": 0.4696,
        "slug": "ml-ops-engineer",
        "total_count": null
      },
      {
        "display_name": "Backend Developer",
        "kra_matches": [
          {
            "kra_text": "Investigates and resolves production incidents, API bugs, and service degradation through root cause analysis, hotfixes, and post-mortems.",
            "sentence": "Responsible for delivery of various ITSM services like Incident Management, problem Management, Change and Release management etc. \u2022 Experienced in new implementation and upgrade projects from requirement gathering stage to qualification and final delivery Understanding of various Data interfaces OSI PI, MES, SAP, LIMS system.",
            "similarity": 0.4836
          },
          {
            "kra_text": "Investigates and resolves production incidents, API bugs, and service degradation through root cause analysis, hotfixes, and post-mortems.",
            "sentence": "Experience working with large data sets in distributed computing to perform Massive parallel processing which Contributes to build analytics pipeline Responsible for Application deployment, upgrade, Enhancements, Hotfix implementations as and when required.",
            "similarity": 0.4769
          },
          {
            "kra_text": "Investigates and resolves production incidents, API bugs, and service degradation through root cause analysis, hotfixes, and post-mortems.",
            "sentence": "Responsible for troubleshooting different existing STATISTICA Workflows.",
            "similarity": 0.429
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 1,
        "score": 0.4632,
        "slug": "backend-engineer",
        "total_count": null
      }
    ],
    "skill_match_roles": [
      {
        "display_name": "Data Engineer",
        "kra_matches": null,
        "matched_count": 2,
        "matched_skills": [
          "Apache Spark",
          "Kafka"
        ],
        "role_id": 2,
        "score": 0.1429,
        "slug": "data-engineer",
        "total_count": 14
      },
      {
        "display_name": "Backend Developer",
        "kra_matches": null,
        "matched_count": 2,
        "matched_skills": [
          "Kafka",
          "NoSQL"
        ],
        "role_id": 1,
        "score": 0.1429,
        "slug": "backend-engineer",
        "total_count": 14
      },
      {
        "display_name": "iOS Developer",
        "kra_matches": null,
        "matched_count": 1,
        "matched_skills": [
          "Hive"
        ],
        "role_id": 6,
        "score": 0.0714,
        "slug": "ios-engineer",
        "total_count": 14
      },
      {
        "display_name": "Hybrid Mobile Developer",
        "kra_matches": null,
        "matched_count": 1,
        "matched_skills": [
          "Hive"
        ],
        "role_id": 11,
        "score": 0.0714,
        "slug": "hybrid-mobile-developer",
        "total_count": 14
      },
      {
        "display_name": "Android Developer",
        "kra_matches": null,
        "matched_count": 1,
        "matched_skills": [
          "Hive"
        ],
        "role_id": 4,
        "score": 0.0714,
        "slug": "android-engineer",
        "total_count": 14
      }
    ]
  },
  "stage4_decision": {
    "alias_collision_detected": false,
    "case": "DOMAIN",
    "chosen_role": {
      "display_name": "ITSM Engineer",
      "kra_matches": null,
      "matched_count": null,
      "matched_skills": null,
      "role_id": 206,
      "score": 0.84,
      "slug": "itsm-engineer",
      "total_count": null
    },
    "confidence": 0.84,
    "is_new_role": false,
    "llm2_fired": false,
    "llm2_reasoning": null,
    "matched_dimensions": [
      "ITSM service delivery",
      "Workflow troubleshooting",
      "Data pipeline engineering",
      "Big data and distributed computing",
      "Application deployment and upgrades",
      "Requirements analysis and technical design",
      "Process improvement",
      "Industrial data interface integration"
    ],
    "matched_kras": [
      "Design, develop, test, document and deliver the Business requirements",
      "Troubleshooting different existing STATISTICA Workflows",
      "Building and optimizing data pipelines, architectures, and data sets",
      "Responsible for Application deployment, upgrade, Enhancements, Hotfix implementations",
      "Delivery of various ITSM services like Incident Management",
      "problem Management, Change and Release management",
      "Identify, design and implement internal process improvements",
      "Deliver \u0026 Maintain Services adhering to KPI"
    ],
    "matched_skills": [
      "STATISTICA Application",
      "Statistica Workflows",
      "Statistica Workspaces",
      "Data Configurations",
      "Analysis Configurations",
      "Oracle Databases",
      "Windows architecture",
      "MapReduce",
      "Hive",
      "Spark",
      "Oozie",
      "Kafka",
      "Unix/Linux shell scripting",
      "NoSQL databases",
      "OSI PI"
    ],
    "new_role_display_name": null,
    "new_role_slug": null,
    "queued": false,
    "reasoning": "Domain=ITSM \u0026 Automation; The JD is centered on ITSM service delivery, incident/problem/change/release management, and maintaining services to KPI, which best matches an ITSM Engineer despite the Statistica technical focus.",
    "sub_role": null
  },
  "stage5_updates": {
    "centroid_n_after": 1,
    "centroid_updated": true,
    "collision_log_id": null,
    "new_kra_attached": {
      "best_kra_similarity": 0.0,
      "queue_id": 785,
      "r_and_r_preview": "\u2022 Requirements: Min. 4-8 years of experience in STATISTICA Application.\n\u2022 Responsible for Design, develop, test, document and deliver the Business requirements related to Statistica Workflows.\n\u2022 Knowl",
      "role_display_name": "ITSM Engineer",
      "role_slug": "itsm-engineer",
      "status": "pending"
    },
    "new_skills_attached": [
      {
        "is_primary": true,
        "queue_id": 11698,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "Statistica",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 11699,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "Oracle",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 11700,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "Windows",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 11701,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "MapReduce",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 11702,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "Oozie",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 11703,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "Unix",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 11704,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "Linux",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 11705,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "Shell scripting",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 11706,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "Distributed computing",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 11707,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "Massive parallel processing",
        "status": "pending"
      },
      {
        "is_primary": false,
        "queue_id": 11708,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "OSI PI",
        "status": "pending"
      },
      {
        "is_primary": false,
        "queue_id": 11709,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "MES",
        "status": "pending"
      },
      {
        "is_primary": false,
        "queue_id": 11710,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "SAP",
        "status": "pending"
      },
      {
        "is_primary": false,
        "queue_id": 11711,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "LIMS",
        "status": "pending"
      },
      {
        "is_primary": false,
        "queue_id": 11712,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "ITSM",
        "status": "pending"
      },
      {
        "is_primary": false,
        "queue_id": 11713,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "Incident Management",
        "status": "pending"
      },
      {
        "is_primary": false,
        "queue_id": 11714,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "Problem Management",
        "status": "pending"
      },
      {
        "is_primary": false,
        "queue_id": 11715,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "Change Management",
        "status": "pending"
      },
      {
        "is_primary": false,
        "queue_id": 11716,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "Release Management",
        "status": "pending"
      },
      {
        "is_primary": false,
        "queue_id": 11717,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "Entity-relationship modeling",
        "status": "pending"
      },
      {
        "is_primary": false,
        "queue_id": 11718,
        "role_display_name": "ITSM Engineer",
        "role_slug": "itsm-engineer",
        "skill_name": "KPI",
        "status": "pending"
      }
    ],
    "queue_entry_id": null,
    "v3_pipeline_triggered": false,
    "v3_role_slug": null,
    "v3_run_id": null
  }
}

API 2 — extract-details

{
  "alias_matches": [
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 4198,
      "existing_alias_text": "Hive",
      "input_term": "Hive",
      "matched_canonical": {
        "category_id": 3,
        "display_name": "Hive",
        "id": 2754,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "hive",
        "sub_category_id": 2242,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 2510,
      "existing_alias_text": "spark",
      "input_term": "Spark",
      "matched_canonical": {
        "category_id": 5,
        "display_name": "Apache Spark",
        "id": 1350,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "FRAMEWORK",
        "slug": "apache-spark",
        "sub_category_id": 1021,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 173,
      "existing_alias_text": "Kafka",
      "input_term": "Kafka",
      "matched_canonical": {
        "category_id": 3,
        "display_name": "Kafka",
        "id": 36,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "kafka",
        "sub_category_id": 3533,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 1989,
      "existing_alias_text": "NoSQL",
      "input_term": "NoSQL",
      "matched_canonical": {
        "category_id": 2,
        "display_name": "NoSQL",
        "id": 1346,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CONCEPT",
        "slug": "nosql",
        "sub_category_id": 1019,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    }
  ],
  "candidate_roles": [
    {
      "display_name": "Android Developer",
      "id": 4,
      "rationale": null,
      "role_archetype": null,
      "slug": "android-engineer",
      "source": "db"
    },
    {
      "display_name": "Flutter Developer",
      "id": 74,
      "rationale": null,
      "role_archetype": "Engineering",
      "slug": "flutter-developer",
      "source": "db"
    },
    {
      "display_name": "Hybrid Mobile Developer",
      "id": 11,
      "rationale": null,
      "role_archetype": null,
      "slug": "hybrid-mobile-developer",
      "source": "db"
    },
    {
      "display_name": "Native Mobile Developer",
      "id": 75,
      "rationale": null,
      "role_archetype": "Engineering",
      "slug": "native-mobile-developer",
      "source": "db"
    },
    {
      "display_name": "React Native Developer",
      "id": 73,
      "rationale": null,
      "role_archetype": "Engineering",
      "slug": "react-native-developer",
      "source": "db"
    },
    {
      "display_name": "iOS Developer",
      "id": 6,
      "rationale": null,
      "role_archetype": null,
      "slug": "ios-engineer",
      "source": "db"
    },
    {
      "display_name": "Data Engineer",
      "id": 2,
      "rationale": null,
      "role_archetype": null,
      "slug": "data-engineer",
      "source": "db"
    },
    {
      "display_name": ".NET Backend Developer",
      "id": 83,
      "rationale": null,
      "role_archetype": "Engineering",
      "slug": "dotnet-backend-developer",
      "source": "db"
    },
    {
      "display_name": "Go Backend Developer",
      "id": 81,
      "rationale": null,
      "role_archetype": "Engineering",
      "slug": "go-backend-developer",
      "source": "db"
    },
    {
      "display_name": "Kotlin Backend Developer",
      "id": 84,
      "rationale": null,
      "role_archetype": "Engineering",
      "slug": "kotlin-server-backend-developer",
      "source": "db"
    },
    {
      "display_name": "Node.js Backend Developer",
      "id": 82,
      "rationale": null,
      "role_archetype": "Engineering",
      "slug": "node-backend-developer",
      "source": "db"
    },
    {
      "display_name": "Scala Backend Developer",
      "id": 87,
      "rationale": null,
      "role_archetype": "Engineering",
      "slug": "scala-backend-developer",
      "source": "db"
    },
    {
      "display_name": "PHP Backend Developer",
      "id": 86,
      "rationale": null,
      "role_archetype": "Engineering",
      "slug": "php-backend-developer",
      "source": "db"
    },
    {
      "display_name": "Python Backend Developer",
      "id": 80,
      "rationale": null,
      "role_archetype": "Engineering",
      "slug": "python-backend-developer",
      "source": "db"
    },
    {
      "display_name": "Ruby Backend Developer",
      "id": 85,
      "rationale": null,
      "role_archetype": "Engineering",
      "slug": "ruby-backend-developer",
      "source": "db"
    },
    {
      "display_name": "Backend Developer",
      "id": 1,
      "rationale": null,
      "role_archetype": "A Backend Engineer designs, builds, and maintains the server-side logic and data handling that power applications and services. They focus on implementing reliable business functionality, integrating with other systems, and ensuring the backend is scalable, maintainable, and observable.",
      "slug": "backend-engineer",
      "source": "db"
    }
  ],
  "chosen_role": {
    "display_name": "ITSM Engineer",
    "id": 206,
    "rationale": "Domain=ITSM \u0026 Automation; The JD is centered on ITSM service delivery, incident/problem/change/release management, and maintaining services to KPI, which best matches an ITSM Engineer despite the Statistica technical focus.",
    "role_archetype": null,
    "slug": "itsm-engineer",
    "source": "db"
  },
  "dimensions": [
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Local Persistence and Offline Behavior",
        "id": 85,
        "rationale": "On-device storage used for caching, offline support, and durable client state. This cluster is coherent because iOS apps often need to preserve user progress and data when connectivity is limited.",
        "slug": "local-persistence-and-offline-behavior",
        "source": "db"
      },
      "input_skill": "Hive",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Android Developer",
          "id": 4,
          "rationale": null,
          "role_archetype": null,
          "slug": "android-engineer",
          "source": "db"
        },
        {
          "display_name": "Flutter Developer",
          "id": 74,
          "rationale": null,
          "role_archetype": "Engineering",
          "slug": "flutter-developer",
          "source": "db"
        },
        {
          "display_name": "Hybrid Mobile Developer",
          "id": 11,
          "rationale": null,
          "role_archetype": null,
          "slug": "hybrid-mobile-developer",
          "source": "db"
        },
        {
          "display_name": "Native Mobile Developer",
          "id": 75,
          "rationale": null,
          "role_archetype": "Engineering",
          "slug": "native-mobile-developer",
          "source": "db"
        },
        {
          "display_name": "React Native Developer",
          "id": 73,
          "rationale": null,
          "role_archetype": "Engineering",
          "slug": "react-native-developer",
          "source": "db"
        },
        {
          "display_name": "iOS Developer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "ios-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "ETL and ELT Tooling",
        "id": 24,
        "rationale": "Packaged tools for extracting, loading, and transforming data across systems. This dimension covers connector-based ingestion, transformation frameworks, and managed integration products.",
        "slug": "etl-and-elt-tooling",
        "source": "db"
      },
      "input_skill": "Spark",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 2,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Asynchronous Messaging and Event Streaming",
        "id": 297,
        "rationale": "Asynchronous communication patterns and broker technologies used to decouple backend services and move work off the request path. Includes queues, pub/sub, event streams, consumer groups, dead-letter queues, and delivery semantics across systems such as Kafka, RabbitMQ, NATS, SQS/SNS, Pulsar, and ActiveMQ.",
        "slug": "asynchronous-messaging-and-event-streaming",
        "source": "db"
      },
      "input_skill": "Kafka",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": ".NET Backend Developer",
          "id": 83,
          "rationale": null,
          "role_archetype": "Engineering",
          "slug": "dotnet-backend-developer",
          "source": "db"
        },
        {
          "display_name": "Go Backend Developer",
          "id": 81,
          "rationale": null,
          "role_archetype": "Engineering",
          "slug": "go-backend-developer",
          "source": "db"
        },
        {
          "display_name": "Kotlin Backend Developer",
          "id": 84,
          "rationale": null,
          "role_archetype": "Engineering",
          "slug": "kotlin-server-backend-developer",
          "source": "db"
        },
        {
          "display_name": "Node.js Backend Developer",
          "id": 82,
          "rationale": null,
          "role_archetype": "Engineering",
          "slug": "node-backend-developer",
          "source": "db"
        },
        {
          "display_name": "Scala Backend Developer",
          "id": 87,
          "rationale": null,
          "role_archetype": "Engineering",
          "slug": "scala-backend-developer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Messaging and Background Jobs",
        "id": 291,
        "rationale": "Asynchronous processing patterns and worker systems used to decouple backend work from request handling. This is a coherent cluster because the role supports background jobs, retries, and deferred processing.",
        "slug": "messaging-and-background-jobs",
        "source": "db"
      },
      "input_skill": "Kafka",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "PHP Backend Developer",
          "id": 86,
          "rationale": null,
          "role_archetype": "Engineering",
          "slug": "php-backend-developer",
          "source": "db"
        },
        {
          "display_name": "Python Backend Developer",
          "id": 80,
          "rationale": null,
          "role_archetype": "Engineering",
          "slug": "python-backend-developer",
          "source": "db"
        },
        {
          "display_name": "Ruby Backend Developer",
          "id": 85,
          "rationale": null,
          "role_archetype": "Engineering",
          "slug": "ruby-backend-developer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Messaging and Event Streaming",
        "id": 8,
        "rationale": "Transport-layer systems used to move events and decouple producers from consumers. Data engineers use these systems to ingest, buffer, and distribute event data before downstream processing.",
        "slug": "messaging-and-event-streaming",
        "source": "db"
      },
      "input_skill": "Kafka",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Backend Developer",
          "id": 1,
          "rationale": null,
          "role_archetype": "A Backend Engineer designs, builds, and maintains the server-side logic and data handling that power applications and services. They focus on implementing reliable business functionality, integrating with other systems, and ensuring the backend is scalable, maintainable, and observable.",
          "slug": "backend-engineer",
          "source": "db"
        },
        {
          "display_name": "Data Engineer",
          "id": 2,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "NoSQL Databases",
        "id": 19,
        "rationale": "Models and manages data using non-relational database systems.",
        "slug": "nosql-databases",
        "source": "db"
      },
      "input_skill": "NoSQL",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Backend Developer",
          "id": 1,
          "rationale": null,
          "role_archetype": "A Backend Engineer designs, builds, and maintains the server-side logic and data handling that power applications and services. They focus on implementing reliable business functionality, integrating with other systems, and ensuring the backend is scalable, maintainable, and observable.",
          "slug": "backend-engineer",
          "source": "db"
        }
      ]
    }
  ],
  "input_final_skills": [
    "Statistica",
    "Oracle",
    "Windows",
    "MapReduce",
    "Hive",
    "Spark",
    "Oozie",
    "Kafka",
    "NoSQL",
    "Unix",
    "Linux",
    "Shell scripting",
    "Distributed computing",
    "Massive parallel processing",
    "OSI PI",
    "MES",
    "SAP",
    "LIMS",
    "ITSM",
    "Incident Management",
    "Problem Management",
    "Change Management",
    "Release Management",
    "Entity-relationship modeling",
    "KPI"
  ],
  "input_llm_skills": [
    "Statistica",
    "Oracle",
    "Windows",
    "MapReduce",
    "Hive",
    "Spark",
    "Oozie",
    "Kafka",
    "NoSQL",
    "Unix",
    "Linux",
    "Shell scripting",
    "Distributed computing",
    "Massive parallel processing",
    "OSI PI",
    "MES",
    "SAP",
    "LIMS",
    "ITSM",
    "Incident Management",
    "Problem Management",
    "Change Management",
    "Release Management",
    "Entity-relationship modeling",
    "KPI"
  ],
  "new_aliases_persisted": 0,
  "run_id": "fbe0218e-4418-46a3-be37-7d2bfc5e421a",
  "skills_detail": [
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Statistica",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Data Engineering Tools",
          "skill_nature": "TOOL",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "statistica",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Oracle",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Databases",
          "skill_nature": "TOOL",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "oracle",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Windows",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Operating Systems",
          "skill_nature": "PLATFORM",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "windows",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "MapReduce",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Concepts",
          "skill_nature": "CONCEPT",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "mapreduce",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Hive",
          "alias_type": "CANONICAL",
          "id": 4198,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 3,
        "display_name": "Hive",
        "id": 2754,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "hive",
        "sub_category_id": 2242,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Local Persistence and Offline Behavior",
            "id": 85,
            "rationale": "On-device storage used for caching, offline support, and durable client state. This cluster is coherent because iOS apps often need to preserve user progress and data when connectivity is limited.",
            "slug": "local-persistence-and-offline-behavior",
            "source": "db"
          },
          "input_skill": "Hive",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Android Developer",
              "id": 4,
              "rationale": null,
              "role_archetype": null,
              "slug": "android-engineer",
              "source": "db"
            },
            {
              "display_name": "Flutter Developer",
              "id": 74,
              "rationale": null,
              "role_archetype": "Engineering",
              "slug": "flutter-developer",
              "source": "db"
            },
            {
              "display_name": "Hybrid Mobile Developer",
              "id": 11,
              "rationale": null,
              "role_archetype": null,
              "slug": "hybrid-mobile-developer",
              "source": "db"
            },
            {
              "display_name": "Native Mobile Developer",
              "id": 75,
              "rationale": null,
              "role_archetype": "Engineering",
              "slug": "native-mobile-developer",
              "source": "db"
            },
            {
              "display_name": "React Native Developer",
              "id": 73,
              "rationale": null,
              "role_archetype": "Engineering",
              "slug": "react-native-developer",
              "source": "db"
            },
            {
              "display_name": "iOS Developer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "ios-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Hive",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Apache Spark",
          "alias_type": "CANONICAL",
          "id": 2004,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "apache spark 3",
          "alias_type": "VERSION",
          "id": 2006,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "spark",
          "alias_type": "VERSION",
          "id": 2510,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "spark 3",
          "alias_type": "VERSION",
          "id": 2007,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "spark 3.x",
          "alias_type": "VERSION",
          "id": 2009,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "spark3",
          "alias_type": "VERSION",
          "id": 2008,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 5,
        "display_name": "Apache Spark",
        "id": 1350,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "FRAMEWORK",
        "slug": "apache-spark",
        "sub_category_id": 1021,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "ETL and ELT Tooling",
            "id": 24,
            "rationale": "Packaged tools for extracting, loading, and transforming data across systems. This dimension covers connector-based ingestion, transformation frameworks, and managed integration products.",
            "slug": "etl-and-elt-tooling",
            "source": "db"
          },
          "input_skill": "Spark",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 2,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Spark",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Oozie",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Data Engineering Tools",
          "skill_nature": "TOOL",
          "sub_category": "general",
          "typical_lifespan": "SHORT_LIVED",
          "version_strategy": "VERSIONED",
          "volatility": "FAST"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "oozie",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Kafka",
          "alias_type": "CANONICAL",
          "id": 173,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 3,
        "display_name": "Kafka",
        "id": 36,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "kafka",
        "sub_category_id": 3533,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Asynchronous Messaging and Event Streaming",
            "id": 297,
            "rationale": "Asynchronous communication patterns and broker technologies used to decouple backend services and move work off the request path. Includes queues, pub/sub, event streams, consumer groups, dead-letter queues, and delivery semantics across systems such as Kafka, RabbitMQ, NATS, SQS/SNS, Pulsar, and ActiveMQ.",
            "slug": "asynchronous-messaging-and-event-streaming",
            "source": "db"
          },
          "input_skill": "Kafka",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": ".NET Backend Developer",
              "id": 83,
              "rationale": null,
              "role_archetype": "Engineering",
              "slug": "dotnet-backend-developer",
              "source": "db"
            },
            {
              "display_name": "Go Backend Developer",
              "id": 81,
              "rationale": null,
              "role_archetype": "Engineering",
              "slug": "go-backend-developer",
              "source": "db"
            },
            {
              "display_name": "Kotlin Backend Developer",
              "id": 84,
              "rationale": null,
              "role_archetype": "Engineering",
              "slug": "kotlin-server-backend-developer",
              "source": "db"
            },
            {
              "display_name": "Node.js Backend Developer",
              "id": 82,
              "rationale": null,
              "role_archetype": "Engineering",
              "slug": "node-backend-developer",
              "source": "db"
            },
            {
              "display_name": "Scala Backend Developer",
              "id": 87,
              "rationale": null,
              "role_archetype": "Engineering",
              "slug": "scala-backend-developer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Messaging and Background Jobs",
            "id": 291,
            "rationale": "Asynchronous processing patterns and worker systems used to decouple backend work from request handling. This is a coherent cluster because the role supports background jobs, retries, and deferred processing.",
            "slug": "messaging-and-background-jobs",
            "source": "db"
          },
          "input_skill": "Kafka",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "PHP Backend Developer",
              "id": 86,
              "rationale": null,
              "role_archetype": "Engineering",
              "slug": "php-backend-developer",
              "source": "db"
            },
            {
              "display_name": "Python Backend Developer",
              "id": 80,
              "rationale": null,
              "role_archetype": "Engineering",
              "slug": "python-backend-developer",
              "source": "db"
            },
            {
              "display_name": "Ruby Backend Developer",
              "id": 85,
              "rationale": null,
              "role_archetype": "Engineering",
              "slug": "ruby-backend-developer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Messaging and Event Streaming",
            "id": 8,
            "rationale": "Transport-layer systems used to move events and decouple producers from consumers. Data engineers use these systems to ingest, buffer, and distribute event data before downstream processing.",
            "slug": "messaging-and-event-streaming",
            "source": "db"
          },
          "input_skill": "Kafka",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Backend Developer",
              "id": 1,
              "rationale": null,
              "role_archetype": "A Backend Engineer designs, builds, and maintains the server-side logic and data handling that power applications and services. They focus on implementing reliable business functionality, integrating with other systems, and ensuring the backend is scalable, maintainable, and observable.",
              "slug": "backend-engineer",
              "source": "db"
            },
            {
              "display_name": "Data Engineer",
              "id": 2,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Kafka",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "NoSQL",
          "alias_type": "CANONICAL",
          "id": 1989,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 2,
        "display_name": "NoSQL",
        "id": 1346,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CONCEPT",
        "slug": "nosql",
        "sub_category_id": 1019,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "NoSQL Databases",
            "id": 19,
            "rationale": "Models and manages data using non-relational database systems.",
            "slug": "nosql-databases",
            "source": "db"
          },
          "input_skill": "NoSQL",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Backend Developer",
              "id": 1,
              "rationale": null,
              "role_archetype": "A Backend Engineer designs, builds, and maintains the server-side logic and data handling that power applications and services. They focus on implementing reliable business functionality, integrating with other systems, and ensuring the backend is scalable, maintainable, and observable.",
              "slug": "backend-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "NoSQL",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Unix",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Operating Systems",
          "skill_nature": "PLATFORM",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "unix",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Linux",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Operating Systems",
          "skill_nature": "PLATFORM",
          "sub_category": "general",
          "typical_lifespan": "EVERGREEN",
          "version_strategy": "UNVERSIONED",
          "volatility": "STABLE"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "linux",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Shell scripting",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Programming Languages",
          "skill_nature": "LANGUAGE",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "shell-scripting",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Distributed computing",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Concepts",
          "skill_nature": "CONCEPT",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "distributed-computing",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Massive parallel processing",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Concepts",
          "skill_nature": "CONCEPT",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "massive-parallel-processing",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "OSI PI",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Data Engineering Tools",
          "skill_nature": "TOOL",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "osi-pi",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "MES",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Applications",
          "skill_nature": "TOOL",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "mes",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "SAP",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Applications",
          "skill_nature": "PLATFORM",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "sap",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "LIMS",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Applications",
          "skill_nature": "PLATFORM",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "lims",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "ITSM",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Concepts",
          "skill_nature": "CONCEPT",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "itsm",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Incident Management",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Practices",
          "skill_nature": "PRACTICE",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "incident-management",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Problem Management",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Practices",
          "skill_nature": "PRACTICE",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "problem-management",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Change Management",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Practices",
          "skill_nature": "PRACTICE",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "change-management",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Release Management",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Practices",
          "skill_nature": "PRACTICE",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "release-management",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Entity-relationship modeling",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Concepts",
          "skill_nature": "CONCEPT",
          "sub_category": "general",
          "typical_lifespan": "EVERGREEN",
          "version_strategy": "UNVERSIONED",
          "volatility": "STABLE"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "entity-relationship-modeling",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "KPI",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Concepts",
          "skill_nature": "CONCEPT",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "kpi",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    }
  ],
  "unmatched_skills": [
    "Statistica",
    "Oracle",
    "Windows",
    "MapReduce",
    "Oozie",
    "Unix",
    "Linux",
    "Shell scripting",
    "Distributed computing",
    "Massive parallel processing",
    "OSI PI",
    "MES",
    "SAP",
    "LIMS",
    "ITSM",
    "Incident Management",
    "Problem Management",
    "Change Management",
    "Release Management",
    "Entity-relationship modeling",
    "KPI"
  ]
}

API 3 — final-role-output

{
  "chosen_role": {
    "display_name": "ITSM Engineer",
    "id": 206,
    "rationale": "Domain=ITSM \u0026 Automation; The JD is centered on ITSM service delivery, incident/problem/change/release management, and maintaining services to KPI, which best matches an ITSM Engineer despite the Statistica technical focus.",
    "role_archetype": null,
    "slug": "itsm-engineer",
    "source": "db"
  },
  "chosen_role_resolution": "in_db",
  "final_input_skills": [
    {
      "skill": "Statistica",
      "tag": "new"
    },
    {
      "skill": "Oracle",
      "tag": "new"
    },
    {
      "skill": "Windows",
      "tag": "new"
    },
    {
      "skill": "MapReduce",
      "tag": "new"
    },
    {
      "skill": "Hive",
      "tag": "in_db"
    },
    {
      "skill": "Spark",
      "tag": "in_db"
    },
    {
      "skill": "Oozie",
      "tag": "new"
    },
    {
      "skill": "Kafka",
      "tag": "in_db"
    },
    {
      "skill": "NoSQL",
      "tag": "in_db"
    },
    {
      "skill": "Unix",
      "tag": "new"
    },
    {
      "skill": "Linux",
      "tag": "new"
    },
    {
      "skill": "Shell scripting",
      "tag": "new"
    },
    {
      "skill": "Distributed computing",
      "tag": "new"
    },
    {
      "skill": "Massive parallel processing",
      "tag": "new"
    },
    {
      "skill": "OSI PI",
      "tag": "new"
    },
    {
      "skill": "MES",
      "tag": "new"
    },
    {
      "skill": "SAP",
      "tag": "new"
    },
    {
      "skill": "LIMS",
      "tag": "new"
    },
    {
      "skill": "ITSM",
      "tag": "new"
    },
    {
      "skill": "Incident Management",
      "tag": "new"
    },
    {
      "skill": "Problem Management",
      "tag": "new"
    },
    {
      "skill": "Change Management",
      "tag": "new"
    },
    {
      "skill": "Release Management",
      "tag": "new"
    },
    {
      "skill": "Entity-relationship modeling",
      "tag": "new"
    },
    {
      "skill": "KPI",
      "tag": "new"
    }
  ],
  "llm_cost_api1_usd": null,
  "llm_cost_api2_usd": null,
  "llm_cost_api3_usd": null,
  "llm_cost_total_usd": null,
  "persistence": {
    "items": [
      {
        "chosen_role_id": 206,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Local Persistence and Offline Behavior",
          "id": 85,
          "rationale": "On-device storage used for caching, offline support, and durable client state. This cluster is coherent because iOS apps often need to preserve user progress and data when connectivity is limited.",
          "slug": "local-persistence-and-offline-behavior",
          "source": "db"
        },
        "dimension_id": 85,
        "input_skill": "Hive",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Android Developer",
            "id": 4,
            "rationale": null,
            "role_archetype": null,
            "slug": "android-engineer",
            "source": "db"
          },
          {
            "display_name": "Flutter Developer",
            "id": 74,
            "rationale": null,
            "role_archetype": "Engineering",
            "slug": "flutter-developer",
            "source": "db"
          },
          {
            "display_name": "Hybrid Mobile Developer",
            "id": 11,
            "rationale": null,
            "role_archetype": null,
            "slug": "hybrid-mobile-developer",
            "source": "db"
          },
          {
            "display_name": "Native Mobile Developer",
            "id": 75,
            "rationale": null,
            "role_archetype": "Engineering",
            "slug": "native-mobile-developer",
            "source": "db"
          },
          {
            "display_name": "React Native Developer",
            "id": 73,
            "rationale": null,
            "role_archetype": "Engineering",
            "slug": "react-native-developer",
            "source": "db"
          },
          {
            "display_name": "iOS Developer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "ios-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2754,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 206,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "ETL and ELT Tooling",
          "id": 24,
          "rationale": "Packaged tools for extracting, loading, and transforming data across systems. This dimension covers connector-based ingestion, transformation frameworks, and managed integration products.",
          "slug": "etl-and-elt-tooling",
          "source": "db"
        },
        "dimension_id": 24,
        "input_skill": "Spark",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 2,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 1350,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 206,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Asynchronous Messaging and Event Streaming",
          "id": 297,
          "rationale": "Asynchronous communication patterns and broker technologies used to decouple backend services and move work off the request path. Includes queues, pub/sub, event streams, consumer groups, dead-letter queues, and delivery semantics across systems such as Kafka, RabbitMQ, NATS, SQS/SNS, Pulsar, and ActiveMQ.",
          "slug": "asynchronous-messaging-and-event-streaming",
          "source": "db"
        },
        "dimension_id": 297,
        "input_skill": "Kafka",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": ".NET Backend Developer",
            "id": 83,
            "rationale": null,
            "role_archetype": "Engineering",
            "slug": "dotnet-backend-developer",
            "source": "db"
          },
          {
            "display_name": "Go Backend Developer",
            "id": 81,
            "rationale": null,
            "role_archetype": "Engineering",
            "slug": "go-backend-developer",
            "source": "db"
          },
          {
            "display_name": "Kotlin Backend Developer",
            "id": 84,
            "rationale": null,
            "role_archetype": "Engineering",
            "slug": "kotlin-server-backend-developer",
            "source": "db"
          },
          {
            "display_name": "Node.js Backend Developer",
            "id": 82,
            "rationale": null,
            "role_archetype": "Engineering",
            "slug": "node-backend-developer",
            "source": "db"
          },
          {
            "display_name": "Scala Backend Developer",
            "id": 87,
            "rationale": null,
            "role_archetype": "Engineering",
            "slug": "scala-backend-developer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 36,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 206,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Messaging and Background Jobs",
          "id": 291,
          "rationale": "Asynchronous processing patterns and worker systems used to decouple backend work from request handling. This is a coherent cluster because the role supports background jobs, retries, and deferred processing.",
          "slug": "messaging-and-background-jobs",
          "source": "db"
        },
        "dimension_id": 291,
        "input_skill": "Kafka",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "PHP Backend Developer",
            "id": 86,
            "rationale": null,
            "role_archetype": "Engineering",
            "slug": "php-backend-developer",
            "source": "db"
          },
          {
            "display_name": "Python Backend Developer",
            "id": 80,
            "rationale": null,
            "role_archetype": "Engineering",
            "slug": "python-backend-developer",
            "source": "db"
          },
          {
            "display_name": "Ruby Backend Developer",
            "id": 85,
            "rationale": null,
            "role_archetype": "Engineering",
            "slug": "ruby-backend-developer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 36,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 206,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Messaging and Event Streaming",
          "id": 8,
          "rationale": "Transport-layer systems used to move events and decouple producers from consumers. Data engineers use these systems to ingest, buffer, and distribute event data before downstream processing.",
          "slug": "messaging-and-event-streaming",
          "source": "db"
        },
        "dimension_id": 8,
        "input_skill": "Kafka",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Backend Developer",
            "id": 1,
            "rationale": null,
            "role_archetype": "A Backend Engineer designs, builds, and maintains the server-side logic and data handling that power applications and services. They focus on implementing reliable business functionality, integrating with other systems, and ensuring the backend is scalable, maintainable, and observable.",
            "slug": "backend-engineer",
            "source": "db"
          },
          {
            "display_name": "Data Engineer",
            "id": 2,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 36,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 206,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "NoSQL Databases",
          "id": 19,
          "rationale": "Models and manages data using non-relational database systems.",
          "slug": "nosql-databases",
          "source": "db"
        },
        "dimension_id": 19,
        "input_skill": "NoSQL",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Backend Developer",
            "id": 1,
            "rationale": null,
            "role_archetype": "A Backend Engineer designs, builds, and maintains the server-side logic and data handling that power applications and services. They focus on implementing reliable business functionality, integrating with other systems, and ensuring the backend is scalable, maintainable, and observable.",
            "slug": "backend-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 1346,
        "skill_tag": "in_db",
        "skipped_reason": null
      }
    ],
    "new_skills_created": 0,
    "role_dimension_saved": 0,
    "skill_dimension_saved": 0,
    "skipped": 0
  },
  "planner_output": null,
  "run_id": "fbe0218e-4418-46a3-be37-7d2bfc5e421a"
}

LLM Calls

Every model call made for this run, in pipeline order. Click a card to see the model's response.

Loading…