← Back to history

Pipeline run

325b3712-22a1-4837-9bfe-bdc7cb1fbce6

Pipeline LLM cost (USD)

API 1: $0.0003 API 2: $0.0474 API 3: $0.0022 Total: $0.0499

Client output enrichment

v2 Skill cluster · Nature of work · AI index · Tech stack maturity · Evidence · KRA description

role baseline loaded sources · ai_index: jd · nature_of_work: no_kras · tech_stack_maturity: jd

Nature of work no kras

Vague JD — no KRAs present to derive a specific nature of work.

Tech stack maturity

Mainstream Modern

AI index (0 = no AI use, 5 = totally AI-dependent · v2.1)

0.20 / 5

· Title match

✓ Has AI skill

· AI skill (primary)

· AI skill (secondary)

· On AI team

· Builds AI products

vocab breakdown (legacy)

Assistants (×1): —

Frameworks (×2): —

Models / concepts (×3): RAG, LLMs, MLOps, NLP, AI, ML, AI/ML, Generative AI, Machine Learning, Deep Learning

Evidence — skills matched in JD (15)

Python TensorFlow PyTorch Scikit-learn REST Docker Kubernetes SQL NoSQL AWS GCP Git CI/CD MLflow Kubeflow

Skill cluster (7 dimension groups, role-scoped)

Cloud Provider Platforms

AWS GCP

Container Orchestration Platforms

Kubernetes

Containerization and Image Builds

Docker

Integration Protocols & Standards

REST

ML Frameworks and Libraries

PyTorch

Python Programming

Python

Cross-cutting / unaligned

TensorFlow Scikit-learn SQL NoSQL Git CI/CD MLflow Kubeflow

Status: completed Created: 2026-05-12T11:24:47.900829Z Updated: 2026-05-12T11:26:39.550703Z API 3 duration: 5608 ms

Flow Current 3-step pipeline

1 POST /skills/extract-from-jd

2 POST /skills/extract-details

3 POST /skills/final-role-output

Role Chosen role & resolution

Data Scientist

slug: data-scientist · id: 7 · source: db

Data Scientist is the most fitting role as it emphasizes the primary skill of Python and encompasses advanced analytical capabilities.

Resolution: in_db — role exists in library; skill↔dim and role↔dim links saved when applicable.

New skills

Skill↔dim saved

Role↔dim saved

Skipped

Job description

Job Description – Machine Learning Engineer

Experience: 2–5 Years
Location: Bangalore / Hybrid
Employment Type: Full-Time

About the Role

We are seeking a Machine Learning Engineer to build and deploy scalable AI/ML solutions for real-world applications. You will work closely with data scientists, backend engineers, and product teams to develop intelligent systems and production-grade ML pipelines.

Key Responsibilities
Design, train, and optimize machine learning models
Build scalable ML pipelines for data processing and inference
Deploy ML models using cloud and container technologies
Work with large structured and unstructured datasets
Improve model accuracy, latency, and reliability
Integrate AI services into backend applications and APIs
Monitor model performance and retrain models when required
Collaborate with cross-functional teams on AI features
Required Skills
Strong knowledge of Python and machine learning libraries
Experience with TensorFlow, PyTorch, or Scikit-learn
Understanding of deep learning and NLP concepts
Experience with REST APIs and backend integration
Familiarity with Docker and Kubernetes
Knowledge of SQL/NoSQL databases
Understanding of cloud platforms such as AWS or GCP
Experience with Git and CI/CD workflows
Preferred Qualifications
Experience with LLMs or Generative AI
Knowledge of MLOps tools like MLflow or Kubeflow
Familiarity with vector databases and RAG pipelines
Exposure to distributed systems and big data tools
Benefits
Flexible hybrid work model
AI research and learning budget
Health and wellness benefits
Fast-paced product engineering environment
Opportunity to work on cutting-edge AI systems

Skills from this JD

Each row merges API 1 extraction, API 2 library match / v3 orchestration (dimensions + locked dims), and API 3 persistence tags.

Python Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Python id=393 · python

Aliases — catalog

Cobalt Strike (CANONICAL) primary

Context tags (catalog)

Malleable C2 beacon credential dumping kerberos lateral movement payload phishing post-exploitation privilege escalation psexec red team sleep mask smb stager team server

Stored enrichment (catalog DB)

Category: Tool
Sub-category: Adversary Simulation Tool
Vendor: Fortra
License: proprietary
Year introduced: 2012
Confidence: 0.98
Version strategy: NOT_APPLICABLE

Maturity reasoning: Appears in a limited set of red-team/pentest JDs and security vendor training, but far below mainstream devops tools; market signal is specialized adversary-simulation usage rather than broad hiring demand.

Skill profile (library / DB)

Skill nature: LANGUAGE
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 5
Sub-category id: 54
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Analytical Programming Languages Catalog dimension db id 82

Library dimension (catalog)

Roles linked in library: Data Analyst, Data Scientist
Automation Scripting and CLI Catalog dimension db id 48

Library dimension (catalog)

Roles linked in library: Azure Cloud Engineer, Cloud Engineer
Automation and Scripting for Operations Catalog dimension db id 361

Library dimension (catalog)

Roles linked in library: Virtualization Engineer
Network Automation and Scripting Catalog dimension db id 285

Library dimension (catalog)

Roles linked in library: Network Engineer
Programming Languages for AI Workflows Catalog dimension db id 261

Library dimension (catalog)

Roles linked in library: AI Engineer
Programming Languages for Backend Systems Catalog dimension db id 140

Library dimension (catalog)

Roles linked in library: Backend Engineer
Programming Languages for Data Work Catalog dimension db id 67

Library dimension (catalog)

Roles linked in library: Data Engineer
Programming Languages for ML Systems Catalog dimension db id 113

Library dimension (catalog)

Roles linked in library: Machine Learning Engineer
Programming Languages for Security Work Catalog dimension db id 328

Library dimension (catalog)

Roles linked in library: Cybersecurity Engineer
Programming Languages for Test Automation Catalog dimension db id 193

Library dimension (catalog)

Roles linked in library: Automation Tester
Security Automation and Scripting Catalog dimension db id 258

Library dimension (catalog)

Roles linked in library: Cybersecurity Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Analytical Programming Languages analytical-programming-languages	✓	✓	Existing dimension (library) · Role↔dimension saved
Automation Scripting and CLI automation-scripting-and-cli	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Automation and Scripting for Operations automation-and-scripting-for-operations	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Network Automation and Scripting network-automation-and-scripting	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for AI Workflows programming-languages-for-ai-workflows	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for Backend Systems programming-languages-for-backend-systems	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for Data Work programming-languages-for-data-work	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for ML Systems programming-languages-for-ml-systems	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for Security Work programming-languages-for-security-work	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for Test Automation programming-languages-for-test-automation	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Security Automation and Scripting security-automation-and-scripting	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

TensorFlow Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: TensorFlow id=558 · tensorflow

Aliases — catalog

shader graphs (CANONICAL) primary

Context tags (catalog)

GLSL HLSL PBR UV mapping fragment shader material editor node graph node-based normal map procedural texturing render pipeline shader compiler tessellation vertex shader visual scripting

Stored enrichment (catalog DB)

Category: Framework
Sub-category: Visual Shader Authoring Framework
Vendor: Unity Technologies
License: proprietary
Year introduced: 2018
Confidence: 0.74
Version strategy: NOT_APPLICABLE

Maturity reasoning: Shader graphs appear in some Unity/Unreal and VFX job postings, but JD volume is far below core graphics skills like HLSL/GLSL; market use is concentrated in game/real-time rendering teams.

Skill profile (library / DB)

Skill nature: LIBRARY
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 6
Sub-category id: 456
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Applied Machine Learning Toolkits Catalog dimension db id 94

Library dimension (catalog)

Roles linked in library: Data Scientist

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Applied Machine Learning Toolkits applied-machine-learning-toolkits	✓	✓	Existing dimension (library) · Role↔dimension saved

PyTorch Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: PyTorch id=557 · pytorch

Aliases — catalog

GLSL (CANONICAL) primary

Context tags (catalog)

GPU HLSL OpenGL SPIR-V Vulkan WebGL compute shader fragment shader rendering pipeline shader shader pipeline texture sampling uniform varying vertex shader

Stored enrichment (catalog DB)

Category: Language
Sub-category: Shader Language
Vendor: Khronos Group
License: other_open
Year introduced: 2004
Confidence: 0.99
Version strategy: NOT_APPLICABLE

Maturity reasoning: GLSL appears in graphics/game-engine JDs but at much lower volume than mainstream languages; it’s specialized for shader programming and often replaced in newer pipelines by HLSL/Metal Shading Language or higher-level abstractions.

Skill profile (library / DB)

Skill nature: LIBRARY
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 6
Sub-category id: 456
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Applied Machine Learning Toolkits Catalog dimension db id 94

Library dimension (catalog)

Roles linked in library: Data Scientist

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Applied Machine Learning Toolkits applied-machine-learning-toolkits	✓	✓	Existing dimension (library) · Role↔dimension saved

Scikit-learn Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: scikit-learn id=554 · scikit-learn

Aliases — catalog

post-processing (CANONICAL) primary

Context tags (catalog)

GPU anti-aliasing bloom color grading compositing depth of field fragment shader framebuffer image filtering motion blur render pipeline render target screen-space shader tone mapping

Stored enrichment (catalog DB)

Category: Concept
Sub-category: Graphics Effect Concept
Confidence: 0.86
Version strategy: NOT_APPLICABLE

Maturity reasoning: Job postings rarely list "post-processing" as a standalone skill; it appears mainly in graphics/VFX roles, while broader JDs usually specify tools like Unreal/Unity or Photoshop instead.

Skill profile (library / DB)

Skill nature: LIBRARY
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 6
Sub-category id: 458
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Applied Machine Learning Toolkits Catalog dimension db id 94

Library dimension (catalog)

Roles linked in library: Data Scientist

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Applied Machine Learning Toolkits applied-machine-learning-toolkits	✓	✓	Existing dimension (library) · Role↔dimension saved

REST Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: REST id=121 · rest

Aliases — catalog

DStreams (VERSION)
Spark 2.x (VERSION)
Spark 3.x (VERSION)
Spark Streaming (VERSION)
Spark Structured Streaming (VERSION)
Structured Streaming (VERSION)

Context tags (catalog)

DStreams Kafka Kinesis Structured Streaming backpressure checkpointing event time exactly-once micro-batch stateful processing streaming ETL trigger intervals watermarking window functions windowing

Stored enrichment (catalog DB)

Category: Framework
Sub-category: Stream Processing Framework
Vendor: Apache Software Foundation
License: apache_2
Year introduced: 2013
Confidence: 0.90
Version strategy: SEPARATE_ENTITY
Version tag: Structured Streaming (Spark 2.0+)

Maturity reasoning: JD volume is far lower than Structured Streaming; most Spark streaming roles now specify Structured Streaming or Kafka/Flink, and Spark docs position Spark Streaming as the older API.

Skill profile (library / DB)

Skill nature: PROTOCOL
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 8
Sub-category id: 67
Extractable: True
Also category: False

Dimensions (API 2 worklist)

API Integration and Data Fetching Catalog dimension db id 9

Library dimension (catalog)

Roles linked in library: Frontend Engineer, Full Stack Developer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
API Integration and Data Fetching api-integration-and-data-fetching	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Docker Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Docker id=153 · docker

Aliases — catalog

Metabase (CANONICAL) primary

Context tags (catalog)

BigQuery MySQL PostgreSQL Redshift SQL ad hoc analysis cards collections dashboards data visualization embedded analytics filters questions segments self-service BI

Stored enrichment (catalog DB)

Category: Tool
Sub-category: Bi Analytics Tool
Vendor: Metabase, Inc.
License: apache_2
Year introduced: 2014
Confidence: 0.90
Version strategy: NOT_APPLICABLE

Maturity reasoning: Metabase appears in many BI/analytics job postings and is growing in GitHub usage, but it is still far less universal than Tableau/Power BI in enterprise JDs.

Skill profile (library / DB)

Skill nature: TOOL
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 11
Sub-category id: 170
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Containerization and Image Delivery Catalog dimension db id 24

Library dimension (catalog)

Roles linked in library: DevOps Engineer
Model Serving Deployment and Runtime Packaging Catalog dimension db id 52

Library dimension (catalog)

Roles linked in library: MLOps Engineer, Machine Learning Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Containerization and Image Delivery containerization-and-image-delivery	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Model Serving Deployment and Runtime Packaging model-serving-deployment-and-runtime-packaging	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Kubernetes Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Kubernetes id=158 · kubernetes

Aliases — catalog

Column-level security (CANONICAL) primary

Context tags (catalog)

ABAC PII access policies attribute-based access control audit logging data governance data masking database permissions dynamic masking fine-grained access control least privilege policy enforcement row-level security sensitive data static masking

Stored enrichment (catalog DB)

Category: Concept
Sub-category: Access Control Concept
Confidence: 0.90
Version strategy: NOT_APPLICABLE

Maturity reasoning: Appears in cloud/data platform JDs and vendor docs for Snowflake, BigQuery, and PostgreSQL RLS/column masking, but is not yet a universal hiring staple like core IAM or RBAC.

Skill profile (library / DB)

Skill nature: PLATFORM
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 13
Sub-category id: 1524
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Orchestration Platforms Catalog dimension db id 25

Library dimension (catalog)

Roles linked in library: Cloud Engineer, DevOps Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Orchestration Platforms orchestration-platforms	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

SQL Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: SQL id=2601 · sql

Aliases — from this run (catalog unavailable)

SQL (CANONICAL)

Skill profile (library / DB)

Skill nature: LANGUAGE
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 5
Sub-category id: 55
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Relational Data Modeling Catalog dimension db id 71

Library dimension (catalog)

Roles linked in library: Backend Engineer, Data Engineer
Version Control Systems Catalog dimension db id 365

Library dimension (catalog)

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Relational Data Modeling relational-data-modeling	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Version Control Systems d_init_01	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

NoSQL Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

Maturity well_known confidence 0.93

Broadly listed across job postings for MongoDB, DynamoDB, Cassandra, and Redis; cloud vendors and hiring pipelines treat NoSQL as a standard database category rather than a niche skill.

Vendor & license

·since 1998 (0.93)

Context keywords

MongoDB Cassandra DynamoDB Redis document store key-value store column-family sharding replication CAP theorem eventual consistency schema-less aggregation pipeline TTL index MapReduce

Ambiguity low

NoSQL is a well-established database paradigm and is usually mentioned in a clear data-storage context. In typical JDs it is unlikely to be mistaken for a different catalog skill.

Versioning

Not versioned

Type assignment

Concept ·non_relational_database_concept confidence 0.90

NoSQL is fundamentally a knowledge unit describing non-relational database approaches, so by the Concept vs Methodology rule it is a Concept rather than a Datastore or Format.

Derived legacy fields

Category: Concept
Sub-category: non_relational_database_concept
Skill nature: CONCEPT
Volatility: STABLE
Typical lifespan: EVERGREEN
Version strategy: NOT_APPLICABLE

Dimensions (API 2 worklist)

Data Access and Query Optimization Catalog dimension db id 74

Library dimension (catalog)

Roles linked in library: Data Engineer
Data Access and Query Optimization Catalog dimension db id 74

Library dimension (catalog)

Roles linked in library: Data Engineer

Locked dimensions (v3 placement)

Data Access and Query Optimization
Reuses catalog slug

Covers choosing and tuning data access patterns for fast, reliable reads and writes across database systems. NoSQL belongs here because it is fundamentally about non-relational storage models, indexing, and query behavior used to retrieve data efficiently.
Data Access and Query Optimization
Reuses catalog slug

Techniques for making analytical data fast and reliable to query. This includes partitioning, clustering, indexing choices, file layout, and access-path tuning for downstream consumers.

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Data Access and Query Optimization data-access-and-query-optimization	✓	—	New skill saved · Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

AWS Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: AWS id=163 · aws

Aliases — catalog

Compaction (CANONICAL) primary

Context tags (catalog)

Bloom filter LSM tree SSTable checkpointing defragmentation garbage collection leveling log-structured merge policy segment merge storage engine tiered compaction tombstones vacuum write amplification

Stored enrichment (catalog DB)

Category: Concept
Sub-category: Storage Maintenance Concept
Confidence: 0.74
Version strategy: NOT_APPLICABLE

Maturity reasoning: Compaction is a standard storage-maintenance concept in widely used systems like LSM databases and Kafka; it appears in many JDs for Cassandra, RocksDB, and Kafka ops roles, indicating broad market demand.

Skill profile (library / DB)

Skill nature: PLATFORM
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 13
Sub-category id: 161
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Cloud Platform Operations Catalog dimension db id 26

Library dimension (catalog)

Roles linked in library: DevOps Engineer
Cloud Security Platforms Catalog dimension db id 332

Library dimension (catalog)

Roles linked in library: Cybersecurity Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Cloud Platform Operations cloud-platform-operations	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Cloud Security Platforms cloud-security-platforms	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

GCP Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: GCP id=2304 · gcp

Aliases — catalog

ASGI (CANONICAL) primary

Context tags (catalog)

ASGI app ASGI server Django Channels FastAPI HTTP/2 Starlette WebSocket application scope asyncio background tasks concurrency event loop lifespan middleware routing

Stored enrichment (catalog DB)

Category: Protocol
Sub-category: Web Application Protocol
Vendor: Django Software Foundation
License: bsd
Year introduced: 2016
Confidence: 0.95
Version strategy: NOT_APPLICABLE

Maturity reasoning: ASGI appears in many Python web JDs for async frameworks like FastAPI/Starlette, but WSGI remains the broader default in legacy stacks; market signal shows growing adoption rather than universal demand.

Skill profile (library / DB)

Skill nature: PLATFORM
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 13
Sub-category id: 161
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Cloud Security Platforms Catalog dimension db id 332

Library dimension (catalog)

Roles linked in library: Cybersecurity Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Cloud Security Platforms cloud-security-platforms	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Git Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Git id=2578 · git

Aliases — from this run (catalog unavailable)

Git (CANONICAL)

Skill profile (library / DB)

Skill nature: TOOL
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 11
Sub-category id: 2101
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Version Control Systems Catalog dimension db id 365

Library dimension (catalog)

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Version Control Systems d_init_01	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

CI/CD Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: CI/CD id=2579 · ci-cd

Aliases — from this run (catalog unavailable)

CI/CD (CANONICAL)

Skill profile (library / DB)

Skill nature: METHODOLOGY
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 7
Sub-category id: 2102
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Version Control Systems Catalog dimension db id 365

Library dimension (catalog)

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Version Control Systems d_init_01	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

MLflow Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

Maturity emerging confidence 0.86

MLflow appears in many MLOps job descriptions and is widely used in model tracking/registry, but it is not yet as universal as core cloud or data-stack tools.

Vendor & license

Databricks ·apache_2 ·since 2018 (0.98)

Context keywords

experiment tracking model registry artifact store MLproject MLflow Tracking MLflow Projects MLflow Models pyfunc model serving run metadata parameters metrics artifacts Databricks MLOps

Ambiguity low

MLflow is a specific MLOps tool with a distinctive name; in typical JDs it is unlikely to be mistaken for another catalog skill.

Versioning

Not versioned

Type assignment

Tool ·mlops_tool confidence 0.93

MLflow is software you run to manage experiments, models, and deployments, so by the Tool vs Framework rule it is a Tool rather than a framework or platform.

Derived legacy fields

Category: Tool
Sub-category: mlops_tool
Skill nature: TOOL
Volatility: EMERGING
Typical lifespan: EVERGREEN
Version strategy: NOT_APPLICABLE

Dimensions (API 2 worklist)

Model Serving Deployment, Packaging, and Runtime Operations Proposed / LLM

Proposed / LLM dimension (no DB id yet)
Version Control Systems Catalog dimension db id 365

Library dimension (catalog)
Project Delivery and Coordination Catalog dimension db id 366

Library dimension (catalog)

Locked dimensions (v3 placement)

Model Serving Deployment, Packaging, and Runtime Operations
Pipeline tentative id

Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, creating release artifacts, coordinating rollout, and handing off to inference systems. Covers model registry/versioning workflows, MLflow, serving frameworks such as TensorFlow Serving, TorchServe, Triton, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled contain
Machine Learning Experiment Tracking
Pipeline tentative id

Tools and practices for logging runs, parameters, metrics, artifacts, and model lineage during ML development. MLflow fits here because its core value is reproducible experiment tracking and comparison across training runs.
Machine Learning Model Registry
Pipeline tentative id

Systems and practices for registering, versioning, approving, and promoting machine learning models across environments. MLflow belongs here because its registry capabilities manage model lifecycle, stage transitions, and governance.

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Model Serving Deployment, Packaging, and Runtime Operations d_merge_01	✓	—	New skill saved · Existing dimension (reconciliation merge) · Role↔dimension skipped (dimension not under chosen role)
Version Control Systems d_init_01	✓	—	New skill saved · Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Project Delivery and Coordination d_init_02	✓	—	New skill saved · Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Kubeflow Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

Maturity niche confidence 0.86

Kubeflow appears in some ML platform and MLOps JDs, but far less often than Kubernetes/AWS; GitHub activity is steady yet the market signal is specialized adoption rather than broad hiring demand.

Vendor & license

Kubeflow ·apache_2 ·since 2018 (0.98)

Context keywords

Kubernetes Argo Workflows Tekton ML pipelines notebooks training jobs model serving KFServing KServe TensorFlow PyTorch hyperparameter tuning pipeline components container images GPU nodes

Ambiguity low

Kubeflow is a specific ML workflow framework with a distinctive name; in typical JDs it is unlikely to be confused with another catalog skill.

Versioning

Versioned 2.x

{
  "Kubeflow 1.x": "1.x",
  "Kubeflow 2.x": "2.x",
  "Kubeflow v1": "1.x",
  "Kubeflow v2": "2.x"
}

Type assignment

Framework ·ml_workflow_framework confidence 0.90

Kubeflow is best classified as a Framework because users build machine-learning workflows and applications on top of it rather than merely operating it as standalone software.

Derived legacy fields

Category: Framework
Sub-category: ml_workflow_framework
Skill nature: FRAMEWORK
Volatility: STABLE
Typical lifespan: EVERGREEN
Version strategy: SEPARATE_ENTITY

Dimensions (API 2 worklist)

Model Serving Deployment and Runtime Packaging Catalog dimension db id 52

Library dimension (catalog)

Roles linked in library: MLOps Engineer, Machine Learning Engineer
Version Control Systems Catalog dimension db id 365

Library dimension (catalog)

Locked dimensions (v3 placement)

Model Serving Deployment and Runtime Packaging
Reuses catalog slug

Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers and wiring them into production runtimes. Kubeflow belongs here because it is commonly used to build and run ML pipelines that culminate in deployable model artifacts and serving workflows.
Machine Learning Pipeline Orchestration
Pipeline tentative id

Building and operating end-to-end ML workflows that coordinate data prep, training, evaluation, and deployment as repeatable pipelines. Kubeflow fits here because it is a dedicated platform for composing and running ML pipelines on Kubernetes.

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Model Serving Deployment and Runtime Packaging model-serving-deployment-and-runtime-packaging	✓	—	New skill saved · Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Version Control Systems d_init_01	✓	—	New skill saved · Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

All API 3 persistence rows

Same grid as the skill-extractor “Persistence items” table: one row per (skill × dimension) work item.

Skill	Tag	Dimension	Skill↔dim	Role↔dim	Outcome
Python	in_db	Analytical Programming Languages analytical-programming-languages	✓	✓	Existing dimension (library) · Role↔dimension saved
Python	in_db	Automation Scripting and CLI automation-scripting-and-cli	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Automation and Scripting for Operations automation-and-scripting-for-operations	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Network Automation and Scripting network-automation-and-scripting	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Programming Languages for AI Workflows programming-languages-for-ai-workflows	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Programming Languages for Backend Systems programming-languages-for-backend-systems	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Programming Languages for Data Work programming-languages-for-data-work	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Programming Languages for ML Systems programming-languages-for-ml-systems	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Programming Languages for Security Work programming-languages-for-security-work	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Programming Languages for Test Automation programming-languages-for-test-automation	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Security Automation and Scripting security-automation-and-scripting	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
TensorFlow	in_db	Applied Machine Learning Toolkits applied-machine-learning-toolkits	✓	✓	Existing dimension (library) · Role↔dimension saved
PyTorch	in_db	Applied Machine Learning Toolkits applied-machine-learning-toolkits	✓	✓	Existing dimension (library) · Role↔dimension saved
Scikit-learn	in_db	Applied Machine Learning Toolkits applied-machine-learning-toolkits	✓	✓	Existing dimension (library) · Role↔dimension saved
REST	in_db	API Integration and Data Fetching api-integration-and-data-fetching	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Docker	in_db	Containerization and Image Delivery containerization-and-image-delivery	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Docker	in_db	Model Serving Deployment and Runtime Packaging model-serving-deployment-and-runtime-packaging	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Kubernetes	in_db	Orchestration Platforms orchestration-platforms	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
SQL	in_db	Relational Data Modeling relational-data-modeling	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
SQL	in_db	Version Control Systems d_init_01	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AWS	in_db	Cloud Platform Operations cloud-platform-operations	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AWS	in_db	Cloud Security Platforms cloud-security-platforms	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
GCP	in_db	Cloud Security Platforms cloud-security-platforms	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Git	in_db	Version Control Systems d_init_01	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
CI/CD	in_db	Version Control Systems d_init_01	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
NoSQL	in_db	Data Access and Query Optimization data-access-and-query-optimization	✓	—	New skill saved · Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
MLflow	in_db	Model Serving Deployment, Packaging, and Runtime Operations d_merge_01	✓	—	New skill saved · Existing dimension (reconciliation merge) · Role↔dimension skipped (dimension not under chosen role)
MLflow	in_db	Version Control Systems d_init_01	✓	—	New skill saved · Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
MLflow	in_db	Project Delivery and Coordination d_init_02	✓	—	New skill saved · Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Kubeflow	in_db	Model Serving Deployment and Runtime Packaging model-serving-deployment-and-runtime-packaging	✓	—	New skill saved · Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Kubeflow	in_db	Version Control Systems d_init_01	✓	—	New skill saved · Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Library artifacts (this run)

Kind	Detail	DB id
canonical_skill_added	NoSQL	2639
canonical_skill_added	MLflow	2640
canonical_skill_added	Kubeflow	2641
library_enrichment_backfilled	Git	2578
dimension_skill_link	NoSQL ↔ Data Access and Query Optimization	74
dimension_skill_link	MLflow ↔ Model Serving Deployment, Packaging, and Runtime Operations	52
dimension_skill_link	MLflow ↔ Version Control Systems	365
dimension_skill_link	MLflow ↔ Project Delivery and Coordination	366
dimension_skill_link	Kubeflow ↔ Model Serving Deployment and Runtime Packaging	52
dimension_skill_link	Kubeflow ↔ Version Control Systems	365

API 1 — extract-from-jd click to toggle

{
  "final_skills": [
    {
      "is_primary": true,
      "skill_name": "Python"
    },
    {
      "is_primary": false,
      "skill_name": "TensorFlow"
    },
    {
      "is_primary": false,
      "skill_name": "PyTorch"
    },
    {
      "is_primary": false,
      "skill_name": "Scikit-learn"
    },
    {
      "is_primary": false,
      "skill_name": "REST"
    },
    {
      "is_primary": false,
      "skill_name": "Docker"
    },
    {
      "is_primary": false,
      "skill_name": "Kubernetes"
    },
    {
      "is_primary": false,
      "skill_name": "SQL"
    },
    {
      "is_primary": false,
      "skill_name": "NoSQL"
    },
    {
      "is_primary": false,
      "skill_name": "AWS"
    },
    {
      "is_primary": false,
      "skill_name": "GCP"
    },
    {
      "is_primary": false,
      "skill_name": "Git"
    },
    {
      "is_primary": false,
      "skill_name": "CI/CD"
    },
    {
      "is_primary": false,
      "skill_name": "MLflow"
    },
    {
      "is_primary": false,
      "skill_name": "Kubeflow"
    }
  ],
  "run_id": null
}

API 2 — extract-details

{
  "alias_matches": [
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 608,
      "existing_alias_text": "Python",
      "input_term": "Python",
      "matched_canonical": {
        "category_id": 5,
        "display_name": "Python",
        "id": 393,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LANGUAGE",
        "slug": "python",
        "sub_category_id": 54,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 862,
      "existing_alias_text": "TensorFlow",
      "input_term": "TensorFlow",
      "matched_canonical": {
        "category_id": 6,
        "display_name": "TensorFlow",
        "id": 558,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "tensorflow",
        "sub_category_id": 456,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 861,
      "existing_alias_text": "PyTorch",
      "input_term": "PyTorch",
      "matched_canonical": {
        "category_id": 6,
        "display_name": "PyTorch",
        "id": 557,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "pytorch",
        "sub_category_id": 456,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 852,
      "existing_alias_text": "scikit-learn",
      "input_term": "Scikit-learn",
      "matched_canonical": {
        "category_id": 6,
        "display_name": "scikit-learn",
        "id": 554,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "scikit-learn",
        "sub_category_id": 458,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 266,
      "existing_alias_text": "REST",
      "input_term": "REST",
      "matched_canonical": {
        "category_id": 8,
        "display_name": "REST",
        "id": 121,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PROTOCOL",
        "slug": "rest",
        "sub_category_id": 67,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 299,
      "existing_alias_text": "Docker",
      "input_term": "Docker",
      "matched_canonical": {
        "category_id": 11,
        "display_name": "Docker",
        "id": 153,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "docker",
        "sub_category_id": 170,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 304,
      "existing_alias_text": "Kubernetes",
      "input_term": "Kubernetes",
      "matched_canonical": {
        "category_id": 13,
        "display_name": "Kubernetes",
        "id": 158,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "kubernetes",
        "sub_category_id": 1524,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3398,
      "existing_alias_text": "SQL",
      "input_term": "SQL",
      "matched_canonical": {
        "category_id": 5,
        "display_name": "SQL",
        "id": 2601,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LANGUAGE",
        "slug": "sql",
        "sub_category_id": 55,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 348,
      "existing_alias_text": "AWS",
      "input_term": "AWS",
      "matched_canonical": {
        "category_id": 13,
        "display_name": "AWS",
        "id": 163,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "aws",
        "sub_category_id": 161,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3043,
      "existing_alias_text": "GCP",
      "input_term": "GCP",
      "matched_canonical": {
        "category_id": 13,
        "display_name": "GCP",
        "id": 2304,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "gcp",
        "sub_category_id": 161,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3375,
      "existing_alias_text": "Git",
      "input_term": "Git",
      "matched_canonical": {
        "category_id": 11,
        "display_name": "Git",
        "id": 2578,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "git",
        "sub_category_id": 2101,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3376,
      "existing_alias_text": "CI/CD",
      "input_term": "CI/CD",
      "matched_canonical": {
        "category_id": 7,
        "display_name": "CI/CD",
        "id": 2579,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "METHODOLOGY",
        "slug": "ci-cd",
        "sub_category_id": 2102,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    }
  ],
  "candidate_roles": [
    {
      "display_name": "Data Analyst",
      "id": 20,
      "rationale": null,
      "role_archetype": null,
      "slug": "data-analyst",
      "source": "db"
    },
    {
      "display_name": "Data Scientist",
      "id": 7,
      "rationale": null,
      "role_archetype": null,
      "slug": "data-scientist",
      "source": "db"
    },
    {
      "display_name": "Azure Cloud Engineer",
      "id": 4,
      "rationale": null,
      "role_archetype": null,
      "slug": "azure-cloud-engineer",
      "source": "db"
    },
    {
      "display_name": "Cloud Engineer",
      "id": 18,
      "rationale": null,
      "role_archetype": null,
      "slug": "cloud-engineer",
      "source": "db"
    },
    {
      "display_name": "Virtualization Engineer",
      "id": 26,
      "rationale": null,
      "role_archetype": null,
      "slug": "virtualization-engineer",
      "source": "db"
    },
    {
      "display_name": "Network Engineer",
      "id": 21,
      "rationale": null,
      "role_archetype": null,
      "slug": "network-engineer",
      "source": "db"
    },
    {
      "display_name": "AI Engineer",
      "id": 12,
      "rationale": null,
      "role_archetype": null,
      "slug": "ai-engineer",
      "source": "db"
    },
    {
      "display_name": "Backend Engineer",
      "id": 14,
      "rationale": null,
      "role_archetype": null,
      "slug": "backend-engineer",
      "source": "db"
    },
    {
      "display_name": "Data Engineer",
      "id": 6,
      "rationale": null,
      "role_archetype": null,
      "slug": "data-engineer",
      "source": "db"
    },
    {
      "display_name": "Machine Learning Engineer",
      "id": 10,
      "rationale": null,
      "role_archetype": null,
      "slug": "machine-learning-engineer",
      "source": "db"
    },
    {
      "display_name": "Cybersecurity Engineer",
      "id": 9,
      "rationale": null,
      "role_archetype": null,
      "slug": "cybersecurity-engineer",
      "source": "db"
    },
    {
      "display_name": "Automation Tester",
      "id": 16,
      "rationale": null,
      "role_archetype": null,
      "slug": "automation-tester",
      "source": "db"
    },
    {
      "display_name": "Frontend Engineer",
      "id": 3,
      "rationale": null,
      "role_archetype": "Frontend Engineers design and build the user-facing parts of applications, translating product and design requirements into interactive experiences. They focus on how the application looks, behaves, and responds in the browser, ensuring usability, accessibility, and consistency across the interface.",
      "slug": "frontend-engineer",
      "source": "db"
    },
    {
      "display_name": "Full Stack Developer",
      "id": 2,
      "rationale": null,
      "role_archetype": null,
      "slug": "full-stack-developer",
      "source": "db"
    },
    {
      "display_name": "DevOps Engineer",
      "id": 1,
      "rationale": null,
      "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
      "slug": "devops-engineer",
      "source": "db"
    },
    {
      "display_name": "MLOps Engineer",
      "id": 5,
      "rationale": null,
      "role_archetype": null,
      "slug": "mlops-engineer",
      "source": "db"
    }
  ],
  "chosen_role": {
    "display_name": "Data Scientist",
    "id": 7,
    "rationale": "Data Scientist is the most fitting role as it emphasizes the primary skill of Python and encompasses advanced analytical capabilities.",
    "role_archetype": null,
    "slug": "data-scientist",
    "source": "db"
  },
  "dimensions": [
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Analytical Programming Languages",
        "id": 82,
        "rationale": "Languages used to clean, transform, analyze, and prototype models in notebooks and scripts. This is the core coding surface for expressing statistical logic and data manipulation in a reproducible way.",
        "slug": "analytical-programming-languages",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Analyst",
          "id": 20,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-analyst",
          "source": "db"
        },
        {
          "display_name": "Data Scientist",
          "id": 7,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-scientist",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Automation Scripting and CLI",
        "id": 48,
        "rationale": "Uses scripts and command-line tooling to execute repeatable Azure operations and reduce manual work. This is a practical cluster because the role frequently automates provisioning, checks, and remediation tasks.",
        "slug": "automation-scripting-and-cli",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Azure Cloud Engineer",
          "id": 4,
          "rationale": null,
          "role_archetype": null,
          "slug": "azure-cloud-engineer",
          "source": "db"
        },
        {
          "display_name": "Cloud Engineer",
          "id": 18,
          "rationale": null,
          "role_archetype": null,
          "slug": "cloud-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Automation and Scripting for Operations",
        "id": 361,
        "rationale": "Scripts and lightweight automation used to execute repetitive virtualization tasks and enforce operational consistency. This is the practical glue that reduces manual host and VM administration.",
        "slug": "automation-and-scripting-for-operations",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Virtualization Engineer",
          "id": 26,
          "rationale": null,
          "role_archetype": null,
          "slug": "virtualization-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Network Automation and Scripting",
        "id": 285,
        "rationale": "Covers scripts and automation used to configure, validate, and audit network devices and services. This cluster is coherent because repeatable network operations increasingly depend on programmatic changes and checks.",
        "slug": "network-automation-and-scripting",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Network Engineer",
          "id": 21,
          "rationale": null,
          "role_archetype": null,
          "slug": "network-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for AI Workflows",
        "id": 261,
        "rationale": "Languages used to implement AI feature logic, orchestration, and response handling inside product code. This is the core coding surface for turning prompts and model calls into reliable application behavior.",
        "slug": "programming-languages-for-ai-workflows",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "AI Engineer",
          "id": 12,
          "rationale": null,
          "role_archetype": null,
          "slug": "ai-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for Backend Systems",
        "id": 140,
        "rationale": "Languages used to implement server-side business logic, request handlers, workers, and service integrations. This is the core coding surface for backend feature delivery and maintenance.",
        "slug": "programming-languages-for-backend-systems",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Backend Engineer",
          "id": 14,
          "rationale": null,
          "role_archetype": null,
          "slug": "backend-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for Data Work",
        "id": 67,
        "rationale": "Languages used to implement data pipelines, transformations, and operational utilities. This is the code layer for expressing extraction, parsing, validation, and orchestration logic in data engineering workflows.",
        "slug": "programming-languages-for-data-work",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for ML Systems",
        "id": 113,
        "rationale": "Languages used to implement model integration code, inference services, and feature-processing logic. This is the core coding surface for turning trained models into product-facing software components.",
        "slug": "programming-languages-for-ml-systems",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Machine Learning Engineer",
          "id": 10,
          "rationale": null,
          "role_archetype": null,
          "slug": "machine-learning-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for Security Work",
        "id": 328,
        "rationale": "Languages used to automate security tasks, write detection logic, and build analysis or remediation tooling. This is the core coding surface for a cybersecurity engineer across scripts, queries, and small utilities.",
        "slug": "programming-languages-for-security-work",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cybersecurity Engineer",
          "id": 9,
          "rationale": null,
          "role_archetype": null,
          "slug": "cybersecurity-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for Test Automation",
        "id": 193,
        "rationale": "Languages used to implement automated checks, helper utilities, and test harness code. This is the core coding surface for turning test ideas into maintainable automation.",
        "slug": "programming-languages-for-test-automation",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Automation Tester",
          "id": 16,
          "rationale": null,
          "role_archetype": null,
          "slug": "automation-tester",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Security Automation and Scripting",
        "id": 258,
        "rationale": "Automating repeatable security checks, enrichment, and remediation workflows. This cluster is coherent because the role often needs lightweight automation to scale analysis and response.",
        "slug": "security-automation-and-scripting",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cybersecurity Engineer",
          "id": 9,
          "rationale": null,
          "role_archetype": null,
          "slug": "cybersecurity-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Applied Machine Learning Toolkits",
        "id": 94,
        "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
        "slug": "applied-machine-learning-toolkits",
        "source": "db"
      },
      "input_skill": "TensorFlow",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Scientist",
          "id": 7,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-scientist",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Applied Machine Learning Toolkits",
        "id": 94,
        "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
        "slug": "applied-machine-learning-toolkits",
        "source": "db"
      },
      "input_skill": "PyTorch",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Scientist",
          "id": 7,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-scientist",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Applied Machine Learning Toolkits",
        "id": 94,
        "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
        "slug": "applied-machine-learning-toolkits",
        "source": "db"
      },
      "input_skill": "Scikit-learn",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Scientist",
          "id": 7,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-scientist",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "API Integration and Data Fetching",
        "id": 9,
        "rationale": "Connecting frontend applications to backend services and third-party endpoints. This covers request orchestration, error handling, pagination, and shaping remote data for UI consumption.",
        "slug": "api-integration-and-data-fetching",
        "source": "db"
      },
      "input_skill": "REST",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Frontend Engineer",
          "id": 3,
          "rationale": null,
          "role_archetype": "Frontend Engineers design and build the user-facing parts of applications, translating product and design requirements into interactive experiences. They focus on how the application looks, behaves, and responds in the browser, ensuring usability, accessibility, and consistency across the interface.",
          "slug": "frontend-engineer",
          "source": "db"
        },
        {
          "display_name": "Full Stack Developer",
          "id": 2,
          "rationale": null,
          "role_archetype": null,
          "slug": "full-stack-developer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Containerization and Image Delivery",
        "id": 24,
        "rationale": "Builds, packages, and ships application and support workloads as container images. This cluster covers the artifact format and the mechanics of producing deployable images.",
        "slug": "containerization-and-image-delivery",
        "source": "db"
      },
      "input_skill": "Docker",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "DevOps Engineer",
          "id": 1,
          "rationale": null,
          "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
          "slug": "devops-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Model Serving Deployment and Runtime Packaging",
        "id": 52,
        "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
        "slug": "model-serving-deployment-and-runtime-packaging",
        "source": "db"
      },
      "input_skill": "Docker",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "MLOps Engineer",
          "id": 5,
          "rationale": null,
          "role_archetype": null,
          "slug": "mlops-engineer",
          "source": "db"
        },
        {
          "display_name": "Machine Learning Engineer",
          "id": 10,
          "rationale": null,
          "role_archetype": null,
          "slug": "machine-learning-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Orchestration Platforms",
        "id": 25,
        "rationale": "Operates the platforms that schedule and run containerized workloads and related deployment primitives. This is separate from image delivery because it concerns runtime placement and service rollout behavior.",
        "slug": "orchestration-platforms",
        "source": "db"
      },
      "input_skill": "Kubernetes",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cloud Engineer",
          "id": 18,
          "rationale": null,
          "role_archetype": null,
          "slug": "cloud-engineer",
          "source": "db"
        },
        {
          "display_name": "DevOps Engineer",
          "id": 1,
          "rationale": null,
          "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
          "slug": "devops-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Relational Data Modeling",
        "id": 71,
        "rationale": "Designing tables, relationships, constraints, and transactional data shapes for operational backend systems. This cluster is coherent because backend services frequently own the canonical application data model.",
        "slug": "relational-data-modeling",
        "source": "db"
      },
      "input_skill": "SQL",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Backend Engineer",
          "id": 14,
          "rationale": null,
          "role_archetype": null,
          "slug": "backend-engineer",
          "source": "db"
        },
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Version Control Systems",
        "id": 365,
        "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
        "slug": "d_init_01",
        "source": "db"
      },
      "input_skill": "SQL",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Platform Operations",
        "id": 26,
        "rationale": "Uses cloud provider services to support delivery and runtime environments. The focus is on consumer-level operation of cloud services rather than deep cloud architecture ownership.",
        "slug": "cloud-platform-operations",
        "source": "db"
      },
      "input_skill": "AWS",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "DevOps Engineer",
          "id": 1,
          "rationale": null,
          "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
          "slug": "devops-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Security Platforms",
        "id": 332,
        "rationale": "Cloud-native security products used to assess posture, detect misconfigurations, and monitor workloads across AWS, Azure, and GCP. This is a distinct product family because the role often works across multiple CNAPP/CSPM/CWPP offerings and cloud-native detectors.",
        "slug": "cloud-security-platforms",
        "source": "db"
      },
      "input_skill": "AWS",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cybersecurity Engineer",
          "id": 9,
          "rationale": null,
          "role_archetype": null,
          "slug": "cybersecurity-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Security Platforms",
        "id": 332,
        "rationale": "Cloud-native security products used to assess posture, detect misconfigurations, and monitor workloads across AWS, Azure, and GCP. This is a distinct product family because the role often works across multiple CNAPP/CSPM/CWPP offerings and cloud-native detectors.",
        "slug": "cloud-security-platforms",
        "source": "db"
      },
      "input_skill": "GCP",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cybersecurity Engineer",
          "id": 9,
          "rationale": null,
          "role_archetype": null,
          "slug": "cybersecurity-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Version Control Systems",
        "id": 365,
        "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
        "slug": "d_init_01",
        "source": "db"
      },
      "input_skill": "Git",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Version Control Systems",
        "id": 365,
        "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
        "slug": "d_init_01",
        "source": "db"
      },
      "input_skill": "CI/CD",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Data Access and Query Optimization",
        "id": 74,
        "rationale": "Techniques for making analytical data fast and reliable to query. This includes partitioning, clustering, indexing choices, file layout, and access-path tuning for downstream consumers.",
        "slug": "data-access-and-query-optimization",
        "source": "db"
      },
      "input_skill": "NoSQL",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Data Access and Query Optimization",
        "id": 74,
        "rationale": "Techniques for making analytical data fast and reliable to query. This includes partitioning, clustering, indexing choices, file layout, and access-path tuning for downstream consumers.",
        "slug": "data-access-and-query-optimization",
        "source": "db"
      },
      "input_skill": "NoSQL",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": null,
        "display_name": "Model Serving Deployment, Packaging, and Runtime Operations",
        "id": null,
        "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, creating release artifacts, coordinating rollout, and handing off to inference systems. Covers model registry/versioning workflows, MLflow, serving frameworks such as TensorFlow Serving, TorchServe, Triton, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base images, entrypoints, dependencies, and image scanning.",
        "slug": "d_merge_01",
        "source": "llm"
      },
      "input_skill": "MLflow",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Version Control Systems",
        "id": 365,
        "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
        "slug": "d_init_01",
        "source": "db"
      },
      "input_skill": "MLflow",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Project Delivery and Coordination",
        "id": 366,
        "rationale": "Coordination practices for organizing work, tracking progress, and aligning stakeholders across a delivery effort. Agile fits here when used as a team execution framework for managing scope, cadence, and collaboration.",
        "slug": "d_init_02",
        "source": "db"
      },
      "input_skill": "MLflow",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Model Serving Deployment and Runtime Packaging",
        "id": 52,
        "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
        "slug": "model-serving-deployment-and-runtime-packaging",
        "source": "db"
      },
      "input_skill": "Kubeflow",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "MLOps Engineer",
          "id": 5,
          "rationale": null,
          "role_archetype": null,
          "slug": "mlops-engineer",
          "source": "db"
        },
        {
          "display_name": "Machine Learning Engineer",
          "id": 10,
          "rationale": null,
          "role_archetype": null,
          "slug": "machine-learning-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Version Control Systems",
        "id": 365,
        "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
        "slug": "d_init_01",
        "source": "db"
      },
      "input_skill": "Kubeflow",
      "llm_role": null,
      "roles_from_db": []
    }
  ],
  "input_final_skills": [
    "Python",
    "TensorFlow",
    "PyTorch",
    "Scikit-learn",
    "REST",
    "Docker",
    "Kubernetes",
    "SQL",
    "NoSQL",
    "AWS",
    "GCP",
    "Git",
    "CI/CD",
    "MLflow",
    "Kubeflow"
  ],
  "input_llm_skills": [
    "Python",
    "TensorFlow",
    "PyTorch",
    "Scikit-learn",
    "REST",
    "Docker",
    "Kubernetes",
    "SQL",
    "NoSQL",
    "AWS",
    "GCP",
    "Git",
    "CI/CD",
    "MLflow",
    "Kubeflow"
  ],
  "new_aliases_persisted": 0,
  "run_id": "325b3712-22a1-4837-9bfe-bdc7cb1fbce6",
  "skills_detail": [
    {
      "aliases_in_db": [
        {
          "alias_text": "Python",
          "alias_type": "CANONICAL",
          "id": 608,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 2",
          "alias_type": "VERSION",
          "id": 611,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 2.x",
          "alias_type": "VERSION",
          "id": 613,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 3",
          "alias_type": "VERSION",
          "id": 612,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 3.10",
          "alias_type": "VERSION",
          "id": 2330,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 3.11",
          "alias_type": "VERSION",
          "id": 2331,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 3.12",
          "alias_type": "VERSION",
          "id": 2332,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 3.x",
          "alias_type": "VERSION",
          "id": 614,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "py2",
          "alias_type": "VERSION",
          "id": 609,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "py3",
          "alias_type": "VERSION",
          "id": 610,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 2",
          "alias_type": "VERSION",
          "id": 2152,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 2.x",
          "alias_type": "VERSION",
          "id": 2154,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 3",
          "alias_type": "VERSION",
          "id": 990,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 3.10",
          "alias_type": "VERSION",
          "id": 992,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 3.11",
          "alias_type": "VERSION",
          "id": 993,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 3.12",
          "alias_type": "VERSION",
          "id": 994,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 3.x",
          "alias_type": "VERSION",
          "id": 991,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python2",
          "alias_type": "VERSION",
          "id": 2150,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python3",
          "alias_type": "VERSION",
          "id": 989,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 5,
        "display_name": "Python",
        "id": 393,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LANGUAGE",
        "slug": "python",
        "sub_category_id": 54,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Analytical Programming Languages",
            "id": 82,
            "rationale": "Languages used to clean, transform, analyze, and prototype models in notebooks and scripts. This is the core coding surface for expressing statistical logic and data manipulation in a reproducible way.",
            "slug": "analytical-programming-languages",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Analyst",
              "id": 20,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-analyst",
              "source": "db"
            },
            {
              "display_name": "Data Scientist",
              "id": 7,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-scientist",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Automation Scripting and CLI",
            "id": 48,
            "rationale": "Uses scripts and command-line tooling to execute repeatable Azure operations and reduce manual work. This is a practical cluster because the role frequently automates provisioning, checks, and remediation tasks.",
            "slug": "automation-scripting-and-cli",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Azure Cloud Engineer",
              "id": 4,
              "rationale": null,
              "role_archetype": null,
              "slug": "azure-cloud-engineer",
              "source": "db"
            },
            {
              "display_name": "Cloud Engineer",
              "id": 18,
              "rationale": null,
              "role_archetype": null,
              "slug": "cloud-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Automation and Scripting for Operations",
            "id": 361,
            "rationale": "Scripts and lightweight automation used to execute repetitive virtualization tasks and enforce operational consistency. This is the practical glue that reduces manual host and VM administration.",
            "slug": "automation-and-scripting-for-operations",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Virtualization Engineer",
              "id": 26,
              "rationale": null,
              "role_archetype": null,
              "slug": "virtualization-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Network Automation and Scripting",
            "id": 285,
            "rationale": "Covers scripts and automation used to configure, validate, and audit network devices and services. This cluster is coherent because repeatable network operations increasingly depend on programmatic changes and checks.",
            "slug": "network-automation-and-scripting",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Network Engineer",
              "id": 21,
              "rationale": null,
              "role_archetype": null,
              "slug": "network-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for AI Workflows",
            "id": 261,
            "rationale": "Languages used to implement AI feature logic, orchestration, and response handling inside product code. This is the core coding surface for turning prompts and model calls into reliable application behavior.",
            "slug": "programming-languages-for-ai-workflows",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "AI Engineer",
              "id": 12,
              "rationale": null,
              "role_archetype": null,
              "slug": "ai-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for Backend Systems",
            "id": 140,
            "rationale": "Languages used to implement server-side business logic, request handlers, workers, and service integrations. This is the core coding surface for backend feature delivery and maintenance.",
            "slug": "programming-languages-for-backend-systems",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Backend Engineer",
              "id": 14,
              "rationale": null,
              "role_archetype": null,
              "slug": "backend-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for Data Work",
            "id": 67,
            "rationale": "Languages used to implement data pipelines, transformations, and operational utilities. This is the code layer for expressing extraction, parsing, validation, and orchestration logic in data engineering workflows.",
            "slug": "programming-languages-for-data-work",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for ML Systems",
            "id": 113,
            "rationale": "Languages used to implement model integration code, inference services, and feature-processing logic. This is the core coding surface for turning trained models into product-facing software components.",
            "slug": "programming-languages-for-ml-systems",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Machine Learning Engineer",
              "id": 10,
              "rationale": null,
              "role_archetype": null,
              "slug": "machine-learning-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for Security Work",
            "id": 328,
            "rationale": "Languages used to automate security tasks, write detection logic, and build analysis or remediation tooling. This is the core coding surface for a cybersecurity engineer across scripts, queries, and small utilities.",
            "slug": "programming-languages-for-security-work",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cybersecurity Engineer",
              "id": 9,
              "rationale": null,
              "role_archetype": null,
              "slug": "cybersecurity-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for Test Automation",
            "id": 193,
            "rationale": "Languages used to implement automated checks, helper utilities, and test harness code. This is the core coding surface for turning test ideas into maintainable automation.",
            "slug": "programming-languages-for-test-automation",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Automation Tester",
              "id": 16,
              "rationale": null,
              "role_archetype": null,
              "slug": "automation-tester",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Security Automation and Scripting",
            "id": 258,
            "rationale": "Automating repeatable security checks, enrichment, and remediation workflows. This cluster is coherent because the role often needs lightweight automation to scale analysis and response.",
            "slug": "security-automation-and-scripting",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cybersecurity Engineer",
              "id": 9,
              "rationale": null,
              "role_archetype": null,
              "slug": "cybersecurity-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Python",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "TensorFlow",
          "alias_type": "CANONICAL",
          "id": 862,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TF1",
          "alias_type": "VERSION",
          "id": 863,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TF2",
          "alias_type": "VERSION",
          "id": 864,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TensorFlow 1",
          "alias_type": "VERSION",
          "id": 865,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TensorFlow 1.x",
          "alias_type": "VERSION",
          "id": 867,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TensorFlow 2",
          "alias_type": "VERSION",
          "id": 866,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TensorFlow 2.x",
          "alias_type": "VERSION",
          "id": 868,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 6,
        "display_name": "TensorFlow",
        "id": 558,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "tensorflow",
        "sub_category_id": 456,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Applied Machine Learning Toolkits",
            "id": 94,
            "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
            "slug": "applied-machine-learning-toolkits",
            "source": "db"
          },
          "input_skill": "TensorFlow",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Scientist",
              "id": 7,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-scientist",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "TensorFlow",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "PyTorch",
          "alias_type": "CANONICAL",
          "id": 861,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 6,
        "display_name": "PyTorch",
        "id": 557,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "pytorch",
        "sub_category_id": 456,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Applied Machine Learning Toolkits",
            "id": 94,
            "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
            "slug": "applied-machine-learning-toolkits",
            "source": "db"
          },
          "input_skill": "PyTorch",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Scientist",
              "id": 7,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-scientist",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "PyTorch",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "scikit-learn",
          "alias_type": "CANONICAL",
          "id": 852,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 6,
        "display_name": "scikit-learn",
        "id": 554,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "scikit-learn",
        "sub_category_id": 458,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Applied Machine Learning Toolkits",
            "id": 94,
            "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
            "slug": "applied-machine-learning-toolkits",
            "source": "db"
          },
          "input_skill": "Scikit-learn",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Scientist",
              "id": 7,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-scientist",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Scikit-learn",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "REST",
          "alias_type": "CANONICAL",
          "id": 266,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 8,
        "display_name": "REST",
        "id": 121,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PROTOCOL",
        "slug": "rest",
        "sub_category_id": 67,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "API Integration and Data Fetching",
            "id": 9,
            "rationale": "Connecting frontend applications to backend services and third-party endpoints. This covers request orchestration, error handling, pagination, and shaping remote data for UI consumption.",
            "slug": "api-integration-and-data-fetching",
            "source": "db"
          },
          "input_skill": "REST",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Frontend Engineer",
              "id": 3,
              "rationale": null,
              "role_archetype": "Frontend Engineers design and build the user-facing parts of applications, translating product and design requirements into interactive experiences. They focus on how the application looks, behaves, and responds in the browser, ensuring usability, accessibility, and consistency across the interface.",
              "slug": "frontend-engineer",
              "source": "db"
            },
            {
              "display_name": "Full Stack Developer",
              "id": 2,
              "rationale": null,
              "role_archetype": null,
              "slug": "full-stack-developer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "REST",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Docker",
          "alias_type": "CANONICAL",
          "id": 299,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 11,
        "display_name": "Docker",
        "id": 153,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "docker",
        "sub_category_id": 170,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Containerization and Image Delivery",
            "id": 24,
            "rationale": "Builds, packages, and ships application and support workloads as container images. This cluster covers the artifact format and the mechanics of producing deployable images.",
            "slug": "containerization-and-image-delivery",
            "source": "db"
          },
          "input_skill": "Docker",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "DevOps Engineer",
              "id": 1,
              "rationale": null,
              "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
              "slug": "devops-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Model Serving Deployment and Runtime Packaging",
            "id": 52,
            "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
            "slug": "model-serving-deployment-and-runtime-packaging",
            "source": "db"
          },
          "input_skill": "Docker",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "MLOps Engineer",
              "id": 5,
              "rationale": null,
              "role_archetype": null,
              "slug": "mlops-engineer",
              "source": "db"
            },
            {
              "display_name": "Machine Learning Engineer",
              "id": 10,
              "rationale": null,
              "role_archetype": null,
              "slug": "machine-learning-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Docker",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Kubernetes",
          "alias_type": "CANONICAL",
          "id": 304,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.0",
          "alias_type": "VERSION",
          "id": 307,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.0+",
          "alias_type": "VERSION",
          "id": 2366,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.1",
          "alias_type": "VERSION",
          "id": 308,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.10",
          "alias_type": "VERSION",
          "id": 318,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.11",
          "alias_type": "VERSION",
          "id": 319,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.12",
          "alias_type": "VERSION",
          "id": 320,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.13",
          "alias_type": "VERSION",
          "id": 321,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.14",
          "alias_type": "VERSION",
          "id": 322,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.15",
          "alias_type": "VERSION",
          "id": 323,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.16",
          "alias_type": "VERSION",
          "id": 324,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.17",
          "alias_type": "VERSION",
          "id": 325,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.18",
          "alias_type": "VERSION",
          "id": 326,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.19",
          "alias_type": "VERSION",
          "id": 327,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.2",
          "alias_type": "VERSION",
          "id": 309,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.20",
          "alias_type": "VERSION",
          "id": 328,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.21",
          "alias_type": "VERSION",
          "id": 329,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.22",
          "alias_type": "VERSION",
          "id": 330,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.23",
          "alias_type": "VERSION",
          "id": 331,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.24",
          "alias_type": "VERSION",
          "id": 332,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.25",
          "alias_type": "VERSION",
          "id": 333,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.26",
          "alias_type": "VERSION",
          "id": 334,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.27",
          "alias_type": "VERSION",
          "id": 335,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.28",
          "alias_type": "VERSION",
          "id": 336,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.29",
          "alias_type": "VERSION",
          "id": 337,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.3",
          "alias_type": "VERSION",
          "id": 310,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.30",
          "alias_type": "VERSION",
          "id": 338,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.4",
          "alias_type": "VERSION",
          "id": 311,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.5",
          "alias_type": "VERSION",
          "id": 312,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.6",
          "alias_type": "VERSION",
          "id": 313,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.7",
          "alias_type": "VERSION",
          "id": 314,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.8",
          "alias_type": "VERSION",
          "id": 315,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.9",
          "alias_type": "VERSION",
          "id": 316,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.x",
          "alias_type": "VERSION",
          "id": 317,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes v1",
          "alias_type": "VERSION",
          "id": 306,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "k8s",
          "alias_type": "VERSION",
          "id": 305,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 13,
        "display_name": "Kubernetes",
        "id": 158,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "kubernetes",
        "sub_category_id": 1524,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Orchestration Platforms",
            "id": 25,
            "rationale": "Operates the platforms that schedule and run containerized workloads and related deployment primitives. This is separate from image delivery because it concerns runtime placement and service rollout behavior.",
            "slug": "orchestration-platforms",
            "source": "db"
          },
          "input_skill": "Kubernetes",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cloud Engineer",
              "id": 18,
              "rationale": null,
              "role_archetype": null,
              "slug": "cloud-engineer",
              "source": "db"
            },
            {
              "display_name": "DevOps Engineer",
              "id": 1,
              "rationale": null,
              "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
              "slug": "devops-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Kubernetes",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "SQL",
          "alias_type": "CANONICAL",
          "id": 3398,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 5,
        "display_name": "SQL",
        "id": 2601,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LANGUAGE",
        "slug": "sql",
        "sub_category_id": 55,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Relational Data Modeling",
            "id": 71,
            "rationale": "Designing tables, relationships, constraints, and transactional data shapes for operational backend systems. This cluster is coherent because backend services frequently own the canonical application data model.",
            "slug": "relational-data-modeling",
            "source": "db"
          },
          "input_skill": "SQL",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Backend Engineer",
              "id": 14,
              "rationale": null,
              "role_archetype": null,
              "slug": "backend-engineer",
              "source": "db"
            },
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Version Control Systems",
            "id": 365,
            "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
            "slug": "d_init_01",
            "source": "db"
          },
          "input_skill": "SQL",
          "llm_role": null,
          "roles_from_db": []
        }
      ],
      "input_skill": "SQL",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Data Access and Query Optimization",
            "id": 74,
            "rationale": "Techniques for making analytical data fast and reliable to query. This includes partitioning, clustering, indexing choices, file layout, and access-path tuning for downstream consumers.",
            "slug": "data-access-and-query-optimization",
            "source": "db"
          },
          "input_skill": "NoSQL",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Data Access and Query Optimization",
            "id": 74,
            "rationale": "Techniques for making analytical data fast and reliable to query. This includes partitioning, clustering, indexing choices, file layout, and access-path tuning for downstream consumers.",
            "slug": "data-access-and-query-optimization",
            "source": "db"
          },
          "input_skill": "NoSQL",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "NoSQL",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Concept",
          "skill_nature": "CONCEPT",
          "sub_category": "non_relational_database_concept",
          "typical_lifespan": "EVERGREEN",
          "version_strategy": "NOT_APPLICABLE",
          "volatility": "STABLE"
        },
        "enrichment": {
          "ambiguity": {
            "ambiguity_flag": false,
            "confused_with": [],
            "reasoning": "NoSQL is a well-established database paradigm and is usually mentioned in a clear data-storage context. In typical JDs it is unlikely to be mistaken for a different catalog skill."
          },
          "context_keywords": {
            "context_keywords": [
              "MongoDB",
              "Cassandra",
              "DynamoDB",
              "Redis",
              "document store",
              "key-value store",
              "column-family",
              "sharding",
              "replication",
              "CAP theorem",
              "eventual consistency",
              "schema-less",
              "aggregation pipeline",
              "TTL index",
              "MapReduce"
            ]
          },
          "maturity": {
            "confidence": 0.93,
            "maturity": "well_known",
            "reasoning": "Broadly listed across job postings for MongoDB, DynamoDB, Cassandra, and Redis; cloud vendors and hiring pipelines treat NoSQL as a standard database category rather than a niche skill."
          },
          "skill_id": "nosql",
          "vendor_license": {
            "confidence": 0.93,
            "license": null,
            "vendor": null,
            "year_introduced": 1998
          },
          "versioning": {
            "current_version": null,
            "version_aliases": {},
            "versioned": false
          }
        },
        "keep_log": [
          {
            "a_dim_id": "data-access-and-query-optimization",
            "a_name": "Data Access and Query Optimization",
            "a_role": "__skill_focal__",
            "b_dim_id": "data-access-and-query-optimization",
            "b_name": "Data Access and Query Optimization",
            "b_role": "Data Engineer",
            "pair_kind": "cross_role",
            "reasoning": "Same name, but different clusters. Dim A is operational DB access: NoSQL/document/key-value/graph stores, MongoDB/Cassandra/DynamoDB/Redis, document modeling, sharding, index tuning, and read/write tradeoffs. Dim B is analytical query performance: partitioning, clustering, indexing choices, file layout, and access-path tuning for downstream consumers. The shared words are about optimization, but A is about database-system storage/query behavior while B is about analytics data layout and query speed.",
            "similarity": 0.6886723881495891
          }
        ],
        "locked_dimensions": [
          {
            "description": "Covers choosing and tuning data access patterns for fast, reliable reads and writes across database systems. NoSQL belongs here because it is fundamentally about non-relational storage models, indexing, and query behavior used to retrieve data efficiently.",
            "exemplar_skills": [
              "NoSQL",
              "MongoDB",
              "Cassandra",
              "DynamoDB",
              "Redis",
              "document modeling",
              "sharding",
              "index tuning"
            ],
            "in_scope": "NoSQL, document databases, key-value stores, column-family stores, graph database access, indexing strategies, query planning, partitioning, sharding, read/write tradeoffs",
            "name": "Data Access and Query Optimization",
            "out_of_scope": "SQL schema design, relational joins, transaction isolation in relational databases, ETL pipelines, which belong to other data platform or database-specific dimensions",
            "overlap_flags": [
              {
                "reason": "Some NoSQL systems are used as event stores or backing stores for streaming architectures, but the storage model itself is the primary concern here.",
                "with_dim_id": "messaging-and-event-streaming",
                "with_dim_name": null,
                "with_role": "Backend Engineer"
              }
            ],
            "tentative_id": "data-access-and-query-optimization"
          },
          {
            "description": "Techniques for making analytical data fast and reliable to query. This includes partitioning, clustering, indexing choices, file layout, and access-path tuning for downstream consumers.",
            "exemplar_skills": [
              "Data Access and Query Optimization"
            ],
            "in_scope": "Skills, tools, and practices that belong under Data Access and Query Optimization for the target role, including items implied by the dimension rationale.",
            "name": "Data Access and Query Optimization",
            "out_of_scope": "Adjacent clusters explicitly not owned by Data Access and Query Optimization, including unrelated platforms, roles, and skill families per library policy.",
            "overlap_flags": [],
            "tentative_id": "data-access-and-query-optimization"
          }
        ],
        "merge_log": [],
        "placed": {
          "name": "NoSQL",
          "placement_confidence": 0.92,
          "primary_dimension": "data-access-and-query-optimization",
          "reasoning": "Deterministic JD placement: locked_dimensions has 2 dimension(s) from skill-driven dimension generation after reconciliation; primary_dimension is the first locked dim.",
          "secondary_dimensions": [],
          "skill_id": "nosql"
        },
        "relationships": {
          "child_skills": [],
          "parent_skills": [],
          "related_to": [
            "sql",
            "elasticsearch",
            "opensearch",
            "amazon-athena",
            "data-structures",
            "storage-layout",
            "metadata-json",
            "nfs-datastores",
            "subgraphs",
            "cloudwatch"
          ],
          "requires": [],
          "skill_id": "nosql",
          "suppress_on_match": []
        },
        "skill_id": "nosql",
        "split_log": [],
        "typed": {
          "alternatives_considered": [
            "Datastore: ruled out \u2014 NoSQL is not a specific system that persists data, but a category/concept of database technologies.",
            "Architecture: ruled out \u2014 it is broader than a system-shape pattern and refers to a database paradigm rather than an application architecture."
          ],
          "confidence": 0.9,
          "name": "NoSQL",
          "reasoning": "NoSQL is fundamentally a knowledge unit describing non-relational database approaches, so by the Concept vs Methodology rule it is a Concept rather than a Datastore or Format.",
          "skill_id": "nosql",
          "subtype": "non_relational_database_concept",
          "type": "Concept"
        },
        "warnings": [
          "stage3_post_filter_dropped_catalog_only_locked_dims:41-\u003e2"
        ]
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "AWS",
          "alias_type": "CANONICAL",
          "id": 348,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 13,
        "display_name": "AWS",
        "id": 163,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "aws",
        "sub_category_id": 161,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Platform Operations",
            "id": 26,
            "rationale": "Uses cloud provider services to support delivery and runtime environments. The focus is on consumer-level operation of cloud services rather than deep cloud architecture ownership.",
            "slug": "cloud-platform-operations",
            "source": "db"
          },
          "input_skill": "AWS",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "DevOps Engineer",
              "id": 1,
              "rationale": null,
              "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
              "slug": "devops-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Security Platforms",
            "id": 332,
            "rationale": "Cloud-native security products used to assess posture, detect misconfigurations, and monitor workloads across AWS, Azure, and GCP. This is a distinct product family because the role often works across multiple CNAPP/CSPM/CWPP offerings and cloud-native detectors.",
            "slug": "cloud-security-platforms",
            "source": "db"
          },
          "input_skill": "AWS",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cybersecurity Engineer",
              "id": 9,
              "rationale": null,
              "role_archetype": null,
              "slug": "cybersecurity-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "AWS",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "GCP",
          "alias_type": "CANONICAL",
          "id": 3043,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 13,
        "display_name": "GCP",
        "id": 2304,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "gcp",
        "sub_category_id": 161,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Security Platforms",
            "id": 332,
            "rationale": "Cloud-native security products used to assess posture, detect misconfigurations, and monitor workloads across AWS, Azure, and GCP. This is a distinct product family because the role often works across multiple CNAPP/CSPM/CWPP offerings and cloud-native detectors.",
            "slug": "cloud-security-platforms",
            "source": "db"
          },
          "input_skill": "GCP",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cybersecurity Engineer",
              "id": 9,
              "rationale": null,
              "role_archetype": null,
              "slug": "cybersecurity-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "GCP",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Git",
          "alias_type": "CANONICAL",
          "id": 3375,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 11,
        "display_name": "Git",
        "id": 2578,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "git",
        "sub_category_id": 2101,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Version Control Systems",
            "id": 365,
            "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
            "slug": "d_init_01",
            "source": "db"
          },
          "input_skill": "Git",
          "llm_role": null,
          "roles_from_db": []
        }
      ],
      "input_skill": "Git",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "CI/CD",
          "alias_type": "CANONICAL",
          "id": 3376,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 7,
        "display_name": "CI/CD",
        "id": 2579,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "METHODOLOGY",
        "slug": "ci-cd",
        "sub_category_id": 2102,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Version Control Systems",
            "id": 365,
            "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
            "slug": "d_init_01",
            "source": "db"
          },
          "input_skill": "CI/CD",
          "llm_role": null,
          "roles_from_db": []
        }
      ],
      "input_skill": "CI/CD",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": null,
            "display_name": "Model Serving Deployment, Packaging, and Runtime Operations",
            "id": null,
            "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, creating release artifacts, coordinating rollout, and handing off to inference systems. Covers model registry/versioning workflows, MLflow, serving frameworks such as TensorFlow Serving, TorchServe, Triton, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base images, entrypoints, dependencies, and image scanning.",
            "slug": "d_merge_01",
            "source": "llm"
          },
          "input_skill": "MLflow",
          "llm_role": null,
          "roles_from_db": []
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Version Control Systems",
            "id": 365,
            "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
            "slug": "d_init_01",
            "source": "db"
          },
          "input_skill": "MLflow",
          "llm_role": null,
          "roles_from_db": []
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Project Delivery and Coordination",
            "id": 366,
            "rationale": "Coordination practices for organizing work, tracking progress, and aligning stakeholders across a delivery effort. Agile fits here when used as a team execution framework for managing scope, cadence, and collaboration.",
            "slug": "d_init_02",
            "source": "db"
          },
          "input_skill": "MLflow",
          "llm_role": null,
          "roles_from_db": []
        }
      ],
      "input_skill": "MLflow",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Tool",
          "skill_nature": "TOOL",
          "sub_category": "mlops_tool",
          "typical_lifespan": "EVERGREEN",
          "version_strategy": "NOT_APPLICABLE",
          "volatility": "EMERGING"
        },
        "enrichment": {
          "ambiguity": {
            "ambiguity_flag": false,
            "confused_with": [],
            "reasoning": "MLflow is a specific MLOps tool with a distinctive name; in typical JDs it is unlikely to be mistaken for another catalog skill."
          },
          "context_keywords": {
            "context_keywords": [
              "experiment tracking",
              "model registry",
              "artifact store",
              "MLproject",
              "MLflow Tracking",
              "MLflow Projects",
              "MLflow Models",
              "pyfunc",
              "model serving",
              "run metadata",
              "parameters",
              "metrics",
              "artifacts",
              "Databricks",
              "MLOps"
            ]
          },
          "maturity": {
            "confidence": 0.86,
            "maturity": "emerging",
            "reasoning": "MLflow appears in many MLOps job descriptions and is widely used in model tracking/registry, but it is not yet as universal as core cloud or data-stack tools."
          },
          "skill_id": "mlflow",
          "vendor_license": {
            "confidence": 0.98,
            "license": "apache_2",
            "vendor": "Databricks",
            "year_introduced": 2018
          },
          "versioning": {
            "current_version": null,
            "version_aliases": {},
            "versioned": false
          }
        },
        "keep_log": [
          {
            "a_dim_id": "model-serving-deployment-and-runtime-packaging",
            "a_name": "Model Serving Deployment and Runtime Packaging",
            "a_role": "__skill_focal__",
            "b_dim_id": "model-serving-architecture",
            "b_name": "Model Serving Architecture",
            "b_role": "Machine Learning Engineer",
            "pair_kind": "cross_role",
            "reasoning": "Dim A is about operational deployment and packaging of trained models: MLflow, model registry/versioning, deployment artifacts, and containerized model serving. Dim B is about serving architecture: hosting, routing, and scaling inference via services, sidecars, gateways, or dedicated serving layers. A is release/deployment mechanics; B is system design for inference topology. The overlap on \"model serving\" is superficial.",
            "similarity": 0.6514498388016978
          }
        ],
        "locked_dimensions": [
          {
            "description": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, creating release artifacts, coordinating rollout, and handing off to inference systems. Covers model registry/versioning workflows, MLflow, serving frameworks such as TensorFlow Serving, TorchServe, Triton, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base images, entrypoints, dependencies, and image scanning.",
            "exemplar_skills": [
              "Model Serving Deployment, Packaging, and Runtime Operations"
            ],
            "in_scope": "Skills, tools, and practices that belong under Model Serving Deployment, Packaging, and Runtime Operations for the target role, including items implied by the dimension rationale.",
            "name": "Model Serving Deployment, Packaging, and Runtime Operations",
            "out_of_scope": "Adjacent clusters explicitly not owned by Model Serving Deployment, Packaging, and Runtime Operations, including unrelated platforms, roles, and skill families per library policy.",
            "overlap_flags": [],
            "tentative_id": "d_merge_01"
          },
          {
            "description": "Tools and practices for logging runs, parameters, metrics, artifacts, and model lineage during ML development. MLflow fits here because its core value is reproducible experiment tracking and comparison across training runs.",
            "exemplar_skills": [
              "MLflow",
              "experiment tracking",
              "run tracking",
              "metric logging",
              "artifact logging",
              "model lineage"
            ],
            "in_scope": "MLflow, run tracking, parameter logging, metric logging, artifact logging, experiment comparison, model lineage, reproducible ML experiments",
            "name": "Machine Learning Experiment Tracking",
            "out_of_scope": "Production inference hosting, container orchestration, feature store design, and batch scoring pipelines, which are owned by deployment and data-pipeline dimensions",
            "overlap_flags": [
              {
                "reason": "Both involve comparing outcomes across runs, but this dimension is about ML workflow instrumentation rather than causal experiment methodology.",
                "with_dim_id": "experiment-design-and-analysis",
                "with_dim_name": null,
                "with_role": "Data Scientist"
              }
            ],
            "tentative_id": "d_init_01"
          },
          {
            "description": "Systems and practices for registering, versioning, approving, and promoting machine learning models across environments. MLflow belongs here because its registry capabilities manage model lifecycle, stage transitions, and governance.",
            "exemplar_skills": [
              "MLflow",
              "model registry",
              "model stages",
              "version promotion",
              "model governance",
              "lineage metadata"
            ],
            "in_scope": "MLflow, model registry, model stages, version promotion, model approval workflows, lineage metadata, model governance, registry-backed deployment",
            "name": "Machine Learning Model Registry",
            "out_of_scope": "Training code, notebook experimentation, and serving runtime implementation, which belong to experiment tracking or deployment dimensions",
            "overlap_flags": [
              {
                "reason": "Registry workflows often feed deployment, but this dimension focuses on lifecycle control rather than packaging and runtime release mechanics.",
                "with_dim_id": "model-serving-deployment-and-runtime-packaging",
                "with_dim_name": null,
                "with_role": "MLOps Engineer, Machine Learning Engineer"
              }
            ],
            "tentative_id": "d_init_02"
          }
        ],
        "merge_log": [
          {
            "a_dim_id": "model-serving-deployment-and-runtime-packaging",
            "a_name": "Model Serving Deployment and Runtime Packaging",
            "a_role": "__skill_focal__",
            "b_dim_id": "model-serving-deployment-and-runtime-packaging",
            "b_name": "Model Serving Deployment and Runtime Packaging",
            "b_role": "MLOps Engineer",
            "into": "d_merge_01",
            "into_name": "Model Serving Deployment, Packaging, and Runtime Operations",
            "merged_from": [
              "model-serving-deployment-and-runtime-packaging",
              "model-serving-deployment-and-runtime-packaging"
            ],
            "pair_kind": "cross_role",
            "reasoning": "Both dims cover operational deployment of trained models into serving environments. A emphasizes MLflow, model registry/versioning, packaging, and deployment artifacts; B adds concrete serving frameworks and container/runtime details like TensorFlow Serving, Triton, Docker images, and GPU/base-image concerns. B is a richer superset of A, not a separate cluster.",
            "similarity": 0.8104536947595666
          }
        ],
        "placed": {
          "name": "MLflow",
          "placement_confidence": 0.92,
          "primary_dimension": "d_merge_01",
          "reasoning": "Deterministic JD placement: locked_dimensions has 3 dimension(s) from skill-driven dimension generation after reconciliation; primary_dimension is the first locked dim.",
          "secondary_dimensions": [
            "d_init_01",
            "d_init_02"
          ],
          "skill_id": "mlflow"
        },
        "relationships": {
          "child_skills": [],
          "parent_skills": [],
          "related_to": [
            "ai-ml",
            "go",
            "migration-scripts",
            "the-graph",
            "metamask",
            "solidity",
            "foundry-traces",
            "foundry-fuzzing",
            "infura"
          ],
          "requires": [
            "ml"
          ],
          "skill_id": "mlflow",
          "suppress_on_match": []
        },
        "skill_id": "mlflow",
        "split_log": [],
        "typed": {
          "alternatives_considered": [
            "Framework: ruled out \u2014 users operate MLflow as software rather than building applications inside it.",
            "Platform: ruled out \u2014 MLflow is not a hosted multi-tenant environment with managed APIs by default."
          ],
          "confidence": 0.93,
          "name": "MLflow",
          "reasoning": "MLflow is software you run to manage experiments, models, and deployments, so by the Tool vs Framework rule it is a Tool rather than a framework or platform.",
          "skill_id": "mlflow",
          "subtype": "mlops_tool",
          "type": "Tool"
        },
        "warnings": [
          "stage3_post_filter_dropped_catalog_only_locked_dims:42-\u003e3"
        ]
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Model Serving Deployment and Runtime Packaging",
            "id": 52,
            "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
            "slug": "model-serving-deployment-and-runtime-packaging",
            "source": "db"
          },
          "input_skill": "Kubeflow",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "MLOps Engineer",
              "id": 5,
              "rationale": null,
              "role_archetype": null,
              "slug": "mlops-engineer",
              "source": "db"
            },
            {
              "display_name": "Machine Learning Engineer",
              "id": 10,
              "rationale": null,
              "role_archetype": null,
              "slug": "machine-learning-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Version Control Systems",
            "id": 365,
            "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
            "slug": "d_init_01",
            "source": "db"
          },
          "input_skill": "Kubeflow",
          "llm_role": null,
          "roles_from_db": []
        }
      ],
      "input_skill": "Kubeflow",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Framework",
          "skill_nature": "FRAMEWORK",
          "sub_category": "ml_workflow_framework",
          "typical_lifespan": "EVERGREEN",
          "version_strategy": "SEPARATE_ENTITY",
          "volatility": "STABLE"
        },
        "enrichment": {
          "ambiguity": {
            "ambiguity_flag": false,
            "confused_with": [],
            "reasoning": "Kubeflow is a specific ML workflow framework with a distinctive name; in typical JDs it is unlikely to be confused with another catalog skill."
          },
          "context_keywords": {
            "context_keywords": [
              "Kubernetes",
              "Argo Workflows",
              "Tekton",
              "ML pipelines",
              "notebooks",
              "training jobs",
              "model serving",
              "KFServing",
              "KServe",
              "TensorFlow",
              "PyTorch",
              "hyperparameter tuning",
              "pipeline components",
              "container images",
              "GPU nodes"
            ]
          },
          "maturity": {
            "confidence": 0.86,
            "maturity": "niche",
            "reasoning": "Kubeflow appears in some ML platform and MLOps JDs, but far less often than Kubernetes/AWS; GitHub activity is steady yet the market signal is specialized adoption rather than broad hiring demand."
          },
          "skill_id": "kubeflow",
          "vendor_license": {
            "confidence": 0.98,
            "license": "apache_2",
            "vendor": "Kubeflow",
            "year_introduced": 2018
          },
          "versioning": {
            "current_version": "2.x",
            "version_aliases": {
              "Kubeflow 1.x": "1.x",
              "Kubeflow 2.x": "2.x",
              "Kubeflow v1": "1.x",
              "Kubeflow v2": "2.x"
            },
            "versioned": true
          }
        },
        "keep_log": [],
        "locked_dimensions": [
          {
            "description": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers and wiring them into production runtimes. Kubeflow belongs here because it is commonly used to build and run ML pipelines that culminate in deployable model artifacts and serving workflows.",
            "exemplar_skills": [
              "Kubeflow",
              "model serving deployment",
              "ML pipeline orchestration",
              "containerized inference workflows",
              "training-to-serving handoff"
            ],
            "in_scope": "Kubeflow, model pipeline deployment, containerized ML workflows, model packaging, training-to-serving handoff, batch inference jobs, pipeline orchestration for ML, model server runtime setup",
            "name": "Model Serving Deployment and Runtime Packaging",
            "out_of_scope": "Feature engineering logic and data prep before training, which belongs in inference-data-pipelines; general cloud infrastructure provisioning, which belongs in platform or automation dimensions; model architecture design and training algorithms, which belong to ML modeling dimensions",
            "overlap_flags": [
              {
                "reason": "Kubeflow pipelines often include data preparation and feature refresh steps that overlap with inference-time data movement.",
                "with_dim_id": "inference-data-pipelines",
                "with_dim_name": null,
                "with_role": "MLOps Engineer"
              },
              {
                "reason": "Kubeflow deployments frequently integrate with cloud services, storage, and evented systems around the pipeline runtime.",
                "with_dim_id": "cloud-service-integration-patterns",
                "with_dim_name": null,
                "with_role": "Cloud Architect"
              }
            ],
            "tentative_id": "model-serving-deployment-and-runtime-packaging"
          },
          {
            "description": "Building and operating end-to-end ML workflows that coordinate data prep, training, evaluation, and deployment as repeatable pipelines. Kubeflow fits here because it is a dedicated platform for composing and running ML pipelines on Kubernetes.",
            "exemplar_skills": [
              "Kubeflow",
              "ML pipeline orchestration",
              "pipeline DAG design",
              "training workflow automation",
              "reproducible ML workflows"
            ],
            "in_scope": "Kubeflow, ML pipelines, workflow DAGs, training pipelines, evaluation pipelines, pipeline components, pipeline scheduling, reproducible ML workflows",
            "name": "Machine Learning Pipeline Orchestration",
            "out_of_scope": "Low-level Kubernetes cluster administration, which belongs to container orchestration or platform operations; model serving runtime packaging, which belongs to deployment dimensions; notebook-only experimentation without pipeline automation",
            "overlap_flags": [
              {
                "reason": "Kubeflow often extends from pipeline orchestration into deployment and serving of trained models.",
                "with_dim_id": "model-serving-deployment-and-runtime-packaging",
                "with_dim_name": null,
                "with_role": "MLOps Engineer, Machine Learning Engineer"
              }
            ],
            "tentative_id": "d_init_01"
          }
        ],
        "merge_log": [],
        "placed": {
          "name": "Kubeflow",
          "placement_confidence": 0.92,
          "primary_dimension": "model-serving-deployment-and-runtime-packaging",
          "reasoning": "Deterministic JD placement: locked_dimensions has 2 dimension(s) from skill-driven dimension generation after reconciliation; primary_dimension is the first locked dim.",
          "secondary_dimensions": [
            "d_init_01"
          ],
          "skill_id": "kubeflow"
        },
        "relationships": {
          "child_skills": [],
          "parent_skills": [],
          "related_to": [
            "ci-cd",
            "aws",
            "aws-lambda",
            "aws-data-pipeline",
            "amazon-api-gateway",
            "kinesis",
            "ec2",
            "cloudwatch",
            "aws-kms"
          ],
          "requires": [],
          "skill_id": "kubeflow",
          "suppress_on_match": []
        },
        "skill_id": "kubeflow",
        "split_log": [],
        "typed": {
          "alternatives_considered": [],
          "confidence": 0.9,
          "name": "Kubeflow",
          "reasoning": "Kubeflow is best classified as a Framework because users build machine-learning workflows and applications on top of it rather than merely operating it as standalone software.",
          "skill_id": "kubeflow",
          "subtype": "ml_workflow_framework",
          "type": "Framework"
        },
        "warnings": [
          "stage3_reconcile_failed: ValidationError: 1 validation error for ReconciliationDecision\nmerge_into_description\n  String should have at most 600 characters [type=string_too_long, input_value=\u0027Operational deployment o...es, and image scanning.\u0027, input_type=str]\n    For further information visit https://errors.pydantic.dev/2.13/v/string_too_long"
        ]
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    }
  ],
  "unmatched_skills": [
    "NoSQL",
    "MLflow",
    "Kubeflow"
  ]
}

API 3 — final-role-output

{
  "chosen_role": {
    "display_name": "Data Scientist",
    "id": 7,
    "rationale": "Data Scientist is the most fitting role as it emphasizes the primary skill of Python and encompasses advanced analytical capabilities.",
    "role_archetype": null,
    "slug": "data-scientist",
    "source": "db"
  },
  "chosen_role_resolution": "in_db",
  "final_input_skills": [
    {
      "skill": "Python",
      "tag": "in_db"
    },
    {
      "skill": "TensorFlow",
      "tag": "in_db"
    },
    {
      "skill": "PyTorch",
      "tag": "in_db"
    },
    {
      "skill": "Scikit-learn",
      "tag": "in_db"
    },
    {
      "skill": "REST",
      "tag": "in_db"
    },
    {
      "skill": "Docker",
      "tag": "in_db"
    },
    {
      "skill": "Kubernetes",
      "tag": "in_db"
    },
    {
      "skill": "SQL",
      "tag": "in_db"
    },
    {
      "skill": "NoSQL",
      "tag": "new"
    },
    {
      "skill": "AWS",
      "tag": "in_db"
    },
    {
      "skill": "GCP",
      "tag": "in_db"
    },
    {
      "skill": "Git",
      "tag": "in_db"
    },
    {
      "skill": "CI/CD",
      "tag": "in_db"
    },
    {
      "skill": "MLflow",
      "tag": "new"
    },
    {
      "skill": "Kubeflow",
      "tag": "new"
    }
  ],
  "persistence": {
    "items": [
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Analytical Programming Languages",
          "id": 82,
          "rationale": "Languages used to clean, transform, analyze, and prototype models in notebooks and scripts. This is the core coding surface for expressing statistical logic and data manipulation in a reproducible way.",
          "slug": "analytical-programming-languages",
          "source": "db"
        },
        "dimension_id": 82,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": true,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension saved",
        "role_dimension_saved": true,
        "roles_from_db": [
          {
            "display_name": "Data Analyst",
            "id": 20,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-analyst",
            "source": "db"
          },
          {
            "display_name": "Data Scientist",
            "id": 7,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-scientist",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Automation Scripting and CLI",
          "id": 48,
          "rationale": "Uses scripts and command-line tooling to execute repeatable Azure operations and reduce manual work. This is a practical cluster because the role frequently automates provisioning, checks, and remediation tasks.",
          "slug": "automation-scripting-and-cli",
          "source": "db"
        },
        "dimension_id": 48,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Azure Cloud Engineer",
            "id": 4,
            "rationale": null,
            "role_archetype": null,
            "slug": "azure-cloud-engineer",
            "source": "db"
          },
          {
            "display_name": "Cloud Engineer",
            "id": 18,
            "rationale": null,
            "role_archetype": null,
            "slug": "cloud-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Automation and Scripting for Operations",
          "id": 361,
          "rationale": "Scripts and lightweight automation used to execute repetitive virtualization tasks and enforce operational consistency. This is the practical glue that reduces manual host and VM administration.",
          "slug": "automation-and-scripting-for-operations",
          "source": "db"
        },
        "dimension_id": 361,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Virtualization Engineer",
            "id": 26,
            "rationale": null,
            "role_archetype": null,
            "slug": "virtualization-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Network Automation and Scripting",
          "id": 285,
          "rationale": "Covers scripts and automation used to configure, validate, and audit network devices and services. This cluster is coherent because repeatable network operations increasingly depend on programmatic changes and checks.",
          "slug": "network-automation-and-scripting",
          "source": "db"
        },
        "dimension_id": 285,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Network Engineer",
            "id": 21,
            "rationale": null,
            "role_archetype": null,
            "slug": "network-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for AI Workflows",
          "id": 261,
          "rationale": "Languages used to implement AI feature logic, orchestration, and response handling inside product code. This is the core coding surface for turning prompts and model calls into reliable application behavior.",
          "slug": "programming-languages-for-ai-workflows",
          "source": "db"
        },
        "dimension_id": 261,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "AI Engineer",
            "id": 12,
            "rationale": null,
            "role_archetype": null,
            "slug": "ai-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for Backend Systems",
          "id": 140,
          "rationale": "Languages used to implement server-side business logic, request handlers, workers, and service integrations. This is the core coding surface for backend feature delivery and maintenance.",
          "slug": "programming-languages-for-backend-systems",
          "source": "db"
        },
        "dimension_id": 140,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Backend Engineer",
            "id": 14,
            "rationale": null,
            "role_archetype": null,
            "slug": "backend-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for Data Work",
          "id": 67,
          "rationale": "Languages used to implement data pipelines, transformations, and operational utilities. This is the code layer for expressing extraction, parsing, validation, and orchestration logic in data engineering workflows.",
          "slug": "programming-languages-for-data-work",
          "source": "db"
        },
        "dimension_id": 67,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for ML Systems",
          "id": 113,
          "rationale": "Languages used to implement model integration code, inference services, and feature-processing logic. This is the core coding surface for turning trained models into product-facing software components.",
          "slug": "programming-languages-for-ml-systems",
          "source": "db"
        },
        "dimension_id": 113,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Machine Learning Engineer",
            "id": 10,
            "rationale": null,
            "role_archetype": null,
            "slug": "machine-learning-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for Security Work",
          "id": 328,
          "rationale": "Languages used to automate security tasks, write detection logic, and build analysis or remediation tooling. This is the core coding surface for a cybersecurity engineer across scripts, queries, and small utilities.",
          "slug": "programming-languages-for-security-work",
          "source": "db"
        },
        "dimension_id": 328,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cybersecurity Engineer",
            "id": 9,
            "rationale": null,
            "role_archetype": null,
            "slug": "cybersecurity-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for Test Automation",
          "id": 193,
          "rationale": "Languages used to implement automated checks, helper utilities, and test harness code. This is the core coding surface for turning test ideas into maintainable automation.",
          "slug": "programming-languages-for-test-automation",
          "source": "db"
        },
        "dimension_id": 193,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Automation Tester",
            "id": 16,
            "rationale": null,
            "role_archetype": null,
            "slug": "automation-tester",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Security Automation and Scripting",
          "id": 258,
          "rationale": "Automating repeatable security checks, enrichment, and remediation workflows. This cluster is coherent because the role often needs lightweight automation to scale analysis and response.",
          "slug": "security-automation-and-scripting",
          "source": "db"
        },
        "dimension_id": 258,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cybersecurity Engineer",
            "id": 9,
            "rationale": null,
            "role_archetype": null,
            "slug": "cybersecurity-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Applied Machine Learning Toolkits",
          "id": 94,
          "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
          "slug": "applied-machine-learning-toolkits",
          "source": "db"
        },
        "dimension_id": 94,
        "input_skill": "TensorFlow",
        "llm_role": null,
        "matched_chosen_role": true,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension saved",
        "role_dimension_saved": true,
        "roles_from_db": [
          {
            "display_name": "Data Scientist",
            "id": 7,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-scientist",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 558,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Applied Machine Learning Toolkits",
          "id": 94,
          "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
          "slug": "applied-machine-learning-toolkits",
          "source": "db"
        },
        "dimension_id": 94,
        "input_skill": "PyTorch",
        "llm_role": null,
        "matched_chosen_role": true,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension saved",
        "role_dimension_saved": true,
        "roles_from_db": [
          {
            "display_name": "Data Scientist",
            "id": 7,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-scientist",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 557,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Applied Machine Learning Toolkits",
          "id": 94,
          "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
          "slug": "applied-machine-learning-toolkits",
          "source": "db"
        },
        "dimension_id": 94,
        "input_skill": "Scikit-learn",
        "llm_role": null,
        "matched_chosen_role": true,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension saved",
        "role_dimension_saved": true,
        "roles_from_db": [
          {
            "display_name": "Data Scientist",
            "id": 7,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-scientist",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 554,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "API Integration and Data Fetching",
          "id": 9,
          "rationale": "Connecting frontend applications to backend services and third-party endpoints. This covers request orchestration, error handling, pagination, and shaping remote data for UI consumption.",
          "slug": "api-integration-and-data-fetching",
          "source": "db"
        },
        "dimension_id": 9,
        "input_skill": "REST",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Frontend Engineer",
            "id": 3,
            "rationale": null,
            "role_archetype": "Frontend Engineers design and build the user-facing parts of applications, translating product and design requirements into interactive experiences. They focus on how the application looks, behaves, and responds in the browser, ensuring usability, accessibility, and consistency across the interface.",
            "slug": "frontend-engineer",
            "source": "db"
          },
          {
            "display_name": "Full Stack Developer",
            "id": 2,
            "rationale": null,
            "role_archetype": null,
            "slug": "full-stack-developer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 121,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Containerization and Image Delivery",
          "id": 24,
          "rationale": "Builds, packages, and ships application and support workloads as container images. This cluster covers the artifact format and the mechanics of producing deployable images.",
          "slug": "containerization-and-image-delivery",
          "source": "db"
        },
        "dimension_id": 24,
        "input_skill": "Docker",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "DevOps Engineer",
            "id": 1,
            "rationale": null,
            "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
            "slug": "devops-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 153,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Model Serving Deployment and Runtime Packaging",
          "id": 52,
          "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
          "slug": "model-serving-deployment-and-runtime-packaging",
          "source": "db"
        },
        "dimension_id": 52,
        "input_skill": "Docker",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "MLOps Engineer",
            "id": 5,
            "rationale": null,
            "role_archetype": null,
            "slug": "mlops-engineer",
            "source": "db"
          },
          {
            "display_name": "Machine Learning Engineer",
            "id": 10,
            "rationale": null,
            "role_archetype": null,
            "slug": "machine-learning-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 153,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Orchestration Platforms",
          "id": 25,
          "rationale": "Operates the platforms that schedule and run containerized workloads and related deployment primitives. This is separate from image delivery because it concerns runtime placement and service rollout behavior.",
          "slug": "orchestration-platforms",
          "source": "db"
        },
        "dimension_id": 25,
        "input_skill": "Kubernetes",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cloud Engineer",
            "id": 18,
            "rationale": null,
            "role_archetype": null,
            "slug": "cloud-engineer",
            "source": "db"
          },
          {
            "display_name": "DevOps Engineer",
            "id": 1,
            "rationale": null,
            "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
            "slug": "devops-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 158,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Relational Data Modeling",
          "id": 71,
          "rationale": "Designing tables, relationships, constraints, and transactional data shapes for operational backend systems. This cluster is coherent because backend services frequently own the canonical application data model.",
          "slug": "relational-data-modeling",
          "source": "db"
        },
        "dimension_id": 71,
        "input_skill": "SQL",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Backend Engineer",
            "id": 14,
            "rationale": null,
            "role_archetype": null,
            "slug": "backend-engineer",
            "source": "db"
          },
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2601,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Version Control Systems",
          "id": 365,
          "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
          "slug": "d_init_01",
          "source": "db"
        },
        "dimension_id": 365,
        "input_skill": "SQL",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2601,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Platform Operations",
          "id": 26,
          "rationale": "Uses cloud provider services to support delivery and runtime environments. The focus is on consumer-level operation of cloud services rather than deep cloud architecture ownership.",
          "slug": "cloud-platform-operations",
          "source": "db"
        },
        "dimension_id": 26,
        "input_skill": "AWS",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "DevOps Engineer",
            "id": 1,
            "rationale": null,
            "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
            "slug": "devops-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 163,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Security Platforms",
          "id": 332,
          "rationale": "Cloud-native security products used to assess posture, detect misconfigurations, and monitor workloads across AWS, Azure, and GCP. This is a distinct product family because the role often works across multiple CNAPP/CSPM/CWPP offerings and cloud-native detectors.",
          "slug": "cloud-security-platforms",
          "source": "db"
        },
        "dimension_id": 332,
        "input_skill": "AWS",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cybersecurity Engineer",
            "id": 9,
            "rationale": null,
            "role_archetype": null,
            "slug": "cybersecurity-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 163,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Security Platforms",
          "id": 332,
          "rationale": "Cloud-native security products used to assess posture, detect misconfigurations, and monitor workloads across AWS, Azure, and GCP. This is a distinct product family because the role often works across multiple CNAPP/CSPM/CWPP offerings and cloud-native detectors.",
          "slug": "cloud-security-platforms",
          "source": "db"
        },
        "dimension_id": 332,
        "input_skill": "GCP",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cybersecurity Engineer",
            "id": 9,
            "rationale": null,
            "role_archetype": null,
            "slug": "cybersecurity-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2304,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Version Control Systems",
          "id": 365,
          "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
          "slug": "d_init_01",
          "source": "db"
        },
        "dimension_id": 365,
        "input_skill": "Git",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2578,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Version Control Systems",
          "id": 365,
          "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
          "slug": "d_init_01",
          "source": "db"
        },
        "dimension_id": 365,
        "input_skill": "CI/CD",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2579,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Data Access and Query Optimization",
          "id": 74,
          "rationale": "Techniques for making analytical data fast and reliable to query. This includes partitioning, clustering, indexing choices, file layout, and access-path tuning for downstream consumers.",
          "slug": "data-access-and-query-optimization",
          "source": "db"
        },
        "dimension_id": 74,
        "input_skill": "NoSQL",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "New skill saved \u00b7 Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2639,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": null,
          "display_name": "Model Serving Deployment, Packaging, and Runtime Operations",
          "id": null,
          "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, creating release artifacts, coordinating rollout, and handing off to inference systems. Covers model registry/versioning workflows, MLflow, serving frameworks such as TensorFlow Serving, TorchServe, Triton, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base images, entrypoints, dependencies, and image scanning.",
          "slug": "d_merge_01",
          "source": "llm"
        },
        "dimension_id": 52,
        "input_skill": "MLflow",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "New skill saved \u00b7 Existing dimension (reconciliation merge) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2640,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Version Control Systems",
          "id": 365,
          "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
          "slug": "d_init_01",
          "source": "db"
        },
        "dimension_id": 365,
        "input_skill": "MLflow",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "New skill saved \u00b7 Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2640,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Project Delivery and Coordination",
          "id": 366,
          "rationale": "Coordination practices for organizing work, tracking progress, and aligning stakeholders across a delivery effort. Agile fits here when used as a team execution framework for managing scope, cadence, and collaboration.",
          "slug": "d_init_02",
          "source": "db"
        },
        "dimension_id": 366,
        "input_skill": "MLflow",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "New skill saved \u00b7 Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2640,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Model Serving Deployment and Runtime Packaging",
          "id": 52,
          "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
          "slug": "model-serving-deployment-and-runtime-packaging",
          "source": "db"
        },
        "dimension_id": 52,
        "input_skill": "Kubeflow",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "New skill saved \u00b7 Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "MLOps Engineer",
            "id": 5,
            "rationale": null,
            "role_archetype": null,
            "slug": "mlops-engineer",
            "source": "db"
          },
          {
            "display_name": "Machine Learning Engineer",
            "id": 10,
            "rationale": null,
            "role_archetype": null,
            "slug": "machine-learning-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2641,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 7,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Version Control Systems",
          "id": 365,
          "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
          "slug": "d_init_01",
          "source": "db"
        },
        "dimension_id": 365,
        "input_skill": "Kubeflow",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "New skill saved \u00b7 Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2641,
        "skill_tag": "in_db",
        "skipped_reason": null
      }
    ],
    "new_skills_created": 3,
    "role_dimension_saved": 0,
    "skill_dimension_saved": 6,
    "skipped": 0
  },
  "planner_output": null,
  "run_id": "325b3712-22a1-4837-9bfe-bdc7cb1fbce6"
}

LLM Calls

Every model call made for this run, in pipeline order. Click a card to see the model's response.

Loading…