← Back to history

Pipeline run

832dda01-52e1-4739-b122-0a9cf9bc8100

Pipeline LLM cost (USD)

API 1: $0.0005 API 2: $0.0006 API 3: $0.0342 Total: $0.0353

Client output enrichment

v2 Skill cluster · Nature of work · AI index · Tech stack maturity · Evidence · KRA description

Nature of work no kras

Vague JD — no KRAs present to derive a specific nature of work.

Tech stack maturity

Mainstream Modern

AI index (0 = no AI use, 5 = totally AI-dependent · v2.1)

1.70 / 5

· Title match

✓ Has AI skill

✓ AI skill (primary)

· AI skill (secondary)

· On AI team

· Builds AI products

vocab breakdown (legacy)

Assistants (×1): —

Frameworks (×2): Bedrock, Pinecone

Models / concepts (×3): RAG, LLM, LLMs, MLOps, AI, ML, AI/ML, GenAI, Generative AI, Machine Learning

Evidence — skills matched in JD (27)

AWS AI/ML Amazon SageMaker Amazon Bedrock AWS Lambda ECS EKS EC2 AWS Glue Amazon Athena Redshift AWS Data Pipeline S3 Kinesis Amazon API Gateway Python TensorFlow PyTorch Scikit-learn GitHub Actions Airflow Terraform Docker Kubernetes Pinecone +2

Skill cluster (8 dimension groups, role-scoped)

Cloud Platforms

AWS AWS Lambda Redshift S3

ML Frameworks and Libraries

PyTorch FAISS

Container Orchestration Platforms

Kubernetes

Containerization and Image Builds

Docker

Infrastructure as Code

Terraform

Programming Languages and Scripting

Python

Vector Databases

Pinecone

Cross-cutting / unaligned

AI/ML Amazon SageMaker Amazon Bedrock ECS EKS EC2 AWS Glue Amazon Athena AWS Data Pipeline Kinesis Amazon API Gateway TensorFlow Scikit-learn GitHub Actions Airflow OpenSearch

Status: completed Created: 2026-05-12T10:44:01.066686Z Updated: 2026-05-12T10:45:06.827716Z API 3 duration: 61171 ms

Flow Current 3-step pipeline

1 POST /skills/extract-from-jd

2 POST /skills/extract-details

3 POST /skills/final-role-output

Role Chosen role & resolution

MLOps Engineer

slug: mlops-engineer · id: 5 · source: db

The primary skills indicate a strong focus on AWS and AI/ML technologies, which align well with the responsibilities of an MLOps Engineer.

Resolution: in_db — role exists in library; skill↔dim and role↔dim links saved when applicable.

New skills

Skill↔dim saved

Role↔dim saved

Skipped

Job description

About the job
At Capgemini Invent, we believe difference drives change. As inventive transformation consultants, we blend our strategic, creative and scientific capabilities, collaborating closely with clients to deliver cutting-edge solutions. Join us to drive transformation tailored to our client's challenges of today and tomorrow. Informed and validated by science and data. Superpowered by creativity and design. All underpinned by technology created with purpose.

Your Role

We are seeking a highly skilled Solution Architect – AWS Cloud & AI/ML to design, architect, and implement advanced AI/ML and generative AI solutions on the AWS platform. The ideal candidate will have deep expertise in large-scale distributed systems, modern AI/ML architectures, LLMs, data engineering pipelines, and AWS-native services. This role involves partnering with cross-functional teams, understanding business challenges, and crafting end‑to‑end scalable, secure, and cost‑optimized solutions

Architect and deliver end‑to‑end AI/ML solutions on AWS, covering data ingestion, training, inference, orchestration, monitoring, and governance.
Design and integrate LLM‑based and Generative AI solutions, including retrieval-augmented generation (RAG), prompt workflows, and production deployment strategies.
Develop feature engineering strategies and scalable data pipelines to support ML training and real-time inference workloads.
Lead technical discussions and provide guidance on AI/ML best practices, model lifecycle, optimization, MLOps, and model governance.
Design highly scalable, secure, and cost-efficient architectures using:
Amazon SageMaker (Training Jobs, Inference Endpoints, Pipelines, Feature Store, Model Registry)
Amazon Bedrock (Foundation models, Generative AI orchestration, prompt management)
AWS Lambda, ECS, EKS, EC2 for building and orchestrating distributed AI workloads.
Architect and optimize data engineering platforms using:
AWS Glue, Amazon Athena, Redshift, AWS Data Pipeline, S3, Kinesis, and related services.
Build secure, production-grade API services for AI model inference using Amazon API Gateway and AWS compute services.
Your Profile

8+ years of experience in cloud architecture, with at least 5 years in AWS.
Strong expertise in:
Machine Learning, MLOps, and GenAI solution design.
Amazon SageMaker (end‑to‑end ML lifecycle).
Amazon Bedrock and modern LLM architectures.
Data engineering with Glue, Redshift, Athena, and pipeline orchestration.
Experience containerizing and scaling AI workloads on Lambda/ECS/EKS.
Strong coding experience in Python and familiarity with ML frameworks (TensorFlow, PyTorch, Scikit‑learn).
Deep understanding of security, networking, IAM, and compliance best practices for AI systems.
Excellent communication, design thinking, and stakeholder management skills.
AWS certifications (e.g., AWS Certified Solutions Architect – Professional, Machine Learning – Specialty).
Experience with vector databases (e.g., Pinecone, OpenSearch, FAISS).
Experience building RAG pipelines, multi‑agent orchestration frameworks, or custom LLM fine‑tuning workflows.
Familiarity with DevOps/MLOps tools: GitHub Actions, Airflow, Terraform, Docker, Kubernetes
Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world, while creating tangible impact for enterprises and society. It is a responsible and diverse group of 340,000 team members in more than 50 countries. With its strong over 55-year heritage, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fueled by its market leading capabilities in AI, generative AI, cloud and data, combined with its deep industry expertise and partner ecosystem.

Skills from this JD

Each row merges API 1 extraction, API 2 library match / v3 orchestration (dimensions + locked dims), and API 3 persistence tags.

AWS Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: AWS id=163 · aws

Aliases — catalog

Compaction (CANONICAL) primary

Context tags (catalog)

Bloom filter LSM tree SSTable checkpointing defragmentation garbage collection leveling log-structured merge policy segment merge storage engine tiered compaction tombstones vacuum write amplification

Stored enrichment (catalog DB)

Category: Concept
Sub-category: Storage Maintenance Concept
Confidence: 0.74
Version strategy: NOT_APPLICABLE

Maturity reasoning: Compaction is a standard storage-maintenance concept in widely used systems like LSM databases and Kafka; it appears in many JDs for Cassandra, RocksDB, and Kafka ops roles, indicating broad market demand.

Skill profile (library / DB)

Skill nature: PLATFORM
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 13
Sub-category id: 161
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Cloud Platform Operations Catalog dimension db id 26

Library dimension (catalog)

Roles linked in library: DevOps Engineer
Cloud Security Platforms Catalog dimension db id 332

Library dimension (catalog)

Roles linked in library: Cybersecurity Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Cloud Platform Operations cloud-platform-operations	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Cloud Security Platforms cloud-security-platforms	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

AI/ML Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: AI/ML id=2611 · ai-ml

Aliases — from this run (catalog unavailable)

AI/ML (CANONICAL)

Skill profile (library / DB)

Skill nature: CONCEPT
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 16
Sub-category id: 2131
Extractable: True
Also category: False

Dimensions (API 2 worklist)

AI Inference Cost, Latency, and Throughput Optimization Catalog dimension db id 260

Library dimension (catalog)

Roles linked in library: AI Engineer
AI Service Architecture Patterns Catalog dimension db id 270

Library dimension (catalog)

Roles linked in library: AI Engineer
Applied Machine Learning Toolkits Catalog dimension db id 94

Library dimension (catalog)

Roles linked in library: Data Scientist

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
AI Inference Cost, Latency, and Throughput Optimization ai-inference-cost-latency-and-throughput-optimization	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AI Service Architecture Patterns ai-service-architecture-patterns	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Applied Machine Learning Toolkits applied-machine-learning-toolkits	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Amazon SageMaker Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Amazon SageMaker id=2612 · amazon-sagemaker

Aliases — from this run (catalog unavailable)

Amazon SageMaker (CANONICAL)

Skill profile (library / DB)

Skill nature: PLATFORM
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 13
Sub-category id: 326
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Managed ML Platform Workflows Catalog dimension db id 367

Library dimension (catalog)
Managed Model Hosting and Endpoints Catalog dimension db id 368

Library dimension (catalog)
Model Serving Deployment and Runtime Packaging Catalog dimension db id 52

Library dimension (catalog)

Roles linked in library: MLOps Engineer, Machine Learning Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Managed ML Platform Workflows d_split_01_01	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Managed Model Hosting and Endpoints d_split_01_02	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Model Serving Deployment and Runtime Packaging model-serving-deployment-and-runtime-packaging	✓	✓	Existing dimension (library) · Role↔dimension saved

Amazon Bedrock Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Amazon Bedrock id=2613 · amazon-bedrock

Aliases — catalog

source maps (CANONICAL) primary

Context tags (catalog)

Chrome Firefox JavaScript TypeScript browser dev tools build tools code quality debugging dev environment development workflow error tracking frontend development mapping minification performance optimization source code sourcemap-loader transpilation webpack

Stored enrichment (catalog DB)

Category: Format
Sub-category: Debug Symbol Mapping Format
Confidence: 0.90
Version strategy: NOT_APPLICABLE

Maturity reasoning: Source maps are a standard web-debugging format supported by major bundlers and browsers; they appear routinely in frontend job descriptions and tooling docs for webpack, Vite, and Chrome DevTools.

Skill profile (library / DB)

Skill nature: CLOUD_SERVICE
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 14
Sub-category id: 2132
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Cloud Model Runtime Services Catalog dimension db id 121

Library dimension (catalog)

Roles linked in library: Machine Learning Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Cloud Model Runtime Services cloud-model-runtime-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

AWS Lambda Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: AWS Lambda id=2614 · aws-lambda

Aliases — from this run (catalog unavailable)

AWS Lambda (CANONICAL)

Skill profile (library / DB)

Skill nature: CLOUD_SERVICE
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 14
Sub-category id: 262
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Cloud Data Platform Services Catalog dimension db id 81

Library dimension (catalog)

Roles linked in library: Data Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Cloud Data Platform Services cloud-data-platform-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

ECS Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: ECS id=2615 · ecs

Aliases — catalog

release validation (CANONICAL) primary

Context tags (catalog)

QA processes artifact repository build verification compliance checks compliance standards continuous integration defect tracking deployment pipeline deployment pipelines environment setup integration testing production readiness quality assurance regression testing release management release notes rollback procedures smoke testing staging environments test automation test cases user acceptance testing version control

Stored enrichment (catalog DB)

Category: Methodology
Sub-category: Validation Process
Confidence: 0.82
Version strategy: NOT_APPLICABLE

Maturity reasoning: Common release-gate practice in JDs for QA/DevOps/SRE roles; often listed alongside CI/CD, smoke tests, and canary/rollback checks in production release pipelines.

Skill profile (library / DB)

Skill nature: CONCEPT
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 2
Sub-category id: 2133
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Version Control Systems Catalog dimension db id 365

Library dimension (catalog)

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Version Control Systems d_init_01	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

EKS Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: EKS id=725 · eks

Aliases — catalog

Ansible playbooks (CANONICAL) primary

Context tags (catalog)

Jinja2 YAML ansible-galaxy ansible-vault collections group_vars handlers host_vars idempotent inventory playbook roles tasks templates vars

Stored enrichment (catalog DB)

Category: Format
Sub-category: Automation Playbook Format
Vendor: Red Hat
License: gpl_v3
Year introduced: 2012
Confidence: 0.88
Version strategy: NOT_APPLICABLE

Maturity reasoning: Common in DevOps JDs and widely used for infrastructure automation; Red Hat/Ansible remains a standard tool in hiring pipelines, with playbooks the core format.

Skill profile (library / DB)

Skill nature: CLOUD_SERVICE
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 14
Sub-category id: 251
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Cloud Model Runtime Services Catalog dimension db id 121

Library dimension (catalog)

Roles linked in library: Machine Learning Engineer
Orchestration Platforms Catalog dimension db id 25

Library dimension (catalog)

Roles linked in library: Cloud Engineer, DevOps Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Cloud Model Runtime Services cloud-model-runtime-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Orchestration Platforms orchestration-platforms	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

EC2 Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: EC2 id=1773 · ec2

Aliases — from this run (catalog unavailable)

EC2 (CANONICAL) primary

Skill profile (library / DB)

Skill nature: CLOUD_SERVICE
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 14
Sub-category id: 1544
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Cloud Provider Core Services Catalog dimension db id 290

Library dimension (catalog)

Roles linked in library: Cloud Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Cloud Provider Core Services cloud-provider-core-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

AWS Glue Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: AWS Glue id=466 · aws-glue

Aliases — from this run (catalog unavailable)

AWS Glue (CANONICAL) primary

Skill profile (library / DB)

Skill nature: CLOUD_SERVICE
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 14
Sub-category id: 385
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Cloud Data Platform Services Catalog dimension db id 81

Library dimension (catalog)

Roles linked in library: Data Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Cloud Data Platform Services cloud-data-platform-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Amazon Athena Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Amazon Athena id=2616 · amazon-athena

Aliases — catalog

AWS Amplify Console (CANONICAL) primary

Context tags (catalog)

AWS services CI/CD Git integration GraphQL React Vue authentication backend build settings cloud infrastructure custom domains deployment deployment pipelines environment variables frontend hosting monitoring serverless static web apps version control

Stored enrichment (catalog DB)

Category: Service
Sub-category: Managed Deployment Service
Vendor: Amazon Web Services
License: apache_2
Year introduced: 2019
Confidence: 0.95
Version strategy: NOT_APPLICABLE

Maturity reasoning: Appears in growing frontend/CI-CD job postings and AWS docs, but is still far less common than GitHub Actions, Vercel, or AWS CodePipeline in JDs.

Skill profile (library / DB)

Skill nature: CLOUD_SERVICE
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 14
Sub-category id: 388
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Cloud Data Platform Security and Networking Catalog dimension db id 369

Library dimension (catalog)
Cloud Data Platform Services Catalog dimension db id 81

Library dimension (catalog)

Roles linked in library: Data Engineer
Managed ML Platform Workflows Catalog dimension db id 367

Library dimension (catalog)

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Cloud Data Platform Security and Networking d_split_01_04	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Cloud Data Platform Services cloud-data-platform-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Managed ML Platform Workflows d_split_01_01	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Redshift Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Redshift id=2570 · redshift

Aliases — from this run (catalog unavailable)

Redshift (CANONICAL)

Skill profile (library / DB)

Skill nature: PLATFORM
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 13
Sub-category id: 2098
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Data Warehousing Platforms Catalog dimension db id 72

Library dimension (catalog)

Roles linked in library: Data Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Data Warehousing Platforms data-warehousing-platforms	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

AWS Data Pipeline Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: AWS Data Pipeline id=2617 · aws-data-pipeline

Aliases — from this run (catalog unavailable)

AWS Data Pipeline (CANONICAL)

Skill profile (library / DB)

Skill nature: CLOUD_SERVICE
Volatility: STABLE
Typical lifespan: SHORT_LIVED
Category id: 14
Sub-category id: 2134
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Cloud Data Platform Services Catalog dimension db id 81

Library dimension (catalog)

Roles linked in library: Data Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Cloud Data Platform Services cloud-data-platform-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

S3 Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: S3 id=2618 · s3

Aliases — catalog

Netlify Platform (CANONICAL) primary

Context tags (catalog)

API integrations Git integration Jamstack Netlify CMS build plugins content management continuous deployment custom domains deploy previews edge handlers environment variables form handling redirects serverless functions site analytics

Stored enrichment (catalog DB)

Category: Platform
Sub-category: Vendor Saas Platform
Vendor: Netlify, Inc.
License: other_open
Year introduced: 2014
Confidence: 0.98
Version strategy: NOT_APPLICABLE

Maturity reasoning: Netlify appears in many modern frontend/Jamstack job descriptions, but far less universally than AWS or Kubernetes; its usage is concentrated in static-site and edge-deploy workflows rather than broad platform roles.

Skill profile (library / DB)

Skill nature: PLATFORM
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 13
Sub-category id: 2135
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Storage Provisioning and Automation Catalog dimension db id 311

Library dimension (catalog)

Roles linked in library: Storage Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Storage Provisioning and Automation storage-provisioning-and-automation	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Kinesis Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Kinesis id=2619 · kinesis

Aliases — catalog

Vercel Edge Network (CANONICAL) primary

Context tags (catalog)

API routes CDN ISR Next.js SSR deployment dynamic content edge computing edge functions global distribution performance optimization real-time caching serverless static site generation webhooks

Stored enrichment (catalog DB)

Category: Service
Sub-category: Edge Delivery Service
Vendor: Vercel, Inc.
License: other_open
Year introduced: 2020
Confidence: 0.90
Version strategy: NOT_APPLICABLE

Maturity reasoning: Appears in growing number of JDs for Next.js/edge-runtime roles and Vercel’s docs/launches show expanding adoption, but it’s still far less universal than AWS CloudFront or Cloudflare.

Skill profile (library / DB)

Skill nature: CLOUD_SERVICE
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 14
Sub-category id: 2136
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Streaming Data Processing Catalog dimension db id 69

Library dimension (catalog)

Roles linked in library: Data Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Streaming Data Processing streaming-data-processing	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Amazon API Gateway Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Amazon API Gateway id=2620 · amazon-api-gateway

Aliases — from this run (catalog unavailable)

Amazon API Gateway (CANONICAL)

Skill profile (library / DB)

Skill nature: CLOUD_SERVICE
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 14
Sub-category id: 2137
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Web Service Frameworks Catalog dimension db id 141

Library dimension (catalog)

Roles linked in library: Backend Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Web Service Frameworks web-service-frameworks	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Python Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Python id=393 · python

Aliases — catalog

Cobalt Strike (CANONICAL) primary

Context tags (catalog)

Malleable C2 beacon credential dumping kerberos lateral movement payload phishing post-exploitation privilege escalation psexec red team sleep mask smb stager team server

Stored enrichment (catalog DB)

Category: Tool
Sub-category: Adversary Simulation Tool
Vendor: Fortra
License: proprietary
Year introduced: 2012
Confidence: 0.98
Version strategy: NOT_APPLICABLE

Maturity reasoning: Appears in a limited set of red-team/pentest JDs and security vendor training, but far below mainstream devops tools; market signal is specialized adversary-simulation usage rather than broad hiring demand.

Skill profile (library / DB)

Skill nature: LANGUAGE
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 5
Sub-category id: 54
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Analytical Programming Languages Catalog dimension db id 82

Library dimension (catalog)

Roles linked in library: Data Analyst, Data Scientist
Automation Scripting and CLI Catalog dimension db id 48

Library dimension (catalog)

Roles linked in library: Azure Cloud Engineer, Cloud Engineer
Automation and Scripting for Operations Catalog dimension db id 361

Library dimension (catalog)

Roles linked in library: Virtualization Engineer
Network Automation and Scripting Catalog dimension db id 285

Library dimension (catalog)

Roles linked in library: Network Engineer
Programming Languages for AI Workflows Catalog dimension db id 261

Library dimension (catalog)

Roles linked in library: AI Engineer
Programming Languages for Backend Systems Catalog dimension db id 140

Library dimension (catalog)

Roles linked in library: Backend Engineer
Programming Languages for Data Work Catalog dimension db id 67

Library dimension (catalog)

Roles linked in library: Data Engineer
Programming Languages for ML Systems Catalog dimension db id 113

Library dimension (catalog)

Roles linked in library: Machine Learning Engineer
Programming Languages for Security Work Catalog dimension db id 328

Library dimension (catalog)

Roles linked in library: Cybersecurity Engineer
Programming Languages for Test Automation Catalog dimension db id 193

Library dimension (catalog)

Roles linked in library: Automation Tester
Security Automation and Scripting Catalog dimension db id 258

Library dimension (catalog)

Roles linked in library: Cybersecurity Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Analytical Programming Languages analytical-programming-languages	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Automation Scripting and CLI automation-scripting-and-cli	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Automation and Scripting for Operations automation-and-scripting-for-operations	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Network Automation and Scripting network-automation-and-scripting	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for AI Workflows programming-languages-for-ai-workflows	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for Backend Systems programming-languages-for-backend-systems	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for Data Work programming-languages-for-data-work	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for ML Systems programming-languages-for-ml-systems	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for Security Work programming-languages-for-security-work	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for Test Automation programming-languages-for-test-automation	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Security Automation and Scripting security-automation-and-scripting	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

TensorFlow Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: TensorFlow id=558 · tensorflow

Aliases — catalog

shader graphs (CANONICAL) primary

Context tags (catalog)

GLSL HLSL PBR UV mapping fragment shader material editor node graph node-based normal map procedural texturing render pipeline shader compiler tessellation vertex shader visual scripting

Stored enrichment (catalog DB)

Category: Framework
Sub-category: Visual Shader Authoring Framework
Vendor: Unity Technologies
License: proprietary
Year introduced: 2018
Confidence: 0.74
Version strategy: NOT_APPLICABLE

Maturity reasoning: Shader graphs appear in some Unity/Unreal and VFX job postings, but JD volume is far below core graphics skills like HLSL/GLSL; market use is concentrated in game/real-time rendering teams.

Skill profile (library / DB)

Skill nature: LIBRARY
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 6
Sub-category id: 456
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Applied Machine Learning Toolkits Catalog dimension db id 94

Library dimension (catalog)

Roles linked in library: Data Scientist

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Applied Machine Learning Toolkits applied-machine-learning-toolkits	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

PyTorch Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: PyTorch id=557 · pytorch

Aliases — catalog

GLSL (CANONICAL) primary

Context tags (catalog)

GPU HLSL OpenGL SPIR-V Vulkan WebGL compute shader fragment shader rendering pipeline shader shader pipeline texture sampling uniform varying vertex shader

Stored enrichment (catalog DB)

Category: Language
Sub-category: Shader Language
Vendor: Khronos Group
License: other_open
Year introduced: 2004
Confidence: 0.99
Version strategy: NOT_APPLICABLE

Maturity reasoning: GLSL appears in graphics/game-engine JDs but at much lower volume than mainstream languages; it’s specialized for shader programming and often replaced in newer pipelines by HLSL/Metal Shading Language or higher-level abstractions.

Skill profile (library / DB)

Skill nature: LIBRARY
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 6
Sub-category id: 456
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Applied Machine Learning Toolkits Catalog dimension db id 94

Library dimension (catalog)

Roles linked in library: Data Scientist

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Applied Machine Learning Toolkits applied-machine-learning-toolkits	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Scikit-learn Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: scikit-learn id=554 · scikit-learn

Aliases — catalog

post-processing (CANONICAL) primary

Context tags (catalog)

GPU anti-aliasing bloom color grading compositing depth of field fragment shader framebuffer image filtering motion blur render pipeline render target screen-space shader tone mapping

Stored enrichment (catalog DB)

Category: Concept
Sub-category: Graphics Effect Concept
Confidence: 0.86
Version strategy: NOT_APPLICABLE

Maturity reasoning: Job postings rarely list "post-processing" as a standalone skill; it appears mainly in graphics/VFX roles, while broader JDs usually specify tools like Unreal/Unity or Photoshop instead.

Skill profile (library / DB)

Skill nature: LIBRARY
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 6
Sub-category id: 458
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Applied Machine Learning Toolkits Catalog dimension db id 94

Library dimension (catalog)

Roles linked in library: Data Scientist

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Applied Machine Learning Toolkits applied-machine-learning-toolkits	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

GitHub Actions Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: GitHub Actions id=1250 · github-actions

Aliases — catalog

E5 (CANONICAL) primary

Context tags (catalog)

cosine similarity data augmentation dimensionality reduction embedding space feature extraction fine-tuning model evaluation natural language processing nearest neighbors pre-trained models semantic search similarity scoring transfer learning transformer models vector embeddings

Stored enrichment (catalog DB)

Category: Library
Sub-category: Embedding Model Library
Vendor: OpenAI
License: other_open
Year introduced: 2021
Confidence: 0.80
Version strategy: NOT_APPLICABLE

Maturity reasoning: E5 is a specific embedding-model library with limited JD volume; market demand is concentrated in AI/ML roles rather than broad software hiring, unlike mainstream libraries.

Skill profile (library / DB)

Skill nature: CLOUD_SERVICE
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 14
Sub-category id: 1019
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Continuous Integration Test Integration Catalog dimension db id 207

Library dimension (catalog)

Roles linked in library: Automation Tester

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Continuous Integration Test Integration continuous-integration-test-integration	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Airflow Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Airflow id=325 · airflow

Aliases — catalog

OpenVAS (CANONICAL) primary

Context tags (catalog)

CVE CVSS GVM Greenbone NVT asset discovery authenticated scan compliance scan network scan port scanning remediation reporting service detection unauthenticated scan vulnerability assessment

Stored enrichment (catalog DB)

Category: Tool
Sub-category: Vulnerability Scanner
Vendor: Greenbone Networks
License: gpl_v2
Year introduced: 2009
Confidence: 0.98
Version strategy: NOT_APPLICABLE

Maturity reasoning: OpenVAS appears in security-focused JDs far less often than mainstream scanners like Nessus or Qualys, and its usage is concentrated in pentest/vuln-management roles rather than general DevOps stacks.

Skill profile (library / DB)

Skill nature: TOOL
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 11
Sub-category id: 335
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Workflow Orchestration Systems Catalog dimension db id 64

Library dimension (catalog)

Roles linked in library: Data Engineer, MLOps Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Workflow Orchestration Systems workflow-orchestration-systems	✓	✓	Existing dimension (library) · Role↔dimension saved

Terraform Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Terraform id=144 · terraform

Aliases — catalog

Snapshot loads (CANONICAL) primary

Context tags (catalog)

CDC ELT ETL SCD backfill batch ingestion change data capture data warehouse full refresh historical snapshot idempotent loads incremental load late-arriving data partition overwrite point-in-time

Stored enrichment (catalog DB)

Category: Methodology
Sub-category: Data Loading Methodology
Confidence: 0.90
Version strategy: NOT_APPLICABLE

Maturity reasoning: Snapshot loads are a specialized data-loading pattern; JD volume is very low compared with mainstream ETL/ELT tools, and market discussion is mostly in niche data-engineering forums rather than broad hiring pipelines.

Skill profile (library / DB)

Skill nature: TOOL
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 11
Sub-category id: 171
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Infrastructure Provisioning Templates Catalog dimension db id 291

Library dimension (catalog)

Roles linked in library: Cloud Engineer
Infrastructure as Code Catalog dimension db id 22

Library dimension (catalog)

Roles linked in library: DevOps Engineer
Infrastructure as Code and Declarative Provisioning Catalog dimension db id 36

Library dimension (catalog)

Roles linked in library: Azure Cloud Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Infrastructure Provisioning Templates infrastructure-provisioning-templates	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Infrastructure as Code infrastructure-as-code	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Infrastructure as Code and Declarative Provisioning infrastructure-as-code-and-declarative-provisioning	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Docker Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Docker id=153 · docker

Aliases — catalog

Metabase (CANONICAL) primary

Context tags (catalog)

BigQuery MySQL PostgreSQL Redshift SQL ad hoc analysis cards collections dashboards data visualization embedded analytics filters questions segments self-service BI

Stored enrichment (catalog DB)

Category: Tool
Sub-category: Bi Analytics Tool
Vendor: Metabase, Inc.
License: apache_2
Year introduced: 2014
Confidence: 0.90
Version strategy: NOT_APPLICABLE

Maturity reasoning: Metabase appears in many BI/analytics job postings and is growing in GitHub usage, but it is still far less universal than Tableau/Power BI in enterprise JDs.

Skill profile (library / DB)

Skill nature: TOOL
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 11
Sub-category id: 170
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Containerization and Image Delivery Catalog dimension db id 24

Library dimension (catalog)

Roles linked in library: DevOps Engineer
Model Serving Deployment and Runtime Packaging Catalog dimension db id 52

Library dimension (catalog)

Roles linked in library: MLOps Engineer, Machine Learning Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Containerization and Image Delivery containerization-and-image-delivery	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Model Serving Deployment and Runtime Packaging model-serving-deployment-and-runtime-packaging	✓	✓	Existing dimension (library) · Role↔dimension saved

Kubernetes Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Kubernetes id=158 · kubernetes

Aliases — catalog

Column-level security (CANONICAL) primary

Context tags (catalog)

ABAC PII access policies attribute-based access control audit logging data governance data masking database permissions dynamic masking fine-grained access control least privilege policy enforcement row-level security sensitive data static masking

Stored enrichment (catalog DB)

Category: Concept
Sub-category: Access Control Concept
Confidence: 0.90
Version strategy: NOT_APPLICABLE

Maturity reasoning: Appears in cloud/data platform JDs and vendor docs for Snowflake, BigQuery, and PostgreSQL RLS/column masking, but is not yet a universal hiring staple like core IAM or RBAC.

Skill profile (library / DB)

Skill nature: PLATFORM
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 13
Sub-category id: 1524
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Orchestration Platforms Catalog dimension db id 25

Library dimension (catalog)

Roles linked in library: Cloud Engineer, DevOps Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Orchestration Platforms orchestration-platforms	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Pinecone Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: Pinecone id=2621 · pinecone

Aliases — catalog

components (CANONICAL) primary

Context tags (catalog)

Angular React UI libraries Vue component lifecycle component-driven development composition design systems hooks modular design props rendering responsive design slot state management

Stored enrichment (catalog DB)

Category: Concept
Sub-category: Ui Component Concept
Confidence: 0.82
Version strategy: NOT_APPLICABLE

Maturity reasoning: UI components are a standard hiring-pipeline topic across React, Vue, Angular, and design-system JDs; component-based architecture is the default in modern frontend stacks.

Skill profile (library / DB)

Skill nature: PLATFORM
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 13
Sub-category id: 2138
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Cloud Model Runtime Services Catalog dimension db id 121

Library dimension (catalog)

Roles linked in library: Machine Learning Engineer

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Cloud Model Runtime Services cloud-model-runtime-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

OpenSearch Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: OpenSearch id=2622 · opensearch

Aliases — catalog

templates (CANONICAL) primary

Context tags (catalog)

Bootstrap CSS Dynamic content HTML Handlebars JSON Jinja Liquid Markdown Mustache Sass Static site generator Templating engine Twig XML YAML

Stored enrichment (catalog DB)

Category: Format
Sub-category: Template Format
Confidence: 0.70
Version strategy: NOT_APPLICABLE

Maturity reasoning: Template engines are broadly used across web stacks; JDs commonly list Jinja2, Handlebars, Mustache, or Twig for server-side rendering and email generation, indicating steady hiring demand.

Skill profile (library / DB)

Skill nature: TOOL
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 12
Sub-category id: 2139
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Cloud Data Platform Services Catalog dimension db id 81

Library dimension (catalog)

Roles linked in library: Data Engineer
Version Control Systems Catalog dimension db id 365

Library dimension (catalog)

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Cloud Data Platform Services cloud-data-platform-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Version Control Systems d_init_01	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

FAISS Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)

Canonical: FAISS id=2623 · faiss

Aliases — catalog

directives (CANONICAL) primary

Context tags (catalog)

Angular React Vue.js attribute directives component lifecycle conditional rendering custom directives data binding event handling ngFor ngIf props structural directives template expressions template syntax two-way binding v-bind v-for v-if

Stored enrichment (catalog DB)

Category: Concept
Sub-category: Template Directive Concept
Confidence: 0.90
Version strategy: NOT_APPLICABLE

Maturity reasoning: “Directives” is a broad template concept, but JD volume is low and usually appears only inside specific stacks like Angular/Vue rather than as a standalone hiring keyword.

Skill profile (library / DB)

Skill nature: LIBRARY
Volatility: STABLE
Typical lifespan: EVERGREEN
Category id: 6
Sub-category id: 2140
Extractable: True
Also category: False

Dimensions (API 2 worklist)

Applied Machine Learning Toolkits Catalog dimension db id 94

Library dimension (catalog)

Roles linked in library: Data Scientist
Version Control Systems Catalog dimension db id 365

Library dimension (catalog)

API 3 link attempts (this skill)

Dimension	Skill↔dim	Role↔dim	Outcome
Applied Machine Learning Toolkits applied-machine-learning-toolkits	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Version Control Systems d_init_01	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

All API 3 persistence rows

Same grid as the skill-extractor “Persistence items” table: one row per (skill × dimension) work item.

Skill	Tag	Dimension	Skill↔dim	Role↔dim	Outcome
AWS	in_db	Cloud Platform Operations cloud-platform-operations	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AWS	in_db	Cloud Security Platforms cloud-security-platforms	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AI/ML	in_db	AI Inference Cost, Latency, and Throughput Optimization ai-inference-cost-latency-and-throughput-optimization	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AI/ML	in_db	AI Service Architecture Patterns ai-service-architecture-patterns	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AI/ML	in_db	Applied Machine Learning Toolkits applied-machine-learning-toolkits	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon SageMaker	in_db	Managed ML Platform Workflows d_split_01_01	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon SageMaker	in_db	Managed Model Hosting and Endpoints d_split_01_02	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon SageMaker	in_db	Model Serving Deployment and Runtime Packaging model-serving-deployment-and-runtime-packaging	✓	✓	Existing dimension (library) · Role↔dimension saved
Amazon Bedrock	in_db	Cloud Model Runtime Services cloud-model-runtime-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AWS Lambda	in_db	Cloud Data Platform Services cloud-data-platform-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
ECS	in_db	Version Control Systems d_init_01	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
EKS	in_db	Cloud Model Runtime Services cloud-model-runtime-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
EKS	in_db	Orchestration Platforms orchestration-platforms	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
EC2	in_db	Cloud Provider Core Services cloud-provider-core-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AWS Glue	in_db	Cloud Data Platform Services cloud-data-platform-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon Athena	in_db	Cloud Data Platform Security and Networking d_split_01_04	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon Athena	in_db	Cloud Data Platform Services cloud-data-platform-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon Athena	in_db	Managed ML Platform Workflows d_split_01_01	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Redshift	in_db	Data Warehousing Platforms data-warehousing-platforms	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AWS Data Pipeline	in_db	Cloud Data Platform Services cloud-data-platform-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
S3	in_db	Storage Provisioning and Automation storage-provisioning-and-automation	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Kinesis	in_db	Streaming Data Processing streaming-data-processing	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon API Gateway	in_db	Web Service Frameworks web-service-frameworks	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Analytical Programming Languages analytical-programming-languages	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Automation Scripting and CLI automation-scripting-and-cli	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Automation and Scripting for Operations automation-and-scripting-for-operations	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Network Automation and Scripting network-automation-and-scripting	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Programming Languages for AI Workflows programming-languages-for-ai-workflows	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Programming Languages for Backend Systems programming-languages-for-backend-systems	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Programming Languages for Data Work programming-languages-for-data-work	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Programming Languages for ML Systems programming-languages-for-ml-systems	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Programming Languages for Security Work programming-languages-for-security-work	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Programming Languages for Test Automation programming-languages-for-test-automation	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python	in_db	Security Automation and Scripting security-automation-and-scripting	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
TensorFlow	in_db	Applied Machine Learning Toolkits applied-machine-learning-toolkits	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
PyTorch	in_db	Applied Machine Learning Toolkits applied-machine-learning-toolkits	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Scikit-learn	in_db	Applied Machine Learning Toolkits applied-machine-learning-toolkits	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
GitHub Actions	in_db	Continuous Integration Test Integration continuous-integration-test-integration	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Airflow	in_db	Workflow Orchestration Systems workflow-orchestration-systems	✓	✓	Existing dimension (library) · Role↔dimension saved
Terraform	in_db	Infrastructure Provisioning Templates infrastructure-provisioning-templates	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Terraform	in_db	Infrastructure as Code infrastructure-as-code	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Terraform	in_db	Infrastructure as Code and Declarative Provisioning infrastructure-as-code-and-declarative-provisioning	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Docker	in_db	Containerization and Image Delivery containerization-and-image-delivery	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Docker	in_db	Model Serving Deployment and Runtime Packaging model-serving-deployment-and-runtime-packaging	✓	✓	Existing dimension (library) · Role↔dimension saved
Kubernetes	in_db	Orchestration Platforms orchestration-platforms	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Pinecone	in_db	Cloud Model Runtime Services cloud-model-runtime-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
OpenSearch	in_db	Cloud Data Platform Services cloud-data-platform-services	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
OpenSearch	in_db	Version Control Systems d_init_01	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
FAISS	in_db	Applied Machine Learning Toolkits applied-machine-learning-toolkits	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
FAISS	in_db	Version Control Systems d_init_01	✓	—	Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Library artifacts (this run)

Kind	Detail	DB id
library_enrichment_backfilled	AI/ML	2611
library_enrichment_backfilled	Amazon SageMaker	2612
library_enrichment_backfilled	Amazon Bedrock	2613
library_enrichment_backfilled	AWS Lambda	2614
library_enrichment_backfilled	ECS	2615
library_enrichment_backfilled	EC2	1773
library_enrichment_backfilled	Amazon Athena	2616
library_enrichment_backfilled	Redshift	2570
library_enrichment_backfilled	AWS Data Pipeline	2617
library_enrichment_backfilled	S3	2618
library_enrichment_backfilled	Kinesis	2619
library_enrichment_backfilled	Amazon API Gateway	2620
library_enrichment_backfilled	Pinecone	2621
library_enrichment_backfilled	OpenSearch	2622
library_enrichment_backfilled	FAISS	2623

API 1 — extract-from-jd click to toggle

{
  "final_skills": [
    {
      "is_primary": true,
      "skill_name": "AWS"
    },
    {
      "is_primary": true,
      "skill_name": "AI/ML"
    },
    {
      "is_primary": true,
      "skill_name": "Amazon SageMaker"
    },
    {
      "is_primary": true,
      "skill_name": "Amazon Bedrock"
    },
    {
      "is_primary": true,
      "skill_name": "AWS Lambda"
    },
    {
      "is_primary": true,
      "skill_name": "ECS"
    },
    {
      "is_primary": true,
      "skill_name": "EKS"
    },
    {
      "is_primary": true,
      "skill_name": "EC2"
    },
    {
      "is_primary": true,
      "skill_name": "AWS Glue"
    },
    {
      "is_primary": true,
      "skill_name": "Amazon Athena"
    },
    {
      "is_primary": true,
      "skill_name": "Redshift"
    },
    {
      "is_primary": true,
      "skill_name": "AWS Data Pipeline"
    },
    {
      "is_primary": true,
      "skill_name": "S3"
    },
    {
      "is_primary": true,
      "skill_name": "Kinesis"
    },
    {
      "is_primary": true,
      "skill_name": "Amazon API Gateway"
    },
    {
      "is_primary": true,
      "skill_name": "Python"
    },
    {
      "is_primary": false,
      "skill_name": "TensorFlow"
    },
    {
      "is_primary": false,
      "skill_name": "PyTorch"
    },
    {
      "is_primary": false,
      "skill_name": "Scikit-learn"
    },
    {
      "is_primary": false,
      "skill_name": "GitHub Actions"
    },
    {
      "is_primary": false,
      "skill_name": "Airflow"
    },
    {
      "is_primary": false,
      "skill_name": "Terraform"
    },
    {
      "is_primary": false,
      "skill_name": "Docker"
    },
    {
      "is_primary": false,
      "skill_name": "Kubernetes"
    },
    {
      "is_primary": false,
      "skill_name": "Pinecone"
    },
    {
      "is_primary": false,
      "skill_name": "OpenSearch"
    },
    {
      "is_primary": false,
      "skill_name": "FAISS"
    }
  ],
  "run_id": null
}

API 2 — extract-details

{
  "alias_matches": [
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 348,
      "existing_alias_text": "AWS",
      "input_term": "AWS",
      "matched_canonical": {
        "category_id": 13,
        "display_name": "AWS",
        "id": 163,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "aws",
        "sub_category_id": 161,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3408,
      "existing_alias_text": "AI/ML",
      "input_term": "AI/ML",
      "matched_canonical": {
        "category_id": 16,
        "display_name": "AI/ML",
        "id": 2611,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CONCEPT",
        "slug": "ai-ml",
        "sub_category_id": 2131,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3409,
      "existing_alias_text": "Amazon SageMaker",
      "input_term": "Amazon SageMaker",
      "matched_canonical": {
        "category_id": 13,
        "display_name": "Amazon SageMaker",
        "id": 2612,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "amazon-sagemaker",
        "sub_category_id": 326,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3410,
      "existing_alias_text": "Amazon Bedrock",
      "input_term": "Amazon Bedrock",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "Amazon Bedrock",
        "id": 2613,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "amazon-bedrock",
        "sub_category_id": 2132,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3411,
      "existing_alias_text": "AWS Lambda",
      "input_term": "AWS Lambda",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "AWS Lambda",
        "id": 2614,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "aws-lambda",
        "sub_category_id": 262,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3412,
      "existing_alias_text": "ECS",
      "input_term": "ECS",
      "matched_canonical": {
        "category_id": 2,
        "display_name": "ECS",
        "id": 2615,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CONCEPT",
        "slug": "ecs",
        "sub_category_id": 2133,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 1093,
      "existing_alias_text": "EKS",
      "input_term": "EKS",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "EKS",
        "id": 725,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "eks",
        "sub_category_id": 251,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 2372,
      "existing_alias_text": "EC2",
      "input_term": "EC2",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "EC2",
        "id": 1773,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "ec2",
        "sub_category_id": 1544,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 730,
      "existing_alias_text": "AWS Glue",
      "input_term": "AWS Glue",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "AWS Glue",
        "id": 466,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "aws-glue",
        "sub_category_id": 385,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3413,
      "existing_alias_text": "Amazon Athena",
      "input_term": "Amazon Athena",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "Amazon Athena",
        "id": 2616,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "amazon-athena",
        "sub_category_id": 388,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3367,
      "existing_alias_text": "Redshift",
      "input_term": "Redshift",
      "matched_canonical": {
        "category_id": 13,
        "display_name": "Redshift",
        "id": 2570,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "redshift",
        "sub_category_id": 2098,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3414,
      "existing_alias_text": "AWS Data Pipeline",
      "input_term": "AWS Data Pipeline",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "AWS Data Pipeline",
        "id": 2617,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "aws-data-pipeline",
        "sub_category_id": 2134,
        "typical_lifespan": "SHORT_LIVED",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3415,
      "existing_alias_text": "S3",
      "input_term": "S3",
      "matched_canonical": {
        "category_id": 13,
        "display_name": "S3",
        "id": 2618,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "s3",
        "sub_category_id": 2135,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3416,
      "existing_alias_text": "Kinesis",
      "input_term": "Kinesis",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "Kinesis",
        "id": 2619,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "kinesis",
        "sub_category_id": 2136,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3417,
      "existing_alias_text": "Amazon API Gateway",
      "input_term": "Amazon API Gateway",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "Amazon API Gateway",
        "id": 2620,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "amazon-api-gateway",
        "sub_category_id": 2137,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 608,
      "existing_alias_text": "Python",
      "input_term": "Python",
      "matched_canonical": {
        "category_id": 5,
        "display_name": "Python",
        "id": 393,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LANGUAGE",
        "slug": "python",
        "sub_category_id": 54,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 862,
      "existing_alias_text": "TensorFlow",
      "input_term": "TensorFlow",
      "matched_canonical": {
        "category_id": 6,
        "display_name": "TensorFlow",
        "id": 558,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "tensorflow",
        "sub_category_id": 456,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 861,
      "existing_alias_text": "PyTorch",
      "input_term": "PyTorch",
      "matched_canonical": {
        "category_id": 6,
        "display_name": "PyTorch",
        "id": 557,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "pytorch",
        "sub_category_id": 456,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 852,
      "existing_alias_text": "scikit-learn",
      "input_term": "Scikit-learn",
      "matched_canonical": {
        "category_id": 6,
        "display_name": "scikit-learn",
        "id": 554,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "scikit-learn",
        "sub_category_id": 458,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 1800,
      "existing_alias_text": "GitHub Actions",
      "input_term": "GitHub Actions",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "GitHub Actions",
        "id": 1250,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "github-actions",
        "sub_category_id": 1019,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 540,
      "existing_alias_text": "Airflow",
      "input_term": "Airflow",
      "matched_canonical": {
        "category_id": 11,
        "display_name": "Airflow",
        "id": 325,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "airflow",
        "sub_category_id": 335,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 290,
      "existing_alias_text": "Terraform",
      "input_term": "Terraform",
      "matched_canonical": {
        "category_id": 11,
        "display_name": "Terraform",
        "id": 144,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "terraform",
        "sub_category_id": 171,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 299,
      "existing_alias_text": "Docker",
      "input_term": "Docker",
      "matched_canonical": {
        "category_id": 11,
        "display_name": "Docker",
        "id": 153,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "docker",
        "sub_category_id": 170,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 304,
      "existing_alias_text": "Kubernetes",
      "input_term": "Kubernetes",
      "matched_canonical": {
        "category_id": 13,
        "display_name": "Kubernetes",
        "id": 158,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "kubernetes",
        "sub_category_id": 1524,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3418,
      "existing_alias_text": "Pinecone",
      "input_term": "Pinecone",
      "matched_canonical": {
        "category_id": 13,
        "display_name": "Pinecone",
        "id": 2621,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "pinecone",
        "sub_category_id": 2138,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3419,
      "existing_alias_text": "OpenSearch",
      "input_term": "OpenSearch",
      "matched_canonical": {
        "category_id": 12,
        "display_name": "OpenSearch",
        "id": 2622,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "opensearch",
        "sub_category_id": 2139,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3420,
      "existing_alias_text": "FAISS",
      "input_term": "FAISS",
      "matched_canonical": {
        "category_id": 6,
        "display_name": "FAISS",
        "id": 2623,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "faiss",
        "sub_category_id": 2140,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    }
  ],
  "candidate_roles": [
    {
      "display_name": "DevOps Engineer",
      "id": 1,
      "rationale": null,
      "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
      "slug": "devops-engineer",
      "source": "db"
    },
    {
      "display_name": "Cybersecurity Engineer",
      "id": 9,
      "rationale": null,
      "role_archetype": null,
      "slug": "cybersecurity-engineer",
      "source": "db"
    },
    {
      "display_name": "AI Engineer",
      "id": 12,
      "rationale": null,
      "role_archetype": null,
      "slug": "ai-engineer",
      "source": "db"
    },
    {
      "display_name": "Data Scientist",
      "id": 7,
      "rationale": null,
      "role_archetype": null,
      "slug": "data-scientist",
      "source": "db"
    },
    {
      "display_name": "MLOps Engineer",
      "id": 5,
      "rationale": null,
      "role_archetype": null,
      "slug": "mlops-engineer",
      "source": "db"
    },
    {
      "display_name": "Machine Learning Engineer",
      "id": 10,
      "rationale": null,
      "role_archetype": null,
      "slug": "machine-learning-engineer",
      "source": "db"
    },
    {
      "display_name": "Data Engineer",
      "id": 6,
      "rationale": null,
      "role_archetype": null,
      "slug": "data-engineer",
      "source": "db"
    },
    {
      "display_name": "Cloud Engineer",
      "id": 18,
      "rationale": null,
      "role_archetype": null,
      "slug": "cloud-engineer",
      "source": "db"
    },
    {
      "display_name": "Storage Engineer",
      "id": 22,
      "rationale": null,
      "role_archetype": null,
      "slug": "storage-engineer",
      "source": "db"
    },
    {
      "display_name": "Backend Engineer",
      "id": 14,
      "rationale": null,
      "role_archetype": null,
      "slug": "backend-engineer",
      "source": "db"
    },
    {
      "display_name": "Data Analyst",
      "id": 20,
      "rationale": null,
      "role_archetype": null,
      "slug": "data-analyst",
      "source": "db"
    },
    {
      "display_name": "Azure Cloud Engineer",
      "id": 4,
      "rationale": null,
      "role_archetype": null,
      "slug": "azure-cloud-engineer",
      "source": "db"
    },
    {
      "display_name": "Virtualization Engineer",
      "id": 26,
      "rationale": null,
      "role_archetype": null,
      "slug": "virtualization-engineer",
      "source": "db"
    },
    {
      "display_name": "Network Engineer",
      "id": 21,
      "rationale": null,
      "role_archetype": null,
      "slug": "network-engineer",
      "source": "db"
    },
    {
      "display_name": "Automation Tester",
      "id": 16,
      "rationale": null,
      "role_archetype": null,
      "slug": "automation-tester",
      "source": "db"
    }
  ],
  "chosen_role": {
    "display_name": "MLOps Engineer",
    "id": 5,
    "rationale": "The primary skills indicate a strong focus on AWS and AI/ML technologies, which align well with the responsibilities of an MLOps Engineer.",
    "role_archetype": null,
    "slug": "mlops-engineer",
    "source": "db"
  },
  "dimensions": [
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Platform Operations",
        "id": 26,
        "rationale": "Uses cloud provider services to support delivery and runtime environments. The focus is on consumer-level operation of cloud services rather than deep cloud architecture ownership.",
        "slug": "cloud-platform-operations",
        "source": "db"
      },
      "input_skill": "AWS",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "DevOps Engineer",
          "id": 1,
          "rationale": null,
          "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
          "slug": "devops-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Security Platforms",
        "id": 332,
        "rationale": "Cloud-native security products used to assess posture, detect misconfigurations, and monitor workloads across AWS, Azure, and GCP. This is a distinct product family because the role often works across multiple CNAPP/CSPM/CWPP offerings and cloud-native detectors.",
        "slug": "cloud-security-platforms",
        "source": "db"
      },
      "input_skill": "AWS",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cybersecurity Engineer",
          "id": 9,
          "rationale": null,
          "role_archetype": null,
          "slug": "cybersecurity-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "AI Inference Cost, Latency, and Throughput Optimization",
        "id": 260,
        "rationale": "Improving the speed, throughput, and cost efficiency of AI and ML-powered product features without sacrificing correctness or user experience. Includes token budgeting, prompt compression, batching, caching, model selection, quantization, pruning, async inference, warm starts, streaming UX, timeout tuning, concurrency control, and profiling. Excludes infrastructure autoscaling, model serving capacity planning, generic backend performance tuning, and unrelated data/warehouse optimization.",
        "slug": "ai-inference-cost-latency-and-throughput-optimization",
        "source": "db"
      },
      "input_skill": "AI/ML",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "AI Engineer",
          "id": 12,
          "rationale": null,
          "role_archetype": null,
          "slug": "ai-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "AI Service Architecture Patterns",
        "id": 270,
        "rationale": "Structuring AI capabilities within the product and service landscape. This includes deciding whether AI logic lives in handlers, workers, gateways, or dedicated orchestration services.",
        "slug": "ai-service-architecture-patterns",
        "source": "db"
      },
      "input_skill": "AI/ML",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "AI Engineer",
          "id": 12,
          "rationale": null,
          "role_archetype": null,
          "slug": "ai-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Applied Machine Learning Toolkits",
        "id": 94,
        "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
        "slug": "applied-machine-learning-toolkits",
        "source": "db"
      },
      "input_skill": "AI/ML",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Scientist",
          "id": 7,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-scientist",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Managed ML Platform Workflows",
        "id": 367,
        "rationale": "Cloud ML platforms for building and operating models end-to-end, including notebooks, experiments, managed training jobs, and pipeline/studio workflows. Examples: SageMaker Studio, SageMaker notebooks, SageMaker Pipelines, managed training jobs.",
        "slug": "d_split_01_01",
        "source": "db"
      },
      "input_skill": "Amazon SageMaker",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Managed Model Hosting and Endpoints",
        "id": 368,
        "rationale": "Cloud-managed services for deploying trained models as online or batch inference endpoints, including endpoint provisioning, batch transform, and rollout coordination. Examples: SageMaker endpoints, SageMaker batch transform.",
        "slug": "d_split_01_02",
        "source": "db"
      },
      "input_skill": "Amazon SageMaker",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Model Serving Deployment and Runtime Packaging",
        "id": 52,
        "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
        "slug": "model-serving-deployment-and-runtime-packaging",
        "source": "db"
      },
      "input_skill": "Amazon SageMaker",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "MLOps Engineer",
          "id": 5,
          "rationale": null,
          "role_archetype": null,
          "slug": "mlops-engineer",
          "source": "db"
        },
        {
          "display_name": "Machine Learning Engineer",
          "id": 10,
          "rationale": null,
          "role_archetype": null,
          "slug": "machine-learning-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Model Runtime Services",
        "id": 121,
        "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
        "slug": "cloud-model-runtime-services",
        "source": "db"
      },
      "input_skill": "Amazon Bedrock",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Machine Learning Engineer",
          "id": 10,
          "rationale": null,
          "role_archetype": null,
          "slug": "machine-learning-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Data Platform Services",
        "id": 81,
        "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
        "slug": "cloud-data-platform-services",
        "source": "db"
      },
      "input_skill": "AWS Lambda",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Version Control Systems",
        "id": 365,
        "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
        "slug": "d_init_01",
        "source": "db"
      },
      "input_skill": "ECS",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Model Runtime Services",
        "id": 121,
        "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
        "slug": "cloud-model-runtime-services",
        "source": "db"
      },
      "input_skill": "EKS",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Machine Learning Engineer",
          "id": 10,
          "rationale": null,
          "role_archetype": null,
          "slug": "machine-learning-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Orchestration Platforms",
        "id": 25,
        "rationale": "Operates the platforms that schedule and run containerized workloads and related deployment primitives. This is separate from image delivery because it concerns runtime placement and service rollout behavior.",
        "slug": "orchestration-platforms",
        "source": "db"
      },
      "input_skill": "EKS",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cloud Engineer",
          "id": 18,
          "rationale": null,
          "role_archetype": null,
          "slug": "cloud-engineer",
          "source": "db"
        },
        {
          "display_name": "DevOps Engineer",
          "id": 1,
          "rationale": null,
          "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
          "slug": "devops-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Provider Core Services",
        "id": 290,
        "rationale": "Core managed services used to provision and operate cloud environments. This is the base cloud surface for compute, storage, networking, and platform primitives the role configures and maintains.",
        "slug": "cloud-provider-core-services",
        "source": "db"
      },
      "input_skill": "EC2",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cloud Engineer",
          "id": 18,
          "rationale": null,
          "role_archetype": null,
          "slug": "cloud-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Data Platform Services",
        "id": 81,
        "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
        "slug": "cloud-data-platform-services",
        "source": "db"
      },
      "input_skill": "AWS Glue",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Data Platform Security and Networking",
        "id": 369,
        "rationale": "Identity, access, secrets, and networking primitives used to support cloud data platforms and pipelines.",
        "slug": "d_split_01_04",
        "source": "db"
      },
      "input_skill": "Amazon Athena",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Data Platform Services",
        "id": 81,
        "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
        "slug": "cloud-data-platform-services",
        "source": "db"
      },
      "input_skill": "Amazon Athena",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Managed ML Platform Workflows",
        "id": 367,
        "rationale": "Cloud ML platforms for building and operating models end-to-end, including notebooks, experiments, managed training jobs, and pipeline/studio workflows. Examples: SageMaker Studio, SageMaker notebooks, SageMaker Pipelines, managed training jobs.",
        "slug": "d_split_01_01",
        "source": "db"
      },
      "input_skill": "Amazon Athena",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Data Warehousing Platforms",
        "id": 72,
        "rationale": "Cloud and on-prem analytical storage systems used to persist curated datasets and serve downstream consumers. This cluster is about the warehouse/lakehouse layer where transformed data is organized for access.",
        "slug": "data-warehousing-platforms",
        "source": "db"
      },
      "input_skill": "Redshift",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Data Platform Services",
        "id": 81,
        "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
        "slug": "cloud-data-platform-services",
        "source": "db"
      },
      "input_skill": "AWS Data Pipeline",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Storage Provisioning and Automation",
        "id": 311,
        "rationale": "Covers the scripts, APIs, and operational workflows used to create, resize, map, and retire storage resources. This cluster is coherent because storage engineers often automate repetitive provisioning and maintenance tasks.",
        "slug": "storage-provisioning-and-automation",
        "source": "db"
      },
      "input_skill": "S3",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Storage Engineer",
          "id": 22,
          "rationale": null,
          "role_archetype": null,
          "slug": "storage-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Streaming Data Processing",
        "id": 69,
        "rationale": "Tools and patterns for ingesting and transforming event streams with low latency. This cluster covers continuous processing, windowing, and stateful stream jobs used to keep data fresh.",
        "slug": "streaming-data-processing",
        "source": "db"
      },
      "input_skill": "Kinesis",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Web Service Frameworks",
        "id": 141,
        "rationale": "Server frameworks used to build HTTP APIs, route requests, validate inputs, and structure backend application code. This cluster is coherent because it defines how backend services expose behavior to clients and other services.",
        "slug": "web-service-frameworks",
        "source": "db"
      },
      "input_skill": "Amazon API Gateway",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Backend Engineer",
          "id": 14,
          "rationale": null,
          "role_archetype": null,
          "slug": "backend-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Analytical Programming Languages",
        "id": 82,
        "rationale": "Languages used to clean, transform, analyze, and prototype models in notebooks and scripts. This is the core coding surface for expressing statistical logic and data manipulation in a reproducible way.",
        "slug": "analytical-programming-languages",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Analyst",
          "id": 20,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-analyst",
          "source": "db"
        },
        {
          "display_name": "Data Scientist",
          "id": 7,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-scientist",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Automation Scripting and CLI",
        "id": 48,
        "rationale": "Uses scripts and command-line tooling to execute repeatable Azure operations and reduce manual work. This is a practical cluster because the role frequently automates provisioning, checks, and remediation tasks.",
        "slug": "automation-scripting-and-cli",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Azure Cloud Engineer",
          "id": 4,
          "rationale": null,
          "role_archetype": null,
          "slug": "azure-cloud-engineer",
          "source": "db"
        },
        {
          "display_name": "Cloud Engineer",
          "id": 18,
          "rationale": null,
          "role_archetype": null,
          "slug": "cloud-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Automation and Scripting for Operations",
        "id": 361,
        "rationale": "Scripts and lightweight automation used to execute repetitive virtualization tasks and enforce operational consistency. This is the practical glue that reduces manual host and VM administration.",
        "slug": "automation-and-scripting-for-operations",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Virtualization Engineer",
          "id": 26,
          "rationale": null,
          "role_archetype": null,
          "slug": "virtualization-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Network Automation and Scripting",
        "id": 285,
        "rationale": "Covers scripts and automation used to configure, validate, and audit network devices and services. This cluster is coherent because repeatable network operations increasingly depend on programmatic changes and checks.",
        "slug": "network-automation-and-scripting",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Network Engineer",
          "id": 21,
          "rationale": null,
          "role_archetype": null,
          "slug": "network-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for AI Workflows",
        "id": 261,
        "rationale": "Languages used to implement AI feature logic, orchestration, and response handling inside product code. This is the core coding surface for turning prompts and model calls into reliable application behavior.",
        "slug": "programming-languages-for-ai-workflows",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "AI Engineer",
          "id": 12,
          "rationale": null,
          "role_archetype": null,
          "slug": "ai-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for Backend Systems",
        "id": 140,
        "rationale": "Languages used to implement server-side business logic, request handlers, workers, and service integrations. This is the core coding surface for backend feature delivery and maintenance.",
        "slug": "programming-languages-for-backend-systems",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Backend Engineer",
          "id": 14,
          "rationale": null,
          "role_archetype": null,
          "slug": "backend-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for Data Work",
        "id": 67,
        "rationale": "Languages used to implement data pipelines, transformations, and operational utilities. This is the code layer for expressing extraction, parsing, validation, and orchestration logic in data engineering workflows.",
        "slug": "programming-languages-for-data-work",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for ML Systems",
        "id": 113,
        "rationale": "Languages used to implement model integration code, inference services, and feature-processing logic. This is the core coding surface for turning trained models into product-facing software components.",
        "slug": "programming-languages-for-ml-systems",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Machine Learning Engineer",
          "id": 10,
          "rationale": null,
          "role_archetype": null,
          "slug": "machine-learning-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for Security Work",
        "id": 328,
        "rationale": "Languages used to automate security tasks, write detection logic, and build analysis or remediation tooling. This is the core coding surface for a cybersecurity engineer across scripts, queries, and small utilities.",
        "slug": "programming-languages-for-security-work",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cybersecurity Engineer",
          "id": 9,
          "rationale": null,
          "role_archetype": null,
          "slug": "cybersecurity-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for Test Automation",
        "id": 193,
        "rationale": "Languages used to implement automated checks, helper utilities, and test harness code. This is the core coding surface for turning test ideas into maintainable automation.",
        "slug": "programming-languages-for-test-automation",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Automation Tester",
          "id": 16,
          "rationale": null,
          "role_archetype": null,
          "slug": "automation-tester",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Security Automation and Scripting",
        "id": 258,
        "rationale": "Automating repeatable security checks, enrichment, and remediation workflows. This cluster is coherent because the role often needs lightweight automation to scale analysis and response.",
        "slug": "security-automation-and-scripting",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cybersecurity Engineer",
          "id": 9,
          "rationale": null,
          "role_archetype": null,
          "slug": "cybersecurity-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Applied Machine Learning Toolkits",
        "id": 94,
        "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
        "slug": "applied-machine-learning-toolkits",
        "source": "db"
      },
      "input_skill": "TensorFlow",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Scientist",
          "id": 7,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-scientist",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Applied Machine Learning Toolkits",
        "id": 94,
        "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
        "slug": "applied-machine-learning-toolkits",
        "source": "db"
      },
      "input_skill": "PyTorch",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Scientist",
          "id": 7,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-scientist",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Applied Machine Learning Toolkits",
        "id": 94,
        "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
        "slug": "applied-machine-learning-toolkits",
        "source": "db"
      },
      "input_skill": "Scikit-learn",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Scientist",
          "id": 7,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-scientist",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Continuous Integration Test Integration",
        "id": 207,
        "rationale": "Integrating automated checks into shared build and merge workflows so results are repeatable and visible. This cluster is coherent because automation testers commonly configure test execution triggers, artifacts, and reporting hooks.",
        "slug": "continuous-integration-test-integration",
        "source": "db"
      },
      "input_skill": "GitHub Actions",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Automation Tester",
          "id": 16,
          "rationale": null,
          "role_archetype": null,
          "slug": "automation-tester",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Workflow Orchestration Systems",
        "id": 64,
        "rationale": "Operational orchestration of ML jobs, dependencies, and handoffs across training, validation, deployment, and retraining. This is a useful split from training pipelines because it emphasizes the scheduler and control plane.",
        "slug": "workflow-orchestration-systems",
        "source": "db"
      },
      "input_skill": "Airflow",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        },
        {
          "display_name": "MLOps Engineer",
          "id": 5,
          "rationale": null,
          "role_archetype": null,
          "slug": "mlops-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Infrastructure Provisioning Templates",
        "id": 291,
        "rationale": "Declarative templates and modules used to create repeatable cloud resources and environments. This cluster covers the infrastructure definitions the role applies, reviews, and updates to keep environments consistent.",
        "slug": "infrastructure-provisioning-templates",
        "source": "db"
      },
      "input_skill": "Terraform",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cloud Engineer",
          "id": 18,
          "rationale": null,
          "role_archetype": null,
          "slug": "cloud-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Infrastructure as Code",
        "id": 22,
        "rationale": "Defines infrastructure and platform resources through versioned code so environments are repeatable and reviewable. This is a coherent cluster because it underpins environment consistency and change control.",
        "slug": "infrastructure-as-code",
        "source": "db"
      },
      "input_skill": "Terraform",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "DevOps Engineer",
          "id": 1,
          "rationale": null,
          "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
          "slug": "devops-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Infrastructure as Code and Declarative Provisioning",
        "id": 36,
        "rationale": "Defines cloud and platform infrastructure declaratively through versioned code so environments are repeatable, reviewable, and automatable. This includes authoring and maintaining IaC templates/modules, managing parameters and state, and using plan/apply workflows to provision and update resources across Azure and other cloud platforms.",
        "slug": "infrastructure-as-code-and-declarative-provisioning",
        "source": "db"
      },
      "input_skill": "Terraform",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Azure Cloud Engineer",
          "id": 4,
          "rationale": null,
          "role_archetype": null,
          "slug": "azure-cloud-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Containerization and Image Delivery",
        "id": 24,
        "rationale": "Builds, packages, and ships application and support workloads as container images. This cluster covers the artifact format and the mechanics of producing deployable images.",
        "slug": "containerization-and-image-delivery",
        "source": "db"
      },
      "input_skill": "Docker",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "DevOps Engineer",
          "id": 1,
          "rationale": null,
          "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
          "slug": "devops-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Model Serving Deployment and Runtime Packaging",
        "id": 52,
        "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
        "slug": "model-serving-deployment-and-runtime-packaging",
        "source": "db"
      },
      "input_skill": "Docker",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "MLOps Engineer",
          "id": 5,
          "rationale": null,
          "role_archetype": null,
          "slug": "mlops-engineer",
          "source": "db"
        },
        {
          "display_name": "Machine Learning Engineer",
          "id": 10,
          "rationale": null,
          "role_archetype": null,
          "slug": "machine-learning-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Orchestration Platforms",
        "id": 25,
        "rationale": "Operates the platforms that schedule and run containerized workloads and related deployment primitives. This is separate from image delivery because it concerns runtime placement and service rollout behavior.",
        "slug": "orchestration-platforms",
        "source": "db"
      },
      "input_skill": "Kubernetes",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cloud Engineer",
          "id": 18,
          "rationale": null,
          "role_archetype": null,
          "slug": "cloud-engineer",
          "source": "db"
        },
        {
          "display_name": "DevOps Engineer",
          "id": 1,
          "rationale": null,
          "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
          "slug": "devops-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Model Runtime Services",
        "id": 121,
        "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
        "slug": "cloud-model-runtime-services",
        "source": "db"
      },
      "input_skill": "Pinecone",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Machine Learning Engineer",
          "id": 10,
          "rationale": null,
          "role_archetype": null,
          "slug": "machine-learning-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Data Platform Services",
        "id": 81,
        "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
        "slug": "cloud-data-platform-services",
        "source": "db"
      },
      "input_skill": "OpenSearch",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Version Control Systems",
        "id": 365,
        "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
        "slug": "d_init_01",
        "source": "db"
      },
      "input_skill": "OpenSearch",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Applied Machine Learning Toolkits",
        "id": 94,
        "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
        "slug": "applied-machine-learning-toolkits",
        "source": "db"
      },
      "input_skill": "FAISS",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Scientist",
          "id": 7,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-scientist",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Version Control Systems",
        "id": 365,
        "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
        "slug": "d_init_01",
        "source": "db"
      },
      "input_skill": "FAISS",
      "llm_role": null,
      "roles_from_db": []
    }
  ],
  "input_final_skills": [
    "AWS",
    "AI/ML",
    "Amazon SageMaker",
    "Amazon Bedrock",
    "AWS Lambda",
    "ECS",
    "EKS",
    "EC2",
    "AWS Glue",
    "Amazon Athena",
    "Redshift",
    "AWS Data Pipeline",
    "S3",
    "Kinesis",
    "Amazon API Gateway",
    "Python",
    "TensorFlow",
    "PyTorch",
    "Scikit-learn",
    "GitHub Actions",
    "Airflow",
    "Terraform",
    "Docker",
    "Kubernetes",
    "Pinecone",
    "OpenSearch",
    "FAISS"
  ],
  "input_llm_skills": [
    "AWS",
    "AI/ML",
    "Amazon SageMaker",
    "Amazon Bedrock",
    "AWS Lambda",
    "ECS",
    "EKS",
    "EC2",
    "AWS Glue",
    "Amazon Athena",
    "Redshift",
    "AWS Data Pipeline",
    "S3",
    "Kinesis",
    "Amazon API Gateway",
    "Python",
    "TensorFlow",
    "PyTorch",
    "Scikit-learn",
    "GitHub Actions",
    "Airflow",
    "Terraform",
    "Docker",
    "Kubernetes",
    "Pinecone",
    "OpenSearch",
    "FAISS"
  ],
  "new_aliases_persisted": 0,
  "run_id": "832dda01-52e1-4739-b122-0a9cf9bc8100",
  "skills_detail": [
    {
      "aliases_in_db": [
        {
          "alias_text": "AWS",
          "alias_type": "CANONICAL",
          "id": 348,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 13,
        "display_name": "AWS",
        "id": 163,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "aws",
        "sub_category_id": 161,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Platform Operations",
            "id": 26,
            "rationale": "Uses cloud provider services to support delivery and runtime environments. The focus is on consumer-level operation of cloud services rather than deep cloud architecture ownership.",
            "slug": "cloud-platform-operations",
            "source": "db"
          },
          "input_skill": "AWS",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "DevOps Engineer",
              "id": 1,
              "rationale": null,
              "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
              "slug": "devops-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Security Platforms",
            "id": 332,
            "rationale": "Cloud-native security products used to assess posture, detect misconfigurations, and monitor workloads across AWS, Azure, and GCP. This is a distinct product family because the role often works across multiple CNAPP/CSPM/CWPP offerings and cloud-native detectors.",
            "slug": "cloud-security-platforms",
            "source": "db"
          },
          "input_skill": "AWS",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cybersecurity Engineer",
              "id": 9,
              "rationale": null,
              "role_archetype": null,
              "slug": "cybersecurity-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "AWS",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "AI/ML",
          "alias_type": "CANONICAL",
          "id": 3408,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 16,
        "display_name": "AI/ML",
        "id": 2611,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CONCEPT",
        "slug": "ai-ml",
        "sub_category_id": 2131,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "AI Inference Cost, Latency, and Throughput Optimization",
            "id": 260,
            "rationale": "Improving the speed, throughput, and cost efficiency of AI and ML-powered product features without sacrificing correctness or user experience. Includes token budgeting, prompt compression, batching, caching, model selection, quantization, pruning, async inference, warm starts, streaming UX, timeout tuning, concurrency control, and profiling. Excludes infrastructure autoscaling, model serving capacity planning, generic backend performance tuning, and unrelated data/warehouse optimization.",
            "slug": "ai-inference-cost-latency-and-throughput-optimization",
            "source": "db"
          },
          "input_skill": "AI/ML",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "AI Engineer",
              "id": 12,
              "rationale": null,
              "role_archetype": null,
              "slug": "ai-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "AI Service Architecture Patterns",
            "id": 270,
            "rationale": "Structuring AI capabilities within the product and service landscape. This includes deciding whether AI logic lives in handlers, workers, gateways, or dedicated orchestration services.",
            "slug": "ai-service-architecture-patterns",
            "source": "db"
          },
          "input_skill": "AI/ML",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "AI Engineer",
              "id": 12,
              "rationale": null,
              "role_archetype": null,
              "slug": "ai-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Applied Machine Learning Toolkits",
            "id": 94,
            "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
            "slug": "applied-machine-learning-toolkits",
            "source": "db"
          },
          "input_skill": "AI/ML",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Scientist",
              "id": 7,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-scientist",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "AI/ML",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Amazon SageMaker",
          "alias_type": "CANONICAL",
          "id": 3409,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 13,
        "display_name": "Amazon SageMaker",
        "id": 2612,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "amazon-sagemaker",
        "sub_category_id": 326,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Managed ML Platform Workflows",
            "id": 367,
            "rationale": "Cloud ML platforms for building and operating models end-to-end, including notebooks, experiments, managed training jobs, and pipeline/studio workflows. Examples: SageMaker Studio, SageMaker notebooks, SageMaker Pipelines, managed training jobs.",
            "slug": "d_split_01_01",
            "source": "db"
          },
          "input_skill": "Amazon SageMaker",
          "llm_role": null,
          "roles_from_db": []
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Managed Model Hosting and Endpoints",
            "id": 368,
            "rationale": "Cloud-managed services for deploying trained models as online or batch inference endpoints, including endpoint provisioning, batch transform, and rollout coordination. Examples: SageMaker endpoints, SageMaker batch transform.",
            "slug": "d_split_01_02",
            "source": "db"
          },
          "input_skill": "Amazon SageMaker",
          "llm_role": null,
          "roles_from_db": []
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Model Serving Deployment and Runtime Packaging",
            "id": 52,
            "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
            "slug": "model-serving-deployment-and-runtime-packaging",
            "source": "db"
          },
          "input_skill": "Amazon SageMaker",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "MLOps Engineer",
              "id": 5,
              "rationale": null,
              "role_archetype": null,
              "slug": "mlops-engineer",
              "source": "db"
            },
            {
              "display_name": "Machine Learning Engineer",
              "id": 10,
              "rationale": null,
              "role_archetype": null,
              "slug": "machine-learning-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Amazon SageMaker",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Amazon Bedrock",
          "alias_type": "CANONICAL",
          "id": 3410,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "Amazon Bedrock",
        "id": 2613,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "amazon-bedrock",
        "sub_category_id": 2132,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Model Runtime Services",
            "id": 121,
            "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
            "slug": "cloud-model-runtime-services",
            "source": "db"
          },
          "input_skill": "Amazon Bedrock",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Machine Learning Engineer",
              "id": 10,
              "rationale": null,
              "role_archetype": null,
              "slug": "machine-learning-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Amazon Bedrock",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "AWS Lambda",
          "alias_type": "CANONICAL",
          "id": 3411,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "AWS Lambda",
        "id": 2614,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "aws-lambda",
        "sub_category_id": 262,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Data Platform Services",
            "id": 81,
            "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
            "slug": "cloud-data-platform-services",
            "source": "db"
          },
          "input_skill": "AWS Lambda",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "AWS Lambda",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "ECS",
          "alias_type": "CANONICAL",
          "id": 3412,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 2,
        "display_name": "ECS",
        "id": 2615,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CONCEPT",
        "slug": "ecs",
        "sub_category_id": 2133,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Version Control Systems",
            "id": 365,
            "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
            "slug": "d_init_01",
            "source": "db"
          },
          "input_skill": "ECS",
          "llm_role": null,
          "roles_from_db": []
        }
      ],
      "input_skill": "ECS",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "EKS",
          "alias_type": "CANONICAL",
          "id": 1093,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "EKS",
        "id": 725,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "eks",
        "sub_category_id": 251,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Model Runtime Services",
            "id": 121,
            "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
            "slug": "cloud-model-runtime-services",
            "source": "db"
          },
          "input_skill": "EKS",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Machine Learning Engineer",
              "id": 10,
              "rationale": null,
              "role_archetype": null,
              "slug": "machine-learning-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Orchestration Platforms",
            "id": 25,
            "rationale": "Operates the platforms that schedule and run containerized workloads and related deployment primitives. This is separate from image delivery because it concerns runtime placement and service rollout behavior.",
            "slug": "orchestration-platforms",
            "source": "db"
          },
          "input_skill": "EKS",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cloud Engineer",
              "id": 18,
              "rationale": null,
              "role_archetype": null,
              "slug": "cloud-engineer",
              "source": "db"
            },
            {
              "display_name": "DevOps Engineer",
              "id": 1,
              "rationale": null,
              "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
              "slug": "devops-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "EKS",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "EC2",
          "alias_type": "CANONICAL",
          "id": 2372,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "EC2",
        "id": 1773,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "ec2",
        "sub_category_id": 1544,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Provider Core Services",
            "id": 290,
            "rationale": "Core managed services used to provision and operate cloud environments. This is the base cloud surface for compute, storage, networking, and platform primitives the role configures and maintains.",
            "slug": "cloud-provider-core-services",
            "source": "db"
          },
          "input_skill": "EC2",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cloud Engineer",
              "id": 18,
              "rationale": null,
              "role_archetype": null,
              "slug": "cloud-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "EC2",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "AWS Glue",
          "alias_type": "CANONICAL",
          "id": 730,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "AWS Glue",
        "id": 466,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "aws-glue",
        "sub_category_id": 385,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Data Platform Services",
            "id": 81,
            "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
            "slug": "cloud-data-platform-services",
            "source": "db"
          },
          "input_skill": "AWS Glue",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "AWS Glue",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Amazon Athena",
          "alias_type": "CANONICAL",
          "id": 3413,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "Amazon Athena",
        "id": 2616,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "amazon-athena",
        "sub_category_id": 388,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Data Platform Security and Networking",
            "id": 369,
            "rationale": "Identity, access, secrets, and networking primitives used to support cloud data platforms and pipelines.",
            "slug": "d_split_01_04",
            "source": "db"
          },
          "input_skill": "Amazon Athena",
          "llm_role": null,
          "roles_from_db": []
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Data Platform Services",
            "id": 81,
            "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
            "slug": "cloud-data-platform-services",
            "source": "db"
          },
          "input_skill": "Amazon Athena",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Managed ML Platform Workflows",
            "id": 367,
            "rationale": "Cloud ML platforms for building and operating models end-to-end, including notebooks, experiments, managed training jobs, and pipeline/studio workflows. Examples: SageMaker Studio, SageMaker notebooks, SageMaker Pipelines, managed training jobs.",
            "slug": "d_split_01_01",
            "source": "db"
          },
          "input_skill": "Amazon Athena",
          "llm_role": null,
          "roles_from_db": []
        }
      ],
      "input_skill": "Amazon Athena",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Redshift",
          "alias_type": "CANONICAL",
          "id": 3367,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 13,
        "display_name": "Redshift",
        "id": 2570,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "redshift",
        "sub_category_id": 2098,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Data Warehousing Platforms",
            "id": 72,
            "rationale": "Cloud and on-prem analytical storage systems used to persist curated datasets and serve downstream consumers. This cluster is about the warehouse/lakehouse layer where transformed data is organized for access.",
            "slug": "data-warehousing-platforms",
            "source": "db"
          },
          "input_skill": "Redshift",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Redshift",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "AWS Data Pipeline",
          "alias_type": "CANONICAL",
          "id": 3414,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "AWS Data Pipeline",
        "id": 2617,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "aws-data-pipeline",
        "sub_category_id": 2134,
        "typical_lifespan": "SHORT_LIVED",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Data Platform Services",
            "id": 81,
            "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
            "slug": "cloud-data-platform-services",
            "source": "db"
          },
          "input_skill": "AWS Data Pipeline",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "AWS Data Pipeline",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "S3",
          "alias_type": "CANONICAL",
          "id": 3415,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 13,
        "display_name": "S3",
        "id": 2618,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "s3",
        "sub_category_id": 2135,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Storage Provisioning and Automation",
            "id": 311,
            "rationale": "Covers the scripts, APIs, and operational workflows used to create, resize, map, and retire storage resources. This cluster is coherent because storage engineers often automate repetitive provisioning and maintenance tasks.",
            "slug": "storage-provisioning-and-automation",
            "source": "db"
          },
          "input_skill": "S3",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Storage Engineer",
              "id": 22,
              "rationale": null,
              "role_archetype": null,
              "slug": "storage-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "S3",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Kinesis",
          "alias_type": "CANONICAL",
          "id": 3416,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "Kinesis",
        "id": 2619,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "kinesis",
        "sub_category_id": 2136,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Streaming Data Processing",
            "id": 69,
            "rationale": "Tools and patterns for ingesting and transforming event streams with low latency. This cluster covers continuous processing, windowing, and stateful stream jobs used to keep data fresh.",
            "slug": "streaming-data-processing",
            "source": "db"
          },
          "input_skill": "Kinesis",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Kinesis",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Amazon API Gateway",
          "alias_type": "CANONICAL",
          "id": 3417,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "Amazon API Gateway",
        "id": 2620,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "amazon-api-gateway",
        "sub_category_id": 2137,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Web Service Frameworks",
            "id": 141,
            "rationale": "Server frameworks used to build HTTP APIs, route requests, validate inputs, and structure backend application code. This cluster is coherent because it defines how backend services expose behavior to clients and other services.",
            "slug": "web-service-frameworks",
            "source": "db"
          },
          "input_skill": "Amazon API Gateway",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Backend Engineer",
              "id": 14,
              "rationale": null,
              "role_archetype": null,
              "slug": "backend-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Amazon API Gateway",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Python",
          "alias_type": "CANONICAL",
          "id": 608,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 2",
          "alias_type": "VERSION",
          "id": 611,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 2.x",
          "alias_type": "VERSION",
          "id": 613,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 3",
          "alias_type": "VERSION",
          "id": 612,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 3.10",
          "alias_type": "VERSION",
          "id": 2330,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 3.11",
          "alias_type": "VERSION",
          "id": 2331,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 3.12",
          "alias_type": "VERSION",
          "id": 2332,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 3.x",
          "alias_type": "VERSION",
          "id": 614,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "py2",
          "alias_type": "VERSION",
          "id": 609,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "py3",
          "alias_type": "VERSION",
          "id": 610,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 2",
          "alias_type": "VERSION",
          "id": 2152,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 2.x",
          "alias_type": "VERSION",
          "id": 2154,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 3",
          "alias_type": "VERSION",
          "id": 990,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 3.10",
          "alias_type": "VERSION",
          "id": 992,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 3.11",
          "alias_type": "VERSION",
          "id": 993,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 3.12",
          "alias_type": "VERSION",
          "id": 994,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 3.x",
          "alias_type": "VERSION",
          "id": 991,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python2",
          "alias_type": "VERSION",
          "id": 2150,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python3",
          "alias_type": "VERSION",
          "id": 989,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 5,
        "display_name": "Python",
        "id": 393,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LANGUAGE",
        "slug": "python",
        "sub_category_id": 54,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Analytical Programming Languages",
            "id": 82,
            "rationale": "Languages used to clean, transform, analyze, and prototype models in notebooks and scripts. This is the core coding surface for expressing statistical logic and data manipulation in a reproducible way.",
            "slug": "analytical-programming-languages",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Analyst",
              "id": 20,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-analyst",
              "source": "db"
            },
            {
              "display_name": "Data Scientist",
              "id": 7,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-scientist",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Automation Scripting and CLI",
            "id": 48,
            "rationale": "Uses scripts and command-line tooling to execute repeatable Azure operations and reduce manual work. This is a practical cluster because the role frequently automates provisioning, checks, and remediation tasks.",
            "slug": "automation-scripting-and-cli",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Azure Cloud Engineer",
              "id": 4,
              "rationale": null,
              "role_archetype": null,
              "slug": "azure-cloud-engineer",
              "source": "db"
            },
            {
              "display_name": "Cloud Engineer",
              "id": 18,
              "rationale": null,
              "role_archetype": null,
              "slug": "cloud-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Automation and Scripting for Operations",
            "id": 361,
            "rationale": "Scripts and lightweight automation used to execute repetitive virtualization tasks and enforce operational consistency. This is the practical glue that reduces manual host and VM administration.",
            "slug": "automation-and-scripting-for-operations",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Virtualization Engineer",
              "id": 26,
              "rationale": null,
              "role_archetype": null,
              "slug": "virtualization-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Network Automation and Scripting",
            "id": 285,
            "rationale": "Covers scripts and automation used to configure, validate, and audit network devices and services. This cluster is coherent because repeatable network operations increasingly depend on programmatic changes and checks.",
            "slug": "network-automation-and-scripting",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Network Engineer",
              "id": 21,
              "rationale": null,
              "role_archetype": null,
              "slug": "network-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for AI Workflows",
            "id": 261,
            "rationale": "Languages used to implement AI feature logic, orchestration, and response handling inside product code. This is the core coding surface for turning prompts and model calls into reliable application behavior.",
            "slug": "programming-languages-for-ai-workflows",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "AI Engineer",
              "id": 12,
              "rationale": null,
              "role_archetype": null,
              "slug": "ai-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for Backend Systems",
            "id": 140,
            "rationale": "Languages used to implement server-side business logic, request handlers, workers, and service integrations. This is the core coding surface for backend feature delivery and maintenance.",
            "slug": "programming-languages-for-backend-systems",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Backend Engineer",
              "id": 14,
              "rationale": null,
              "role_archetype": null,
              "slug": "backend-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for Data Work",
            "id": 67,
            "rationale": "Languages used to implement data pipelines, transformations, and operational utilities. This is the code layer for expressing extraction, parsing, validation, and orchestration logic in data engineering workflows.",
            "slug": "programming-languages-for-data-work",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for ML Systems",
            "id": 113,
            "rationale": "Languages used to implement model integration code, inference services, and feature-processing logic. This is the core coding surface for turning trained models into product-facing software components.",
            "slug": "programming-languages-for-ml-systems",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Machine Learning Engineer",
              "id": 10,
              "rationale": null,
              "role_archetype": null,
              "slug": "machine-learning-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for Security Work",
            "id": 328,
            "rationale": "Languages used to automate security tasks, write detection logic, and build analysis or remediation tooling. This is the core coding surface for a cybersecurity engineer across scripts, queries, and small utilities.",
            "slug": "programming-languages-for-security-work",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cybersecurity Engineer",
              "id": 9,
              "rationale": null,
              "role_archetype": null,
              "slug": "cybersecurity-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for Test Automation",
            "id": 193,
            "rationale": "Languages used to implement automated checks, helper utilities, and test harness code. This is the core coding surface for turning test ideas into maintainable automation.",
            "slug": "programming-languages-for-test-automation",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Automation Tester",
              "id": 16,
              "rationale": null,
              "role_archetype": null,
              "slug": "automation-tester",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Security Automation and Scripting",
            "id": 258,
            "rationale": "Automating repeatable security checks, enrichment, and remediation workflows. This cluster is coherent because the role often needs lightweight automation to scale analysis and response.",
            "slug": "security-automation-and-scripting",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cybersecurity Engineer",
              "id": 9,
              "rationale": null,
              "role_archetype": null,
              "slug": "cybersecurity-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Python",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "TensorFlow",
          "alias_type": "CANONICAL",
          "id": 862,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TF1",
          "alias_type": "VERSION",
          "id": 863,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TF2",
          "alias_type": "VERSION",
          "id": 864,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TensorFlow 1",
          "alias_type": "VERSION",
          "id": 865,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TensorFlow 1.x",
          "alias_type": "VERSION",
          "id": 867,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TensorFlow 2",
          "alias_type": "VERSION",
          "id": 866,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TensorFlow 2.x",
          "alias_type": "VERSION",
          "id": 868,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 6,
        "display_name": "TensorFlow",
        "id": 558,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "tensorflow",
        "sub_category_id": 456,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Applied Machine Learning Toolkits",
            "id": 94,
            "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
            "slug": "applied-machine-learning-toolkits",
            "source": "db"
          },
          "input_skill": "TensorFlow",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Scientist",
              "id": 7,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-scientist",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "TensorFlow",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "PyTorch",
          "alias_type": "CANONICAL",
          "id": 861,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 6,
        "display_name": "PyTorch",
        "id": 557,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "pytorch",
        "sub_category_id": 456,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Applied Machine Learning Toolkits",
            "id": 94,
            "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
            "slug": "applied-machine-learning-toolkits",
            "source": "db"
          },
          "input_skill": "PyTorch",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Scientist",
              "id": 7,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-scientist",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "PyTorch",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "scikit-learn",
          "alias_type": "CANONICAL",
          "id": 852,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 6,
        "display_name": "scikit-learn",
        "id": 554,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "scikit-learn",
        "sub_category_id": 458,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Applied Machine Learning Toolkits",
            "id": 94,
            "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
            "slug": "applied-machine-learning-toolkits",
            "source": "db"
          },
          "input_skill": "Scikit-learn",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Scientist",
              "id": 7,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-scientist",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Scikit-learn",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "GitHub Actions",
          "alias_type": "CANONICAL",
          "id": 1800,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "GitHub Actions",
        "id": 1250,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "github-actions",
        "sub_category_id": 1019,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Continuous Integration Test Integration",
            "id": 207,
            "rationale": "Integrating automated checks into shared build and merge workflows so results are repeatable and visible. This cluster is coherent because automation testers commonly configure test execution triggers, artifacts, and reporting hooks.",
            "slug": "continuous-integration-test-integration",
            "source": "db"
          },
          "input_skill": "GitHub Actions",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Automation Tester",
              "id": 16,
              "rationale": null,
              "role_archetype": null,
              "slug": "automation-tester",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "GitHub Actions",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Airflow",
          "alias_type": "CANONICAL",
          "id": 540,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 11,
        "display_name": "Airflow",
        "id": 325,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "airflow",
        "sub_category_id": 335,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Workflow Orchestration Systems",
            "id": 64,
            "rationale": "Operational orchestration of ML jobs, dependencies, and handoffs across training, validation, deployment, and retraining. This is a useful split from training pipelines because it emphasizes the scheduler and control plane.",
            "slug": "workflow-orchestration-systems",
            "source": "db"
          },
          "input_skill": "Airflow",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            },
            {
              "display_name": "MLOps Engineer",
              "id": 5,
              "rationale": null,
              "role_archetype": null,
              "slug": "mlops-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Airflow",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Terraform",
          "alias_type": "CANONICAL",
          "id": 290,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 11,
        "display_name": "Terraform",
        "id": 144,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "terraform",
        "sub_category_id": 171,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Infrastructure Provisioning Templates",
            "id": 291,
            "rationale": "Declarative templates and modules used to create repeatable cloud resources and environments. This cluster covers the infrastructure definitions the role applies, reviews, and updates to keep environments consistent.",
            "slug": "infrastructure-provisioning-templates",
            "source": "db"
          },
          "input_skill": "Terraform",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cloud Engineer",
              "id": 18,
              "rationale": null,
              "role_archetype": null,
              "slug": "cloud-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Infrastructure as Code",
            "id": 22,
            "rationale": "Defines infrastructure and platform resources through versioned code so environments are repeatable and reviewable. This is a coherent cluster because it underpins environment consistency and change control.",
            "slug": "infrastructure-as-code",
            "source": "db"
          },
          "input_skill": "Terraform",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "DevOps Engineer",
              "id": 1,
              "rationale": null,
              "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
              "slug": "devops-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Infrastructure as Code and Declarative Provisioning",
            "id": 36,
            "rationale": "Defines cloud and platform infrastructure declaratively through versioned code so environments are repeatable, reviewable, and automatable. This includes authoring and maintaining IaC templates/modules, managing parameters and state, and using plan/apply workflows to provision and update resources across Azure and other cloud platforms.",
            "slug": "infrastructure-as-code-and-declarative-provisioning",
            "source": "db"
          },
          "input_skill": "Terraform",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Azure Cloud Engineer",
              "id": 4,
              "rationale": null,
              "role_archetype": null,
              "slug": "azure-cloud-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Terraform",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Docker",
          "alias_type": "CANONICAL",
          "id": 299,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 11,
        "display_name": "Docker",
        "id": 153,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "docker",
        "sub_category_id": 170,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Containerization and Image Delivery",
            "id": 24,
            "rationale": "Builds, packages, and ships application and support workloads as container images. This cluster covers the artifact format and the mechanics of producing deployable images.",
            "slug": "containerization-and-image-delivery",
            "source": "db"
          },
          "input_skill": "Docker",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "DevOps Engineer",
              "id": 1,
              "rationale": null,
              "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
              "slug": "devops-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Model Serving Deployment and Runtime Packaging",
            "id": 52,
            "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
            "slug": "model-serving-deployment-and-runtime-packaging",
            "source": "db"
          },
          "input_skill": "Docker",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "MLOps Engineer",
              "id": 5,
              "rationale": null,
              "role_archetype": null,
              "slug": "mlops-engineer",
              "source": "db"
            },
            {
              "display_name": "Machine Learning Engineer",
              "id": 10,
              "rationale": null,
              "role_archetype": null,
              "slug": "machine-learning-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Docker",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Kubernetes",
          "alias_type": "CANONICAL",
          "id": 304,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.0",
          "alias_type": "VERSION",
          "id": 307,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.0+",
          "alias_type": "VERSION",
          "id": 2366,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.1",
          "alias_type": "VERSION",
          "id": 308,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.10",
          "alias_type": "VERSION",
          "id": 318,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.11",
          "alias_type": "VERSION",
          "id": 319,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.12",
          "alias_type": "VERSION",
          "id": 320,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.13",
          "alias_type": "VERSION",
          "id": 321,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.14",
          "alias_type": "VERSION",
          "id": 322,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.15",
          "alias_type": "VERSION",
          "id": 323,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.16",
          "alias_type": "VERSION",
          "id": 324,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.17",
          "alias_type": "VERSION",
          "id": 325,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.18",
          "alias_type": "VERSION",
          "id": 326,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.19",
          "alias_type": "VERSION",
          "id": 327,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.2",
          "alias_type": "VERSION",
          "id": 309,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.20",
          "alias_type": "VERSION",
          "id": 328,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.21",
          "alias_type": "VERSION",
          "id": 329,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.22",
          "alias_type": "VERSION",
          "id": 330,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.23",
          "alias_type": "VERSION",
          "id": 331,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.24",
          "alias_type": "VERSION",
          "id": 332,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.25",
          "alias_type": "VERSION",
          "id": 333,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.26",
          "alias_type": "VERSION",
          "id": 334,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.27",
          "alias_type": "VERSION",
          "id": 335,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.28",
          "alias_type": "VERSION",
          "id": 336,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.29",
          "alias_type": "VERSION",
          "id": 337,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.3",
          "alias_type": "VERSION",
          "id": 310,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.30",
          "alias_type": "VERSION",
          "id": 338,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.4",
          "alias_type": "VERSION",
          "id": 311,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.5",
          "alias_type": "VERSION",
          "id": 312,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.6",
          "alias_type": "VERSION",
          "id": 313,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.7",
          "alias_type": "VERSION",
          "id": 314,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.8",
          "alias_type": "VERSION",
          "id": 315,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.9",
          "alias_type": "VERSION",
          "id": 316,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.x",
          "alias_type": "VERSION",
          "id": 317,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes v1",
          "alias_type": "VERSION",
          "id": 306,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "k8s",
          "alias_type": "VERSION",
          "id": 305,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 13,
        "display_name": "Kubernetes",
        "id": 158,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "kubernetes",
        "sub_category_id": 1524,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Orchestration Platforms",
            "id": 25,
            "rationale": "Operates the platforms that schedule and run containerized workloads and related deployment primitives. This is separate from image delivery because it concerns runtime placement and service rollout behavior.",
            "slug": "orchestration-platforms",
            "source": "db"
          },
          "input_skill": "Kubernetes",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cloud Engineer",
              "id": 18,
              "rationale": null,
              "role_archetype": null,
              "slug": "cloud-engineer",
              "source": "db"
            },
            {
              "display_name": "DevOps Engineer",
              "id": 1,
              "rationale": null,
              "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
              "slug": "devops-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Kubernetes",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Pinecone",
          "alias_type": "CANONICAL",
          "id": 3418,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 13,
        "display_name": "Pinecone",
        "id": 2621,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "pinecone",
        "sub_category_id": 2138,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Model Runtime Services",
            "id": 121,
            "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
            "slug": "cloud-model-runtime-services",
            "source": "db"
          },
          "input_skill": "Pinecone",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Machine Learning Engineer",
              "id": 10,
              "rationale": null,
              "role_archetype": null,
              "slug": "machine-learning-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Pinecone",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "OpenSearch",
          "alias_type": "CANONICAL",
          "id": 3419,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 12,
        "display_name": "OpenSearch",
        "id": 2622,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "opensearch",
        "sub_category_id": 2139,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Data Platform Services",
            "id": 81,
            "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
            "slug": "cloud-data-platform-services",
            "source": "db"
          },
          "input_skill": "OpenSearch",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Version Control Systems",
            "id": 365,
            "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
            "slug": "d_init_01",
            "source": "db"
          },
          "input_skill": "OpenSearch",
          "llm_role": null,
          "roles_from_db": []
        }
      ],
      "input_skill": "OpenSearch",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "FAISS",
          "alias_type": "CANONICAL",
          "id": 3420,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 6,
        "display_name": "FAISS",
        "id": 2623,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "faiss",
        "sub_category_id": 2140,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Applied Machine Learning Toolkits",
            "id": 94,
            "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
            "slug": "applied-machine-learning-toolkits",
            "source": "db"
          },
          "input_skill": "FAISS",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Scientist",
              "id": 7,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-scientist",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Version Control Systems",
            "id": 365,
            "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
            "slug": "d_init_01",
            "source": "db"
          },
          "input_skill": "FAISS",
          "llm_role": null,
          "roles_from_db": []
        }
      ],
      "input_skill": "FAISS",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    }
  ],
  "unmatched_skills": []
}

API 3 — final-role-output

{
  "chosen_role": {
    "display_name": "MLOps Engineer",
    "id": 5,
    "rationale": "The primary skills indicate a strong focus on AWS and AI/ML technologies, which align well with the responsibilities of an MLOps Engineer.",
    "role_archetype": null,
    "slug": "mlops-engineer",
    "source": "db"
  },
  "chosen_role_resolution": "in_db",
  "final_input_skills": [
    {
      "skill": "AWS",
      "tag": "in_db"
    },
    {
      "skill": "AI/ML",
      "tag": "in_db"
    },
    {
      "skill": "Amazon SageMaker",
      "tag": "in_db"
    },
    {
      "skill": "Amazon Bedrock",
      "tag": "in_db"
    },
    {
      "skill": "AWS Lambda",
      "tag": "in_db"
    },
    {
      "skill": "ECS",
      "tag": "in_db"
    },
    {
      "skill": "EKS",
      "tag": "in_db"
    },
    {
      "skill": "EC2",
      "tag": "in_db"
    },
    {
      "skill": "AWS Glue",
      "tag": "in_db"
    },
    {
      "skill": "Amazon Athena",
      "tag": "in_db"
    },
    {
      "skill": "Redshift",
      "tag": "in_db"
    },
    {
      "skill": "AWS Data Pipeline",
      "tag": "in_db"
    },
    {
      "skill": "S3",
      "tag": "in_db"
    },
    {
      "skill": "Kinesis",
      "tag": "in_db"
    },
    {
      "skill": "Amazon API Gateway",
      "tag": "in_db"
    },
    {
      "skill": "Python",
      "tag": "in_db"
    },
    {
      "skill": "TensorFlow",
      "tag": "in_db"
    },
    {
      "skill": "PyTorch",
      "tag": "in_db"
    },
    {
      "skill": "Scikit-learn",
      "tag": "in_db"
    },
    {
      "skill": "GitHub Actions",
      "tag": "in_db"
    },
    {
      "skill": "Airflow",
      "tag": "in_db"
    },
    {
      "skill": "Terraform",
      "tag": "in_db"
    },
    {
      "skill": "Docker",
      "tag": "in_db"
    },
    {
      "skill": "Kubernetes",
      "tag": "in_db"
    },
    {
      "skill": "Pinecone",
      "tag": "in_db"
    },
    {
      "skill": "OpenSearch",
      "tag": "in_db"
    },
    {
      "skill": "FAISS",
      "tag": "in_db"
    }
  ],
  "persistence": {
    "items": [
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Platform Operations",
          "id": 26,
          "rationale": "Uses cloud provider services to support delivery and runtime environments. The focus is on consumer-level operation of cloud services rather than deep cloud architecture ownership.",
          "slug": "cloud-platform-operations",
          "source": "db"
        },
        "dimension_id": 26,
        "input_skill": "AWS",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "DevOps Engineer",
            "id": 1,
            "rationale": null,
            "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
            "slug": "devops-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 163,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Security Platforms",
          "id": 332,
          "rationale": "Cloud-native security products used to assess posture, detect misconfigurations, and monitor workloads across AWS, Azure, and GCP. This is a distinct product family because the role often works across multiple CNAPP/CSPM/CWPP offerings and cloud-native detectors.",
          "slug": "cloud-security-platforms",
          "source": "db"
        },
        "dimension_id": 332,
        "input_skill": "AWS",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cybersecurity Engineer",
            "id": 9,
            "rationale": null,
            "role_archetype": null,
            "slug": "cybersecurity-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 163,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "AI Inference Cost, Latency, and Throughput Optimization",
          "id": 260,
          "rationale": "Improving the speed, throughput, and cost efficiency of AI and ML-powered product features without sacrificing correctness or user experience. Includes token budgeting, prompt compression, batching, caching, model selection, quantization, pruning, async inference, warm starts, streaming UX, timeout tuning, concurrency control, and profiling. Excludes infrastructure autoscaling, model serving capacity planning, generic backend performance tuning, and unrelated data/warehouse optimization.",
          "slug": "ai-inference-cost-latency-and-throughput-optimization",
          "source": "db"
        },
        "dimension_id": 260,
        "input_skill": "AI/ML",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "AI Engineer",
            "id": 12,
            "rationale": null,
            "role_archetype": null,
            "slug": "ai-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2611,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "AI Service Architecture Patterns",
          "id": 270,
          "rationale": "Structuring AI capabilities within the product and service landscape. This includes deciding whether AI logic lives in handlers, workers, gateways, or dedicated orchestration services.",
          "slug": "ai-service-architecture-patterns",
          "source": "db"
        },
        "dimension_id": 270,
        "input_skill": "AI/ML",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "AI Engineer",
            "id": 12,
            "rationale": null,
            "role_archetype": null,
            "slug": "ai-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2611,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Applied Machine Learning Toolkits",
          "id": 94,
          "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
          "slug": "applied-machine-learning-toolkits",
          "source": "db"
        },
        "dimension_id": 94,
        "input_skill": "AI/ML",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Scientist",
            "id": 7,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-scientist",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2611,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Managed ML Platform Workflows",
          "id": 367,
          "rationale": "Cloud ML platforms for building and operating models end-to-end, including notebooks, experiments, managed training jobs, and pipeline/studio workflows. Examples: SageMaker Studio, SageMaker notebooks, SageMaker Pipelines, managed training jobs.",
          "slug": "d_split_01_01",
          "source": "db"
        },
        "dimension_id": 367,
        "input_skill": "Amazon SageMaker",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2612,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Managed Model Hosting and Endpoints",
          "id": 368,
          "rationale": "Cloud-managed services for deploying trained models as online or batch inference endpoints, including endpoint provisioning, batch transform, and rollout coordination. Examples: SageMaker endpoints, SageMaker batch transform.",
          "slug": "d_split_01_02",
          "source": "db"
        },
        "dimension_id": 368,
        "input_skill": "Amazon SageMaker",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2612,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Model Serving Deployment and Runtime Packaging",
          "id": 52,
          "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
          "slug": "model-serving-deployment-and-runtime-packaging",
          "source": "db"
        },
        "dimension_id": 52,
        "input_skill": "Amazon SageMaker",
        "llm_role": null,
        "matched_chosen_role": true,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension saved",
        "role_dimension_saved": true,
        "roles_from_db": [
          {
            "display_name": "MLOps Engineer",
            "id": 5,
            "rationale": null,
            "role_archetype": null,
            "slug": "mlops-engineer",
            "source": "db"
          },
          {
            "display_name": "Machine Learning Engineer",
            "id": 10,
            "rationale": null,
            "role_archetype": null,
            "slug": "machine-learning-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2612,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Model Runtime Services",
          "id": 121,
          "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
          "slug": "cloud-model-runtime-services",
          "source": "db"
        },
        "dimension_id": 121,
        "input_skill": "Amazon Bedrock",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Machine Learning Engineer",
            "id": 10,
            "rationale": null,
            "role_archetype": null,
            "slug": "machine-learning-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2613,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Data Platform Services",
          "id": 81,
          "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
          "slug": "cloud-data-platform-services",
          "source": "db"
        },
        "dimension_id": 81,
        "input_skill": "AWS Lambda",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2614,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Version Control Systems",
          "id": 365,
          "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
          "slug": "d_init_01",
          "source": "db"
        },
        "dimension_id": 365,
        "input_skill": "ECS",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2615,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Model Runtime Services",
          "id": 121,
          "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
          "slug": "cloud-model-runtime-services",
          "source": "db"
        },
        "dimension_id": 121,
        "input_skill": "EKS",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Machine Learning Engineer",
            "id": 10,
            "rationale": null,
            "role_archetype": null,
            "slug": "machine-learning-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 725,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Orchestration Platforms",
          "id": 25,
          "rationale": "Operates the platforms that schedule and run containerized workloads and related deployment primitives. This is separate from image delivery because it concerns runtime placement and service rollout behavior.",
          "slug": "orchestration-platforms",
          "source": "db"
        },
        "dimension_id": 25,
        "input_skill": "EKS",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cloud Engineer",
            "id": 18,
            "rationale": null,
            "role_archetype": null,
            "slug": "cloud-engineer",
            "source": "db"
          },
          {
            "display_name": "DevOps Engineer",
            "id": 1,
            "rationale": null,
            "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
            "slug": "devops-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 725,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Provider Core Services",
          "id": 290,
          "rationale": "Core managed services used to provision and operate cloud environments. This is the base cloud surface for compute, storage, networking, and platform primitives the role configures and maintains.",
          "slug": "cloud-provider-core-services",
          "source": "db"
        },
        "dimension_id": 290,
        "input_skill": "EC2",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cloud Engineer",
            "id": 18,
            "rationale": null,
            "role_archetype": null,
            "slug": "cloud-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 1773,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Data Platform Services",
          "id": 81,
          "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
          "slug": "cloud-data-platform-services",
          "source": "db"
        },
        "dimension_id": 81,
        "input_skill": "AWS Glue",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 466,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Data Platform Security and Networking",
          "id": 369,
          "rationale": "Identity, access, secrets, and networking primitives used to support cloud data platforms and pipelines.",
          "slug": "d_split_01_04",
          "source": "db"
        },
        "dimension_id": 369,
        "input_skill": "Amazon Athena",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2616,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Data Platform Services",
          "id": 81,
          "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
          "slug": "cloud-data-platform-services",
          "source": "db"
        },
        "dimension_id": 81,
        "input_skill": "Amazon Athena",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2616,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Managed ML Platform Workflows",
          "id": 367,
          "rationale": "Cloud ML platforms for building and operating models end-to-end, including notebooks, experiments, managed training jobs, and pipeline/studio workflows. Examples: SageMaker Studio, SageMaker notebooks, SageMaker Pipelines, managed training jobs.",
          "slug": "d_split_01_01",
          "source": "db"
        },
        "dimension_id": 367,
        "input_skill": "Amazon Athena",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2616,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Data Warehousing Platforms",
          "id": 72,
          "rationale": "Cloud and on-prem analytical storage systems used to persist curated datasets and serve downstream consumers. This cluster is about the warehouse/lakehouse layer where transformed data is organized for access.",
          "slug": "data-warehousing-platforms",
          "source": "db"
        },
        "dimension_id": 72,
        "input_skill": "Redshift",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2570,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Data Platform Services",
          "id": 81,
          "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
          "slug": "cloud-data-platform-services",
          "source": "db"
        },
        "dimension_id": 81,
        "input_skill": "AWS Data Pipeline",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2617,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Storage Provisioning and Automation",
          "id": 311,
          "rationale": "Covers the scripts, APIs, and operational workflows used to create, resize, map, and retire storage resources. This cluster is coherent because storage engineers often automate repetitive provisioning and maintenance tasks.",
          "slug": "storage-provisioning-and-automation",
          "source": "db"
        },
        "dimension_id": 311,
        "input_skill": "S3",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Storage Engineer",
            "id": 22,
            "rationale": null,
            "role_archetype": null,
            "slug": "storage-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2618,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Streaming Data Processing",
          "id": 69,
          "rationale": "Tools and patterns for ingesting and transforming event streams with low latency. This cluster covers continuous processing, windowing, and stateful stream jobs used to keep data fresh.",
          "slug": "streaming-data-processing",
          "source": "db"
        },
        "dimension_id": 69,
        "input_skill": "Kinesis",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2619,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Web Service Frameworks",
          "id": 141,
          "rationale": "Server frameworks used to build HTTP APIs, route requests, validate inputs, and structure backend application code. This cluster is coherent because it defines how backend services expose behavior to clients and other services.",
          "slug": "web-service-frameworks",
          "source": "db"
        },
        "dimension_id": 141,
        "input_skill": "Amazon API Gateway",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Backend Engineer",
            "id": 14,
            "rationale": null,
            "role_archetype": null,
            "slug": "backend-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2620,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Analytical Programming Languages",
          "id": 82,
          "rationale": "Languages used to clean, transform, analyze, and prototype models in notebooks and scripts. This is the core coding surface for expressing statistical logic and data manipulation in a reproducible way.",
          "slug": "analytical-programming-languages",
          "source": "db"
        },
        "dimension_id": 82,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Analyst",
            "id": 20,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-analyst",
            "source": "db"
          },
          {
            "display_name": "Data Scientist",
            "id": 7,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-scientist",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Automation Scripting and CLI",
          "id": 48,
          "rationale": "Uses scripts and command-line tooling to execute repeatable Azure operations and reduce manual work. This is a practical cluster because the role frequently automates provisioning, checks, and remediation tasks.",
          "slug": "automation-scripting-and-cli",
          "source": "db"
        },
        "dimension_id": 48,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Azure Cloud Engineer",
            "id": 4,
            "rationale": null,
            "role_archetype": null,
            "slug": "azure-cloud-engineer",
            "source": "db"
          },
          {
            "display_name": "Cloud Engineer",
            "id": 18,
            "rationale": null,
            "role_archetype": null,
            "slug": "cloud-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Automation and Scripting for Operations",
          "id": 361,
          "rationale": "Scripts and lightweight automation used to execute repetitive virtualization tasks and enforce operational consistency. This is the practical glue that reduces manual host and VM administration.",
          "slug": "automation-and-scripting-for-operations",
          "source": "db"
        },
        "dimension_id": 361,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Virtualization Engineer",
            "id": 26,
            "rationale": null,
            "role_archetype": null,
            "slug": "virtualization-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Network Automation and Scripting",
          "id": 285,
          "rationale": "Covers scripts and automation used to configure, validate, and audit network devices and services. This cluster is coherent because repeatable network operations increasingly depend on programmatic changes and checks.",
          "slug": "network-automation-and-scripting",
          "source": "db"
        },
        "dimension_id": 285,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Network Engineer",
            "id": 21,
            "rationale": null,
            "role_archetype": null,
            "slug": "network-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for AI Workflows",
          "id": 261,
          "rationale": "Languages used to implement AI feature logic, orchestration, and response handling inside product code. This is the core coding surface for turning prompts and model calls into reliable application behavior.",
          "slug": "programming-languages-for-ai-workflows",
          "source": "db"
        },
        "dimension_id": 261,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "AI Engineer",
            "id": 12,
            "rationale": null,
            "role_archetype": null,
            "slug": "ai-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for Backend Systems",
          "id": 140,
          "rationale": "Languages used to implement server-side business logic, request handlers, workers, and service integrations. This is the core coding surface for backend feature delivery and maintenance.",
          "slug": "programming-languages-for-backend-systems",
          "source": "db"
        },
        "dimension_id": 140,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Backend Engineer",
            "id": 14,
            "rationale": null,
            "role_archetype": null,
            "slug": "backend-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for Data Work",
          "id": 67,
          "rationale": "Languages used to implement data pipelines, transformations, and operational utilities. This is the code layer for expressing extraction, parsing, validation, and orchestration logic in data engineering workflows.",
          "slug": "programming-languages-for-data-work",
          "source": "db"
        },
        "dimension_id": 67,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for ML Systems",
          "id": 113,
          "rationale": "Languages used to implement model integration code, inference services, and feature-processing logic. This is the core coding surface for turning trained models into product-facing software components.",
          "slug": "programming-languages-for-ml-systems",
          "source": "db"
        },
        "dimension_id": 113,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Machine Learning Engineer",
            "id": 10,
            "rationale": null,
            "role_archetype": null,
            "slug": "machine-learning-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for Security Work",
          "id": 328,
          "rationale": "Languages used to automate security tasks, write detection logic, and build analysis or remediation tooling. This is the core coding surface for a cybersecurity engineer across scripts, queries, and small utilities.",
          "slug": "programming-languages-for-security-work",
          "source": "db"
        },
        "dimension_id": 328,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cybersecurity Engineer",
            "id": 9,
            "rationale": null,
            "role_archetype": null,
            "slug": "cybersecurity-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for Test Automation",
          "id": 193,
          "rationale": "Languages used to implement automated checks, helper utilities, and test harness code. This is the core coding surface for turning test ideas into maintainable automation.",
          "slug": "programming-languages-for-test-automation",
          "source": "db"
        },
        "dimension_id": 193,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Automation Tester",
            "id": 16,
            "rationale": null,
            "role_archetype": null,
            "slug": "automation-tester",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Security Automation and Scripting",
          "id": 258,
          "rationale": "Automating repeatable security checks, enrichment, and remediation workflows. This cluster is coherent because the role often needs lightweight automation to scale analysis and response.",
          "slug": "security-automation-and-scripting",
          "source": "db"
        },
        "dimension_id": 258,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cybersecurity Engineer",
            "id": 9,
            "rationale": null,
            "role_archetype": null,
            "slug": "cybersecurity-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Applied Machine Learning Toolkits",
          "id": 94,
          "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
          "slug": "applied-machine-learning-toolkits",
          "source": "db"
        },
        "dimension_id": 94,
        "input_skill": "TensorFlow",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Scientist",
            "id": 7,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-scientist",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 558,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Applied Machine Learning Toolkits",
          "id": 94,
          "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
          "slug": "applied-machine-learning-toolkits",
          "source": "db"
        },
        "dimension_id": 94,
        "input_skill": "PyTorch",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Scientist",
            "id": 7,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-scientist",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 557,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Applied Machine Learning Toolkits",
          "id": 94,
          "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
          "slug": "applied-machine-learning-toolkits",
          "source": "db"
        },
        "dimension_id": 94,
        "input_skill": "Scikit-learn",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Scientist",
            "id": 7,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-scientist",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 554,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Continuous Integration Test Integration",
          "id": 207,
          "rationale": "Integrating automated checks into shared build and merge workflows so results are repeatable and visible. This cluster is coherent because automation testers commonly configure test execution triggers, artifacts, and reporting hooks.",
          "slug": "continuous-integration-test-integration",
          "source": "db"
        },
        "dimension_id": 207,
        "input_skill": "GitHub Actions",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Automation Tester",
            "id": 16,
            "rationale": null,
            "role_archetype": null,
            "slug": "automation-tester",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 1250,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Workflow Orchestration Systems",
          "id": 64,
          "rationale": "Operational orchestration of ML jobs, dependencies, and handoffs across training, validation, deployment, and retraining. This is a useful split from training pipelines because it emphasizes the scheduler and control plane.",
          "slug": "workflow-orchestration-systems",
          "source": "db"
        },
        "dimension_id": 64,
        "input_skill": "Airflow",
        "llm_role": null,
        "matched_chosen_role": true,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension saved",
        "role_dimension_saved": true,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          },
          {
            "display_name": "MLOps Engineer",
            "id": 5,
            "rationale": null,
            "role_archetype": null,
            "slug": "mlops-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 325,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Infrastructure Provisioning Templates",
          "id": 291,
          "rationale": "Declarative templates and modules used to create repeatable cloud resources and environments. This cluster covers the infrastructure definitions the role applies, reviews, and updates to keep environments consistent.",
          "slug": "infrastructure-provisioning-templates",
          "source": "db"
        },
        "dimension_id": 291,
        "input_skill": "Terraform",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cloud Engineer",
            "id": 18,
            "rationale": null,
            "role_archetype": null,
            "slug": "cloud-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 144,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Infrastructure as Code",
          "id": 22,
          "rationale": "Defines infrastructure and platform resources through versioned code so environments are repeatable and reviewable. This is a coherent cluster because it underpins environment consistency and change control.",
          "slug": "infrastructure-as-code",
          "source": "db"
        },
        "dimension_id": 22,
        "input_skill": "Terraform",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "DevOps Engineer",
            "id": 1,
            "rationale": null,
            "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
            "slug": "devops-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 144,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Infrastructure as Code and Declarative Provisioning",
          "id": 36,
          "rationale": "Defines cloud and platform infrastructure declaratively through versioned code so environments are repeatable, reviewable, and automatable. This includes authoring and maintaining IaC templates/modules, managing parameters and state, and using plan/apply workflows to provision and update resources across Azure and other cloud platforms.",
          "slug": "infrastructure-as-code-and-declarative-provisioning",
          "source": "db"
        },
        "dimension_id": 36,
        "input_skill": "Terraform",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Azure Cloud Engineer",
            "id": 4,
            "rationale": null,
            "role_archetype": null,
            "slug": "azure-cloud-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 144,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Containerization and Image Delivery",
          "id": 24,
          "rationale": "Builds, packages, and ships application and support workloads as container images. This cluster covers the artifact format and the mechanics of producing deployable images.",
          "slug": "containerization-and-image-delivery",
          "source": "db"
        },
        "dimension_id": 24,
        "input_skill": "Docker",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "DevOps Engineer",
            "id": 1,
            "rationale": null,
            "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
            "slug": "devops-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 153,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Model Serving Deployment and Runtime Packaging",
          "id": 52,
          "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
          "slug": "model-serving-deployment-and-runtime-packaging",
          "source": "db"
        },
        "dimension_id": 52,
        "input_skill": "Docker",
        "llm_role": null,
        "matched_chosen_role": true,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension saved",
        "role_dimension_saved": true,
        "roles_from_db": [
          {
            "display_name": "MLOps Engineer",
            "id": 5,
            "rationale": null,
            "role_archetype": null,
            "slug": "mlops-engineer",
            "source": "db"
          },
          {
            "display_name": "Machine Learning Engineer",
            "id": 10,
            "rationale": null,
            "role_archetype": null,
            "slug": "machine-learning-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 153,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Orchestration Platforms",
          "id": 25,
          "rationale": "Operates the platforms that schedule and run containerized workloads and related deployment primitives. This is separate from image delivery because it concerns runtime placement and service rollout behavior.",
          "slug": "orchestration-platforms",
          "source": "db"
        },
        "dimension_id": 25,
        "input_skill": "Kubernetes",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cloud Engineer",
            "id": 18,
            "rationale": null,
            "role_archetype": null,
            "slug": "cloud-engineer",
            "source": "db"
          },
          {
            "display_name": "DevOps Engineer",
            "id": 1,
            "rationale": null,
            "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
            "slug": "devops-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 158,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Model Runtime Services",
          "id": 121,
          "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
          "slug": "cloud-model-runtime-services",
          "source": "db"
        },
        "dimension_id": 121,
        "input_skill": "Pinecone",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Machine Learning Engineer",
            "id": 10,
            "rationale": null,
            "role_archetype": null,
            "slug": "machine-learning-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2621,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Data Platform Services",
          "id": 81,
          "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
          "slug": "cloud-data-platform-services",
          "source": "db"
        },
        "dimension_id": 81,
        "input_skill": "OpenSearch",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2622,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Version Control Systems",
          "id": 365,
          "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
          "slug": "d_init_01",
          "source": "db"
        },
        "dimension_id": 365,
        "input_skill": "OpenSearch",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2622,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Applied Machine Learning Toolkits",
          "id": 94,
          "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
          "slug": "applied-machine-learning-toolkits",
          "source": "db"
        },
        "dimension_id": 94,
        "input_skill": "FAISS",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Scientist",
            "id": 7,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-scientist",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2623,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Version Control Systems",
          "id": 365,
          "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
          "slug": "d_init_01",
          "source": "db"
        },
        "dimension_id": 365,
        "input_skill": "FAISS",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2623,
        "skill_tag": "in_db",
        "skipped_reason": null
      }
    ],
    "new_skills_created": 0,
    "role_dimension_saved": 0,
    "skill_dimension_saved": 0,
    "skipped": 0
  },
  "planner_output": null,
  "run_id": "832dda01-52e1-4739-b122-0a9cf9bc8100"
}

LLM Calls

Every model call made for this run, in pipeline order. Click a card to see the model's response.

Loading…