← Back to history

Pipeline run

832dda01-52e1-4739-b122-0a9cf9bc8100

Pipeline LLM cost (USD)
API 1: $0.0005 API 2: $0.0006 API 3: $0.0342 Total: $0.0353

Client output enrichment

v2 Skill cluster · Nature of work · AI index · Tech stack maturity · Evidence · KRA description
Nature of work no kras
Vague JD — no KRAs present to derive a specific nature of work.
Tech stack maturity
Mainstream Modern
AI index (0 = no AI use, 5 = totally AI-dependent · v2.1)
1.70 / 5
· Title match
Has AI skill
AI skill (primary)
· AI skill (secondary)
· On AI team
· Builds AI products
vocab breakdown (legacy)
Assistants (×1):
Frameworks (×2): Bedrock, Pinecone
Models / concepts (×3): RAG, LLM, LLMs, MLOps, AI, ML, AI/ML, GenAI, Generative AI, Machine Learning
Evidence — skills matched in JD (27)
AWS AI/ML Amazon SageMaker Amazon Bedrock AWS Lambda ECS EKS EC2 AWS Glue Amazon Athena Redshift AWS Data Pipeline S3 Kinesis Amazon API Gateway Python TensorFlow PyTorch Scikit-learn GitHub Actions Airflow Terraform Docker Kubernetes Pinecone +2
Skill cluster (8 dimension groups, role-scoped)
Cloud Platforms
AWS AWS Lambda Redshift S3
ML Frameworks and Libraries
PyTorch FAISS
Container Orchestration Platforms
Kubernetes
Containerization and Image Builds
Docker
Infrastructure as Code
Terraform
Programming Languages and Scripting
Python
Vector Databases
Pinecone
Cross-cutting / unaligned
AI/ML Amazon SageMaker Amazon Bedrock ECS EKS EC2 AWS Glue Amazon Athena AWS Data Pipeline Kinesis Amazon API Gateway TensorFlow Scikit-learn GitHub Actions Airflow OpenSearch
Status: completed Created: 2026-05-12T10:44:01.066686Z Updated: 2026-05-12T10:45:06.827716Z API 3 duration: 61171 ms
Flow Current 3-step pipeline

1 POST /skills/extract-from-jd

2 POST /skills/extract-details

3 POST /skills/final-role-output

Role Chosen role & resolution

MLOps Engineer

slug: mlops-engineer · id: 5 · source: db

The primary skills indicate a strong focus on AWS and AI/ML technologies, which align well with the responsibilities of an MLOps Engineer.

Resolution: in_db — role exists in library; skill↔dim and role↔dim links saved when applicable.

0
New skills
0
Skill↔dim saved
0
Role↔dim saved
0
Skipped

Job description

About the job
At Capgemini Invent, we believe difference drives change. As inventive transformation consultants, we blend our strategic, creative and scientific capabilities, collaborating closely with clients to deliver cutting-edge solutions. Join us to drive transformation tailored to our client's challenges of today and tomorrow. Informed and validated by science and data. Superpowered by creativity and design. All underpinned by technology created with purpose.

Your Role

We are seeking a highly skilled Solution Architect – AWS Cloud & AI/ML to design, architect, and implement advanced AI/ML and generative AI solutions on the AWS platform. The ideal candidate will have deep expertise in large-scale distributed systems, modern AI/ML architectures, LLMs, data engineering pipelines, and AWS-native services. This role involves partnering with cross-functional teams, understanding business challenges, and crafting end‑to‑end scalable, secure, and cost‑optimized solutions

Architect and deliver end‑to‑end AI/ML solutions on AWS, covering data ingestion, training, inference, orchestration, monitoring, and governance.
Design and integrate LLM‑based and Generative AI solutions, including retrieval-augmented generation (RAG), prompt workflows, and production deployment strategies.
Develop feature engineering strategies and scalable data pipelines to support ML training and real-time inference workloads.
Lead technical discussions and provide guidance on AI/ML best practices, model lifecycle, optimization, MLOps, and model governance.
Design highly scalable, secure, and cost-efficient architectures using:
Amazon SageMaker (Training Jobs, Inference Endpoints, Pipelines, Feature Store, Model Registry)
Amazon Bedrock (Foundation models, Generative AI orchestration, prompt management)
AWS Lambda, ECS, EKS, EC2 for building and orchestrating distributed AI workloads.
Architect and optimize data engineering platforms using:
AWS Glue, Amazon Athena, Redshift, AWS Data Pipeline, S3, Kinesis, and related services.
Build secure, production-grade API services for AI model inference using Amazon API Gateway and AWS compute services.
Your Profile

8+ years of experience in cloud architecture, with at least 5 years in AWS.
Strong expertise in:
Machine Learning, MLOps, and GenAI solution design.
Amazon SageMaker (end‑to‑end ML lifecycle).
Amazon Bedrock and modern LLM architectures.
Data engineering with Glue, Redshift, Athena, and pipeline orchestration.
Experience containerizing and scaling AI workloads on Lambda/ECS/EKS.
Strong coding experience in Python and familiarity with ML frameworks (TensorFlow, PyTorch, Scikit‑learn).
Deep understanding of security, networking, IAM, and compliance best practices for AI systems.
Excellent communication, design thinking, and stakeholder management skills.
AWS certifications (e.g., AWS Certified Solutions Architect – Professional, Machine Learning – Specialty).
Experience with vector databases (e.g., Pinecone, OpenSearch, FAISS).
Experience building RAG pipelines, multi‑agent orchestration frameworks, or custom LLM fine‑tuning workflows.
Familiarity with DevOps/MLOps tools: GitHub Actions, Airflow, Terraform, Docker, Kubernetes
Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world, while creating tangible impact for enterprises and society. It is a responsible and diverse group of 340,000 team members in more than 50 countries. With its strong over 55-year heritage, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fueled by its market leading capabilities in AI, generative AI, cloud and data, combined with its deep industry expertise and partner ecosystem.

Skills from this JD

Each row merges API 1 extraction, API 2 library match / v3 orchestration (dimensions + locked dims), and API 3 persistence tags.

AWS Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: AWS id=163 · aws

Aliases — catalog

  • Compaction (CANONICAL) primary

Context tags (catalog)

Bloom filter LSM tree SSTable checkpointing defragmentation garbage collection leveling log-structured merge policy segment merge storage engine tiered compaction tombstones vacuum write amplification

Stored enrichment (catalog DB)

Category
Concept
Sub-category
Storage Maintenance Concept
Confidence
0.74
Version strategy
NOT_APPLICABLE

Maturity reasoning: Compaction is a standard storage-maintenance concept in widely used systems like LSM databases and Kafka; it appears in many JDs for Cassandra, RocksDB, and Kafka ops roles, indicating broad market demand.

Skill profile (library / DB)

Skill nature
PLATFORM
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
13
Sub-category id
161
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Cloud Platform Operations Catalog dimension db id 26

    Library dimension (catalog)

    Roles linked in library: DevOps Engineer

  • Cloud Security Platforms Catalog dimension db id 332

    Library dimension (catalog)

    Roles linked in library: Cybersecurity Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Cloud Platform Operations
cloud-platform-operations
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Cloud Security Platforms
cloud-security-platforms
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AI/ML Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: AI/ML id=2611 · ai-ml

Aliases — from this run (catalog unavailable)

  • AI/ML (CANONICAL)

Skill profile (library / DB)

Skill nature
CONCEPT
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
16
Sub-category id
2131
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • AI Inference Cost, Latency, and Throughput Optimization Catalog dimension db id 260

    Library dimension (catalog)

    Roles linked in library: AI Engineer

  • AI Service Architecture Patterns Catalog dimension db id 270

    Library dimension (catalog)

    Roles linked in library: AI Engineer

  • Applied Machine Learning Toolkits Catalog dimension db id 94

    Library dimension (catalog)

    Roles linked in library: Data Scientist

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
AI Inference Cost, Latency, and Throughput Optimization
ai-inference-cost-latency-and-throughput-optimization
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AI Service Architecture Patterns
ai-service-architecture-patterns
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Applied Machine Learning Toolkits
applied-machine-learning-toolkits
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon SageMaker Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: Amazon SageMaker id=2612 · amazon-sagemaker

Aliases — from this run (catalog unavailable)

  • Amazon SageMaker (CANONICAL)

Skill profile (library / DB)

Skill nature
PLATFORM
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
13
Sub-category id
326
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Managed ML Platform Workflows Catalog dimension db id 367

    Library dimension (catalog)

  • Managed Model Hosting and Endpoints Catalog dimension db id 368

    Library dimension (catalog)

  • Model Serving Deployment and Runtime Packaging Catalog dimension db id 52

    Library dimension (catalog)

    Roles linked in library: MLOps Engineer, Machine Learning Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Managed ML Platform Workflows
d_split_01_01
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Managed Model Hosting and Endpoints
d_split_01_02
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Model Serving Deployment and Runtime Packaging
model-serving-deployment-and-runtime-packaging
Existing dimension (library) · Role↔dimension saved
Amazon Bedrock Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: Amazon Bedrock id=2613 · amazon-bedrock

Aliases — catalog

  • source maps (CANONICAL) primary

Context tags (catalog)

Chrome Firefox JavaScript TypeScript browser dev tools build tools code quality debugging dev environment development workflow error tracking frontend development mapping minification performance optimization source code sourcemap-loader transpilation webpack

Stored enrichment (catalog DB)

Category
Format
Sub-category
Debug Symbol Mapping Format
Confidence
0.90
Version strategy
NOT_APPLICABLE

Maturity reasoning: Source maps are a standard web-debugging format supported by major bundlers and browsers; they appear routinely in frontend job descriptions and tooling docs for webpack, Vite, and Chrome DevTools.

Skill profile (library / DB)

Skill nature
CLOUD_SERVICE
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
14
Sub-category id
2132
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Cloud Model Runtime Services Catalog dimension db id 121

    Library dimension (catalog)

    Roles linked in library: Machine Learning Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Cloud Model Runtime Services
cloud-model-runtime-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AWS Lambda Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: AWS Lambda id=2614 · aws-lambda

Aliases — from this run (catalog unavailable)

  • AWS Lambda (CANONICAL)

Skill profile (library / DB)

Skill nature
CLOUD_SERVICE
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
14
Sub-category id
262
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Cloud Data Platform Services Catalog dimension db id 81

    Library dimension (catalog)

    Roles linked in library: Data Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Cloud Data Platform Services
cloud-data-platform-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
ECS Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: ECS id=2615 · ecs

Aliases — catalog

  • release validation (CANONICAL) primary

Context tags (catalog)

QA processes artifact repository build verification compliance checks compliance standards continuous integration defect tracking deployment pipeline deployment pipelines environment setup integration testing production readiness quality assurance regression testing release management release notes rollback procedures smoke testing staging environments test automation test cases user acceptance testing version control

Stored enrichment (catalog DB)

Category
Methodology
Sub-category
Validation Process
Confidence
0.82
Version strategy
NOT_APPLICABLE

Maturity reasoning: Common release-gate practice in JDs for QA/DevOps/SRE roles; often listed alongside CI/CD, smoke tests, and canary/rollback checks in production release pipelines.

Skill profile (library / DB)

Skill nature
CONCEPT
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
2
Sub-category id
2133
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Version Control Systems Catalog dimension db id 365

    Library dimension (catalog)

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Version Control Systems
d_init_01
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
EKS Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: EKS id=725 · eks

Aliases — catalog

  • Ansible playbooks (CANONICAL) primary

Context tags (catalog)

Jinja2 YAML ansible-galaxy ansible-vault collections group_vars handlers host_vars idempotent inventory playbook roles tasks templates vars

Stored enrichment (catalog DB)

Category
Format
Sub-category
Automation Playbook Format
Vendor
Red Hat
License
gpl_v3
Year introduced
2012
Confidence
0.88
Version strategy
NOT_APPLICABLE

Maturity reasoning: Common in DevOps JDs and widely used for infrastructure automation; Red Hat/Ansible remains a standard tool in hiring pipelines, with playbooks the core format.

Skill profile (library / DB)

Skill nature
CLOUD_SERVICE
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
14
Sub-category id
251
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Cloud Model Runtime Services Catalog dimension db id 121

    Library dimension (catalog)

    Roles linked in library: Machine Learning Engineer

  • Orchestration Platforms Catalog dimension db id 25

    Library dimension (catalog)

    Roles linked in library: Cloud Engineer, DevOps Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Cloud Model Runtime Services
cloud-model-runtime-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Orchestration Platforms
orchestration-platforms
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
EC2 Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: EC2 id=1773 · ec2

Aliases — from this run (catalog unavailable)

  • EC2 (CANONICAL) primary

Skill profile (library / DB)

Skill nature
CLOUD_SERVICE
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
14
Sub-category id
1544
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Cloud Provider Core Services Catalog dimension db id 290

    Library dimension (catalog)

    Roles linked in library: Cloud Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Cloud Provider Core Services
cloud-provider-core-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AWS Glue Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: AWS Glue id=466 · aws-glue

Aliases — from this run (catalog unavailable)

  • AWS Glue (CANONICAL) primary

Skill profile (library / DB)

Skill nature
CLOUD_SERVICE
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
14
Sub-category id
385
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Cloud Data Platform Services Catalog dimension db id 81

    Library dimension (catalog)

    Roles linked in library: Data Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Cloud Data Platform Services
cloud-data-platform-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon Athena Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: Amazon Athena id=2616 · amazon-athena

Aliases — catalog

  • AWS Amplify Console (CANONICAL) primary

Context tags (catalog)

AWS services CI/CD Git integration GraphQL React Vue authentication backend build settings cloud infrastructure custom domains deployment deployment pipelines environment variables frontend hosting monitoring serverless static web apps version control

Stored enrichment (catalog DB)

Category
Service
Sub-category
Managed Deployment Service
Vendor
Amazon Web Services
License
apache_2
Year introduced
2019
Confidence
0.95
Version strategy
NOT_APPLICABLE

Maturity reasoning: Appears in growing frontend/CI-CD job postings and AWS docs, but is still far less common than GitHub Actions, Vercel, or AWS CodePipeline in JDs.

Skill profile (library / DB)

Skill nature
CLOUD_SERVICE
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
14
Sub-category id
388
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Cloud Data Platform Security and Networking Catalog dimension db id 369

    Library dimension (catalog)

  • Cloud Data Platform Services Catalog dimension db id 81

    Library dimension (catalog)

    Roles linked in library: Data Engineer

  • Managed ML Platform Workflows Catalog dimension db id 367

    Library dimension (catalog)

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Cloud Data Platform Security and Networking
d_split_01_04
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Cloud Data Platform Services
cloud-data-platform-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Managed ML Platform Workflows
d_split_01_01
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Redshift Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: Redshift id=2570 · redshift

Aliases — from this run (catalog unavailable)

  • Redshift (CANONICAL)

Skill profile (library / DB)

Skill nature
PLATFORM
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
13
Sub-category id
2098
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Data Warehousing Platforms Catalog dimension db id 72

    Library dimension (catalog)

    Roles linked in library: Data Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Data Warehousing Platforms
data-warehousing-platforms
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AWS Data Pipeline Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: AWS Data Pipeline id=2617 · aws-data-pipeline

Aliases — from this run (catalog unavailable)

  • AWS Data Pipeline (CANONICAL)

Skill profile (library / DB)

Skill nature
CLOUD_SERVICE
Volatility
STABLE
Typical lifespan
SHORT_LIVED
Category id
14
Sub-category id
2134
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Cloud Data Platform Services Catalog dimension db id 81

    Library dimension (catalog)

    Roles linked in library: Data Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Cloud Data Platform Services
cloud-data-platform-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
S3 Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: S3 id=2618 · s3

Aliases — catalog

  • Netlify Platform (CANONICAL) primary

Context tags (catalog)

API integrations Git integration Jamstack Netlify CMS build plugins content management continuous deployment custom domains deploy previews edge handlers environment variables form handling redirects serverless functions site analytics

Stored enrichment (catalog DB)

Category
Platform
Sub-category
Vendor Saas Platform
Vendor
Netlify, Inc.
License
other_open
Year introduced
2014
Confidence
0.98
Version strategy
NOT_APPLICABLE

Maturity reasoning: Netlify appears in many modern frontend/Jamstack job descriptions, but far less universally than AWS or Kubernetes; its usage is concentrated in static-site and edge-deploy workflows rather than broad platform roles.

Skill profile (library / DB)

Skill nature
PLATFORM
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
13
Sub-category id
2135
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Storage Provisioning and Automation Catalog dimension db id 311

    Library dimension (catalog)

    Roles linked in library: Storage Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Storage Provisioning and Automation
storage-provisioning-and-automation
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Kinesis Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: Kinesis id=2619 · kinesis

Aliases — catalog

  • Vercel Edge Network (CANONICAL) primary

Context tags (catalog)

API routes CDN ISR Next.js SSR deployment dynamic content edge computing edge functions global distribution performance optimization real-time caching serverless static site generation webhooks

Stored enrichment (catalog DB)

Category
Service
Sub-category
Edge Delivery Service
Vendor
Vercel, Inc.
License
other_open
Year introduced
2020
Confidence
0.90
Version strategy
NOT_APPLICABLE

Maturity reasoning: Appears in growing number of JDs for Next.js/edge-runtime roles and Vercel’s docs/launches show expanding adoption, but it’s still far less universal than AWS CloudFront or Cloudflare.

Skill profile (library / DB)

Skill nature
CLOUD_SERVICE
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
14
Sub-category id
2136
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Streaming Data Processing Catalog dimension db id 69

    Library dimension (catalog)

    Roles linked in library: Data Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Streaming Data Processing
streaming-data-processing
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon API Gateway Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: Amazon API Gateway id=2620 · amazon-api-gateway

Aliases — from this run (catalog unavailable)

  • Amazon API Gateway (CANONICAL)

Skill profile (library / DB)

Skill nature
CLOUD_SERVICE
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
14
Sub-category id
2137
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Web Service Frameworks Catalog dimension db id 141

    Library dimension (catalog)

    Roles linked in library: Backend Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Web Service Frameworks
web-service-frameworks
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python Primary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: Python id=393 · python

Aliases — catalog

  • Cobalt Strike (CANONICAL) primary

Context tags (catalog)

Malleable C2 beacon credential dumping kerberos lateral movement payload phishing post-exploitation privilege escalation psexec red team sleep mask smb stager team server

Stored enrichment (catalog DB)

Category
Tool
Sub-category
Adversary Simulation Tool
Vendor
Fortra
License
proprietary
Year introduced
2012
Confidence
0.98
Version strategy
NOT_APPLICABLE

Maturity reasoning: Appears in a limited set of red-team/pentest JDs and security vendor training, but far below mainstream devops tools; market signal is specialized adversary-simulation usage rather than broad hiring demand.

Skill profile (library / DB)

Skill nature
LANGUAGE
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
5
Sub-category id
54
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Analytical Programming Languages Catalog dimension db id 82

    Library dimension (catalog)

    Roles linked in library: Data Analyst, Data Scientist

  • Automation Scripting and CLI Catalog dimension db id 48

    Library dimension (catalog)

    Roles linked in library: Azure Cloud Engineer, Cloud Engineer

  • Automation and Scripting for Operations Catalog dimension db id 361

    Library dimension (catalog)

    Roles linked in library: Virtualization Engineer

  • Network Automation and Scripting Catalog dimension db id 285

    Library dimension (catalog)

    Roles linked in library: Network Engineer

  • Programming Languages for AI Workflows Catalog dimension db id 261

    Library dimension (catalog)

    Roles linked in library: AI Engineer

  • Programming Languages for Backend Systems Catalog dimension db id 140

    Library dimension (catalog)

    Roles linked in library: Backend Engineer

  • Programming Languages for Data Work Catalog dimension db id 67

    Library dimension (catalog)

    Roles linked in library: Data Engineer

  • Programming Languages for ML Systems Catalog dimension db id 113

    Library dimension (catalog)

    Roles linked in library: Machine Learning Engineer

  • Programming Languages for Security Work Catalog dimension db id 328

    Library dimension (catalog)

    Roles linked in library: Cybersecurity Engineer

  • Programming Languages for Test Automation Catalog dimension db id 193

    Library dimension (catalog)

    Roles linked in library: Automation Tester

  • Security Automation and Scripting Catalog dimension db id 258

    Library dimension (catalog)

    Roles linked in library: Cybersecurity Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Analytical Programming Languages
analytical-programming-languages
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Automation Scripting and CLI
automation-scripting-and-cli
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Automation and Scripting for Operations
automation-and-scripting-for-operations
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Network Automation and Scripting
network-automation-and-scripting
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for AI Workflows
programming-languages-for-ai-workflows
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for Backend Systems
programming-languages-for-backend-systems
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for Data Work
programming-languages-for-data-work
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for ML Systems
programming-languages-for-ml-systems
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for Security Work
programming-languages-for-security-work
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Programming Languages for Test Automation
programming-languages-for-test-automation
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Security Automation and Scripting
security-automation-and-scripting
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
TensorFlow Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: TensorFlow id=558 · tensorflow

Aliases — catalog

  • shader graphs (CANONICAL) primary

Context tags (catalog)

GLSL HLSL PBR UV mapping fragment shader material editor node graph node-based normal map procedural texturing render pipeline shader compiler tessellation vertex shader visual scripting

Stored enrichment (catalog DB)

Category
Framework
Sub-category
Visual Shader Authoring Framework
Vendor
Unity Technologies
License
proprietary
Year introduced
2018
Confidence
0.74
Version strategy
NOT_APPLICABLE

Maturity reasoning: Shader graphs appear in some Unity/Unreal and VFX job postings, but JD volume is far below core graphics skills like HLSL/GLSL; market use is concentrated in game/real-time rendering teams.

Skill profile (library / DB)

Skill nature
LIBRARY
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
6
Sub-category id
456
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Applied Machine Learning Toolkits Catalog dimension db id 94

    Library dimension (catalog)

    Roles linked in library: Data Scientist

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Applied Machine Learning Toolkits
applied-machine-learning-toolkits
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
PyTorch Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: PyTorch id=557 · pytorch

Aliases — catalog

  • GLSL (CANONICAL) primary

Context tags (catalog)

GPU HLSL OpenGL SPIR-V Vulkan WebGL compute shader fragment shader rendering pipeline shader shader pipeline texture sampling uniform varying vertex shader

Stored enrichment (catalog DB)

Category
Language
Sub-category
Shader Language
Vendor
Khronos Group
License
other_open
Year introduced
2004
Confidence
0.99
Version strategy
NOT_APPLICABLE

Maturity reasoning: GLSL appears in graphics/game-engine JDs but at much lower volume than mainstream languages; it’s specialized for shader programming and often replaced in newer pipelines by HLSL/Metal Shading Language or higher-level abstractions.

Skill profile (library / DB)

Skill nature
LIBRARY
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
6
Sub-category id
456
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Applied Machine Learning Toolkits Catalog dimension db id 94

    Library dimension (catalog)

    Roles linked in library: Data Scientist

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Applied Machine Learning Toolkits
applied-machine-learning-toolkits
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Scikit-learn Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: scikit-learn id=554 · scikit-learn

Aliases — catalog

  • post-processing (CANONICAL) primary

Context tags (catalog)

GPU anti-aliasing bloom color grading compositing depth of field fragment shader framebuffer image filtering motion blur render pipeline render target screen-space shader tone mapping

Stored enrichment (catalog DB)

Category
Concept
Sub-category
Graphics Effect Concept
Confidence
0.86
Version strategy
NOT_APPLICABLE

Maturity reasoning: Job postings rarely list "post-processing" as a standalone skill; it appears mainly in graphics/VFX roles, while broader JDs usually specify tools like Unreal/Unity or Photoshop instead.

Skill profile (library / DB)

Skill nature
LIBRARY
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
6
Sub-category id
458
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Applied Machine Learning Toolkits Catalog dimension db id 94

    Library dimension (catalog)

    Roles linked in library: Data Scientist

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Applied Machine Learning Toolkits
applied-machine-learning-toolkits
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
GitHub Actions Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: GitHub Actions id=1250 · github-actions

Aliases — catalog

  • E5 (CANONICAL) primary

Context tags (catalog)

cosine similarity data augmentation dimensionality reduction embedding space feature extraction fine-tuning model evaluation natural language processing nearest neighbors pre-trained models semantic search similarity scoring transfer learning transformer models vector embeddings

Stored enrichment (catalog DB)

Category
Library
Sub-category
Embedding Model Library
Vendor
OpenAI
License
other_open
Year introduced
2021
Confidence
0.80
Version strategy
NOT_APPLICABLE

Maturity reasoning: E5 is a specific embedding-model library with limited JD volume; market demand is concentrated in AI/ML roles rather than broad software hiring, unlike mainstream libraries.

Skill profile (library / DB)

Skill nature
CLOUD_SERVICE
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
14
Sub-category id
1019
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Continuous Integration Test Integration Catalog dimension db id 207

    Library dimension (catalog)

    Roles linked in library: Automation Tester

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Continuous Integration Test Integration
continuous-integration-test-integration
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Airflow Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: Airflow id=325 · airflow

Aliases — catalog

  • OpenVAS (CANONICAL) primary

Context tags (catalog)

CVE CVSS GVM Greenbone NVT asset discovery authenticated scan compliance scan network scan port scanning remediation reporting service detection unauthenticated scan vulnerability assessment

Stored enrichment (catalog DB)

Category
Tool
Sub-category
Vulnerability Scanner
Vendor
Greenbone Networks
License
gpl_v2
Year introduced
2009
Confidence
0.98
Version strategy
NOT_APPLICABLE

Maturity reasoning: OpenVAS appears in security-focused JDs far less often than mainstream scanners like Nessus or Qualys, and its usage is concentrated in pentest/vuln-management roles rather than general DevOps stacks.

Skill profile (library / DB)

Skill nature
TOOL
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
11
Sub-category id
335
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Workflow Orchestration Systems Catalog dimension db id 64

    Library dimension (catalog)

    Roles linked in library: Data Engineer, MLOps Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Workflow Orchestration Systems
workflow-orchestration-systems
Existing dimension (library) · Role↔dimension saved
Terraform Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: Terraform id=144 · terraform

Aliases — catalog

  • Snapshot loads (CANONICAL) primary

Context tags (catalog)

CDC ELT ETL SCD backfill batch ingestion change data capture data warehouse full refresh historical snapshot idempotent loads incremental load late-arriving data partition overwrite point-in-time

Stored enrichment (catalog DB)

Category
Methodology
Sub-category
Data Loading Methodology
Confidence
0.90
Version strategy
NOT_APPLICABLE

Maturity reasoning: Snapshot loads are a specialized data-loading pattern; JD volume is very low compared with mainstream ETL/ELT tools, and market discussion is mostly in niche data-engineering forums rather than broad hiring pipelines.

Skill profile (library / DB)

Skill nature
TOOL
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
11
Sub-category id
171
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Infrastructure Provisioning Templates Catalog dimension db id 291

    Library dimension (catalog)

    Roles linked in library: Cloud Engineer

  • Infrastructure as Code Catalog dimension db id 22

    Library dimension (catalog)

    Roles linked in library: DevOps Engineer

  • Infrastructure as Code and Declarative Provisioning Catalog dimension db id 36

    Library dimension (catalog)

    Roles linked in library: Azure Cloud Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Infrastructure Provisioning Templates
infrastructure-provisioning-templates
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Infrastructure as Code
infrastructure-as-code
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Infrastructure as Code and Declarative Provisioning
infrastructure-as-code-and-declarative-provisioning
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Docker Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: Docker id=153 · docker

Aliases — catalog

  • Metabase (CANONICAL) primary

Context tags (catalog)

BigQuery MySQL PostgreSQL Redshift SQL ad hoc analysis cards collections dashboards data visualization embedded analytics filters questions segments self-service BI

Stored enrichment (catalog DB)

Category
Tool
Sub-category
Bi Analytics Tool
Vendor
Metabase, Inc.
License
apache_2
Year introduced
2014
Confidence
0.90
Version strategy
NOT_APPLICABLE

Maturity reasoning: Metabase appears in many BI/analytics job postings and is growing in GitHub usage, but it is still far less universal than Tableau/Power BI in enterprise JDs.

Skill profile (library / DB)

Skill nature
TOOL
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
11
Sub-category id
170
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Containerization and Image Delivery Catalog dimension db id 24

    Library dimension (catalog)

    Roles linked in library: DevOps Engineer

  • Model Serving Deployment and Runtime Packaging Catalog dimension db id 52

    Library dimension (catalog)

    Roles linked in library: MLOps Engineer, Machine Learning Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Containerization and Image Delivery
containerization-and-image-delivery
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Model Serving Deployment and Runtime Packaging
model-serving-deployment-and-runtime-packaging
Existing dimension (library) · Role↔dimension saved
Kubernetes Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: Kubernetes id=158 · kubernetes

Aliases — catalog

  • Column-level security (CANONICAL) primary

Context tags (catalog)

ABAC PII access policies attribute-based access control audit logging data governance data masking database permissions dynamic masking fine-grained access control least privilege policy enforcement row-level security sensitive data static masking

Stored enrichment (catalog DB)

Category
Concept
Sub-category
Access Control Concept
Confidence
0.90
Version strategy
NOT_APPLICABLE

Maturity reasoning: Appears in cloud/data platform JDs and vendor docs for Snowflake, BigQuery, and PostgreSQL RLS/column masking, but is not yet a universal hiring staple like core IAM or RBAC.

Skill profile (library / DB)

Skill nature
PLATFORM
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
13
Sub-category id
1524
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Orchestration Platforms Catalog dimension db id 25

    Library dimension (catalog)

    Roles linked in library: Cloud Engineer, DevOps Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Orchestration Platforms
orchestration-platforms
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Pinecone Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: Pinecone id=2621 · pinecone

Aliases — catalog

  • components (CANONICAL) primary

Context tags (catalog)

Angular React UI libraries Vue component lifecycle component-driven development composition design systems hooks modular design props rendering responsive design slot state management

Stored enrichment (catalog DB)

Category
Concept
Sub-category
Ui Component Concept
Confidence
0.82
Version strategy
NOT_APPLICABLE

Maturity reasoning: UI components are a standard hiring-pipeline topic across React, Vue, Angular, and design-system JDs; component-based architecture is the default in modern frontend stacks.

Skill profile (library / DB)

Skill nature
PLATFORM
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
13
Sub-category id
2138
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Cloud Model Runtime Services Catalog dimension db id 121

    Library dimension (catalog)

    Roles linked in library: Machine Learning Engineer

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Cloud Model Runtime Services
cloud-model-runtime-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
OpenSearch Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: OpenSearch id=2622 · opensearch

Aliases — catalog

  • templates (CANONICAL) primary

Context tags (catalog)

Bootstrap CSS Dynamic content HTML Handlebars JSON Jinja Liquid Markdown Mustache Sass Static site generator Templating engine Twig XML YAML

Stored enrichment (catalog DB)

Category
Format
Sub-category
Template Format
Confidence
0.70
Version strategy
NOT_APPLICABLE

Maturity reasoning: Template engines are broadly used across web stacks; JDs commonly list Jinja2, Handlebars, Mustache, or Twig for server-side rendering and email generation, indicating steady hiring demand.

Skill profile (library / DB)

Skill nature
TOOL
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
12
Sub-category id
2139
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Cloud Data Platform Services Catalog dimension db id 81

    Library dimension (catalog)

    Roles linked in library: Data Engineer

  • Version Control Systems Catalog dimension db id 365

    Library dimension (catalog)

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Cloud Data Platform Services
cloud-data-platform-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Version Control Systems
d_init_01
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
FAISS Secondary Library skill API 3: existing canonical (in_db) Existing skill (matched library)
Canonical: FAISS id=2623 · faiss

Aliases — catalog

  • directives (CANONICAL) primary

Context tags (catalog)

Angular React Vue.js attribute directives component lifecycle conditional rendering custom directives data binding event handling ngFor ngIf props structural directives template expressions template syntax two-way binding v-bind v-for v-if

Stored enrichment (catalog DB)

Category
Concept
Sub-category
Template Directive Concept
Confidence
0.90
Version strategy
NOT_APPLICABLE

Maturity reasoning: “Directives” is a broad template concept, but JD volume is low and usually appears only inside specific stacks like Angular/Vue rather than as a standalone hiring keyword.

Skill profile (library / DB)

Skill nature
LIBRARY
Volatility
STABLE
Typical lifespan
EVERGREEN
Category id
6
Sub-category id
2140
Extractable
True
Also category
False

Dimensions (API 2 worklist)

  • Applied Machine Learning Toolkits Catalog dimension db id 94

    Library dimension (catalog)

    Roles linked in library: Data Scientist

  • Version Control Systems Catalog dimension db id 365

    Library dimension (catalog)

API 3 link attempts (this skill)

Dimension Skill↔dim Role↔dim Outcome
Applied Machine Learning Toolkits
applied-machine-learning-toolkits
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Version Control Systems
d_init_01
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

All API 3 persistence rows

Same grid as the skill-extractor “Persistence items” table: one row per (skill × dimension) work item.

Skill Tag Dimension Skill↔dim Role↔dim Outcome Notes
AWS in_db
Cloud Platform Operations
cloud-platform-operations
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AWS in_db
Cloud Security Platforms
cloud-security-platforms
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AI/ML in_db
AI Inference Cost, Latency, and Throughput Optimization
ai-inference-cost-latency-and-throughput-optimization
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AI/ML in_db
AI Service Architecture Patterns
ai-service-architecture-patterns
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AI/ML in_db
Applied Machine Learning Toolkits
applied-machine-learning-toolkits
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon SageMaker in_db
Managed ML Platform Workflows
d_split_01_01
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon SageMaker in_db
Managed Model Hosting and Endpoints
d_split_01_02
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon SageMaker in_db
Model Serving Deployment and Runtime Packaging
model-serving-deployment-and-runtime-packaging
Existing dimension (library) · Role↔dimension saved
Amazon Bedrock in_db
Cloud Model Runtime Services
cloud-model-runtime-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AWS Lambda in_db
Cloud Data Platform Services
cloud-data-platform-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
ECS in_db
Version Control Systems
d_init_01
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
EKS in_db
Cloud Model Runtime Services
cloud-model-runtime-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
EKS in_db
Orchestration Platforms
orchestration-platforms
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
EC2 in_db
Cloud Provider Core Services
cloud-provider-core-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AWS Glue in_db
Cloud Data Platform Services
cloud-data-platform-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon Athena in_db
Cloud Data Platform Security and Networking
d_split_01_04
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon Athena in_db
Cloud Data Platform Services
cloud-data-platform-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon Athena in_db
Managed ML Platform Workflows
d_split_01_01
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Redshift in_db
Data Warehousing Platforms
data-warehousing-platforms
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
AWS Data Pipeline in_db
Cloud Data Platform Services
cloud-data-platform-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
S3 in_db
Storage Provisioning and Automation
storage-provisioning-and-automation
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Kinesis in_db
Streaming Data Processing
streaming-data-processing
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Amazon API Gateway in_db
Web Service Frameworks
web-service-frameworks
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python in_db
Analytical Programming Languages
analytical-programming-languages
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python in_db
Automation Scripting and CLI
automation-scripting-and-cli
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python in_db
Automation and Scripting for Operations
automation-and-scripting-for-operations
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python in_db
Network Automation and Scripting
network-automation-and-scripting
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python in_db
Programming Languages for AI Workflows
programming-languages-for-ai-workflows
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python in_db
Programming Languages for Backend Systems
programming-languages-for-backend-systems
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python in_db
Programming Languages for Data Work
programming-languages-for-data-work
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python in_db
Programming Languages for ML Systems
programming-languages-for-ml-systems
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python in_db
Programming Languages for Security Work
programming-languages-for-security-work
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python in_db
Programming Languages for Test Automation
programming-languages-for-test-automation
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Python in_db
Security Automation and Scripting
security-automation-and-scripting
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
TensorFlow in_db
Applied Machine Learning Toolkits
applied-machine-learning-toolkits
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
PyTorch in_db
Applied Machine Learning Toolkits
applied-machine-learning-toolkits
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Scikit-learn in_db
Applied Machine Learning Toolkits
applied-machine-learning-toolkits
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
GitHub Actions in_db
Continuous Integration Test Integration
continuous-integration-test-integration
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Airflow in_db
Workflow Orchestration Systems
workflow-orchestration-systems
Existing dimension (library) · Role↔dimension saved
Terraform in_db
Infrastructure Provisioning Templates
infrastructure-provisioning-templates
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Terraform in_db
Infrastructure as Code
infrastructure-as-code
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Terraform in_db
Infrastructure as Code and Declarative Provisioning
infrastructure-as-code-and-declarative-provisioning
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Docker in_db
Containerization and Image Delivery
containerization-and-image-delivery
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Docker in_db
Model Serving Deployment and Runtime Packaging
model-serving-deployment-and-runtime-packaging
Existing dimension (library) · Role↔dimension saved
Kubernetes in_db
Orchestration Platforms
orchestration-platforms
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
Pinecone in_db
Cloud Model Runtime Services
cloud-model-runtime-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
OpenSearch in_db
Cloud Data Platform Services
cloud-data-platform-services
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
OpenSearch in_db
Version Control Systems
d_init_01
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
FAISS in_db
Applied Machine Learning Toolkits
applied-machine-learning-toolkits
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)
FAISS in_db
Version Control Systems
d_init_01
Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role)

Library artifacts (this run)

Kind Detail DB id
library_enrichment_backfilled AI/ML 2611
library_enrichment_backfilled Amazon SageMaker 2612
library_enrichment_backfilled Amazon Bedrock 2613
library_enrichment_backfilled AWS Lambda 2614
library_enrichment_backfilled ECS 2615
library_enrichment_backfilled EC2 1773
library_enrichment_backfilled Amazon Athena 2616
library_enrichment_backfilled Redshift 2570
library_enrichment_backfilled AWS Data Pipeline 2617
library_enrichment_backfilled S3 2618
library_enrichment_backfilled Kinesis 2619
library_enrichment_backfilled Amazon API Gateway 2620
library_enrichment_backfilled Pinecone 2621
library_enrichment_backfilled OpenSearch 2622
library_enrichment_backfilled FAISS 2623
API 1 — extract-from-jd click to toggle
{
  "final_skills": [
    {
      "is_primary": true,
      "skill_name": "AWS"
    },
    {
      "is_primary": true,
      "skill_name": "AI/ML"
    },
    {
      "is_primary": true,
      "skill_name": "Amazon SageMaker"
    },
    {
      "is_primary": true,
      "skill_name": "Amazon Bedrock"
    },
    {
      "is_primary": true,
      "skill_name": "AWS Lambda"
    },
    {
      "is_primary": true,
      "skill_name": "ECS"
    },
    {
      "is_primary": true,
      "skill_name": "EKS"
    },
    {
      "is_primary": true,
      "skill_name": "EC2"
    },
    {
      "is_primary": true,
      "skill_name": "AWS Glue"
    },
    {
      "is_primary": true,
      "skill_name": "Amazon Athena"
    },
    {
      "is_primary": true,
      "skill_name": "Redshift"
    },
    {
      "is_primary": true,
      "skill_name": "AWS Data Pipeline"
    },
    {
      "is_primary": true,
      "skill_name": "S3"
    },
    {
      "is_primary": true,
      "skill_name": "Kinesis"
    },
    {
      "is_primary": true,
      "skill_name": "Amazon API Gateway"
    },
    {
      "is_primary": true,
      "skill_name": "Python"
    },
    {
      "is_primary": false,
      "skill_name": "TensorFlow"
    },
    {
      "is_primary": false,
      "skill_name": "PyTorch"
    },
    {
      "is_primary": false,
      "skill_name": "Scikit-learn"
    },
    {
      "is_primary": false,
      "skill_name": "GitHub Actions"
    },
    {
      "is_primary": false,
      "skill_name": "Airflow"
    },
    {
      "is_primary": false,
      "skill_name": "Terraform"
    },
    {
      "is_primary": false,
      "skill_name": "Docker"
    },
    {
      "is_primary": false,
      "skill_name": "Kubernetes"
    },
    {
      "is_primary": false,
      "skill_name": "Pinecone"
    },
    {
      "is_primary": false,
      "skill_name": "OpenSearch"
    },
    {
      "is_primary": false,
      "skill_name": "FAISS"
    }
  ],
  "run_id": null
}
API 2 — extract-details
{
  "alias_matches": [
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 348,
      "existing_alias_text": "AWS",
      "input_term": "AWS",
      "matched_canonical": {
        "category_id": 13,
        "display_name": "AWS",
        "id": 163,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "aws",
        "sub_category_id": 161,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3408,
      "existing_alias_text": "AI/ML",
      "input_term": "AI/ML",
      "matched_canonical": {
        "category_id": 16,
        "display_name": "AI/ML",
        "id": 2611,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CONCEPT",
        "slug": "ai-ml",
        "sub_category_id": 2131,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3409,
      "existing_alias_text": "Amazon SageMaker",
      "input_term": "Amazon SageMaker",
      "matched_canonical": {
        "category_id": 13,
        "display_name": "Amazon SageMaker",
        "id": 2612,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "amazon-sagemaker",
        "sub_category_id": 326,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3410,
      "existing_alias_text": "Amazon Bedrock",
      "input_term": "Amazon Bedrock",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "Amazon Bedrock",
        "id": 2613,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "amazon-bedrock",
        "sub_category_id": 2132,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3411,
      "existing_alias_text": "AWS Lambda",
      "input_term": "AWS Lambda",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "AWS Lambda",
        "id": 2614,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "aws-lambda",
        "sub_category_id": 262,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3412,
      "existing_alias_text": "ECS",
      "input_term": "ECS",
      "matched_canonical": {
        "category_id": 2,
        "display_name": "ECS",
        "id": 2615,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CONCEPT",
        "slug": "ecs",
        "sub_category_id": 2133,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 1093,
      "existing_alias_text": "EKS",
      "input_term": "EKS",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "EKS",
        "id": 725,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "eks",
        "sub_category_id": 251,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 2372,
      "existing_alias_text": "EC2",
      "input_term": "EC2",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "EC2",
        "id": 1773,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "ec2",
        "sub_category_id": 1544,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 730,
      "existing_alias_text": "AWS Glue",
      "input_term": "AWS Glue",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "AWS Glue",
        "id": 466,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "aws-glue",
        "sub_category_id": 385,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3413,
      "existing_alias_text": "Amazon Athena",
      "input_term": "Amazon Athena",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "Amazon Athena",
        "id": 2616,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "amazon-athena",
        "sub_category_id": 388,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3367,
      "existing_alias_text": "Redshift",
      "input_term": "Redshift",
      "matched_canonical": {
        "category_id": 13,
        "display_name": "Redshift",
        "id": 2570,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "redshift",
        "sub_category_id": 2098,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3414,
      "existing_alias_text": "AWS Data Pipeline",
      "input_term": "AWS Data Pipeline",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "AWS Data Pipeline",
        "id": 2617,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "aws-data-pipeline",
        "sub_category_id": 2134,
        "typical_lifespan": "SHORT_LIVED",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3415,
      "existing_alias_text": "S3",
      "input_term": "S3",
      "matched_canonical": {
        "category_id": 13,
        "display_name": "S3",
        "id": 2618,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "s3",
        "sub_category_id": 2135,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3416,
      "existing_alias_text": "Kinesis",
      "input_term": "Kinesis",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "Kinesis",
        "id": 2619,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "kinesis",
        "sub_category_id": 2136,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3417,
      "existing_alias_text": "Amazon API Gateway",
      "input_term": "Amazon API Gateway",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "Amazon API Gateway",
        "id": 2620,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "amazon-api-gateway",
        "sub_category_id": 2137,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 608,
      "existing_alias_text": "Python",
      "input_term": "Python",
      "matched_canonical": {
        "category_id": 5,
        "display_name": "Python",
        "id": 393,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LANGUAGE",
        "slug": "python",
        "sub_category_id": 54,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 862,
      "existing_alias_text": "TensorFlow",
      "input_term": "TensorFlow",
      "matched_canonical": {
        "category_id": 6,
        "display_name": "TensorFlow",
        "id": 558,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "tensorflow",
        "sub_category_id": 456,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 861,
      "existing_alias_text": "PyTorch",
      "input_term": "PyTorch",
      "matched_canonical": {
        "category_id": 6,
        "display_name": "PyTorch",
        "id": 557,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "pytorch",
        "sub_category_id": 456,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 852,
      "existing_alias_text": "scikit-learn",
      "input_term": "Scikit-learn",
      "matched_canonical": {
        "category_id": 6,
        "display_name": "scikit-learn",
        "id": 554,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "scikit-learn",
        "sub_category_id": 458,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 1800,
      "existing_alias_text": "GitHub Actions",
      "input_term": "GitHub Actions",
      "matched_canonical": {
        "category_id": 14,
        "display_name": "GitHub Actions",
        "id": 1250,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "github-actions",
        "sub_category_id": 1019,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 540,
      "existing_alias_text": "Airflow",
      "input_term": "Airflow",
      "matched_canonical": {
        "category_id": 11,
        "display_name": "Airflow",
        "id": 325,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "airflow",
        "sub_category_id": 335,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 290,
      "existing_alias_text": "Terraform",
      "input_term": "Terraform",
      "matched_canonical": {
        "category_id": 11,
        "display_name": "Terraform",
        "id": 144,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "terraform",
        "sub_category_id": 171,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 299,
      "existing_alias_text": "Docker",
      "input_term": "Docker",
      "matched_canonical": {
        "category_id": 11,
        "display_name": "Docker",
        "id": 153,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "docker",
        "sub_category_id": 170,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 304,
      "existing_alias_text": "Kubernetes",
      "input_term": "Kubernetes",
      "matched_canonical": {
        "category_id": 13,
        "display_name": "Kubernetes",
        "id": 158,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "kubernetes",
        "sub_category_id": 1524,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3418,
      "existing_alias_text": "Pinecone",
      "input_term": "Pinecone",
      "matched_canonical": {
        "category_id": 13,
        "display_name": "Pinecone",
        "id": 2621,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "pinecone",
        "sub_category_id": 2138,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3419,
      "existing_alias_text": "OpenSearch",
      "input_term": "OpenSearch",
      "matched_canonical": {
        "category_id": 12,
        "display_name": "OpenSearch",
        "id": 2622,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "opensearch",
        "sub_category_id": 2139,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    },
    {
      "alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
      "alias_persisted": false,
      "existing_alias_id": 3420,
      "existing_alias_text": "FAISS",
      "input_term": "FAISS",
      "matched_canonical": {
        "category_id": 6,
        "display_name": "FAISS",
        "id": 2623,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "faiss",
        "sub_category_id": 2140,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "matched_via": "alias"
    }
  ],
  "candidate_roles": [
    {
      "display_name": "DevOps Engineer",
      "id": 1,
      "rationale": null,
      "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
      "slug": "devops-engineer",
      "source": "db"
    },
    {
      "display_name": "Cybersecurity Engineer",
      "id": 9,
      "rationale": null,
      "role_archetype": null,
      "slug": "cybersecurity-engineer",
      "source": "db"
    },
    {
      "display_name": "AI Engineer",
      "id": 12,
      "rationale": null,
      "role_archetype": null,
      "slug": "ai-engineer",
      "source": "db"
    },
    {
      "display_name": "Data Scientist",
      "id": 7,
      "rationale": null,
      "role_archetype": null,
      "slug": "data-scientist",
      "source": "db"
    },
    {
      "display_name": "MLOps Engineer",
      "id": 5,
      "rationale": null,
      "role_archetype": null,
      "slug": "mlops-engineer",
      "source": "db"
    },
    {
      "display_name": "Machine Learning Engineer",
      "id": 10,
      "rationale": null,
      "role_archetype": null,
      "slug": "machine-learning-engineer",
      "source": "db"
    },
    {
      "display_name": "Data Engineer",
      "id": 6,
      "rationale": null,
      "role_archetype": null,
      "slug": "data-engineer",
      "source": "db"
    },
    {
      "display_name": "Cloud Engineer",
      "id": 18,
      "rationale": null,
      "role_archetype": null,
      "slug": "cloud-engineer",
      "source": "db"
    },
    {
      "display_name": "Storage Engineer",
      "id": 22,
      "rationale": null,
      "role_archetype": null,
      "slug": "storage-engineer",
      "source": "db"
    },
    {
      "display_name": "Backend Engineer",
      "id": 14,
      "rationale": null,
      "role_archetype": null,
      "slug": "backend-engineer",
      "source": "db"
    },
    {
      "display_name": "Data Analyst",
      "id": 20,
      "rationale": null,
      "role_archetype": null,
      "slug": "data-analyst",
      "source": "db"
    },
    {
      "display_name": "Azure Cloud Engineer",
      "id": 4,
      "rationale": null,
      "role_archetype": null,
      "slug": "azure-cloud-engineer",
      "source": "db"
    },
    {
      "display_name": "Virtualization Engineer",
      "id": 26,
      "rationale": null,
      "role_archetype": null,
      "slug": "virtualization-engineer",
      "source": "db"
    },
    {
      "display_name": "Network Engineer",
      "id": 21,
      "rationale": null,
      "role_archetype": null,
      "slug": "network-engineer",
      "source": "db"
    },
    {
      "display_name": "Automation Tester",
      "id": 16,
      "rationale": null,
      "role_archetype": null,
      "slug": "automation-tester",
      "source": "db"
    }
  ],
  "chosen_role": {
    "display_name": "MLOps Engineer",
    "id": 5,
    "rationale": "The primary skills indicate a strong focus on AWS and AI/ML technologies, which align well with the responsibilities of an MLOps Engineer.",
    "role_archetype": null,
    "slug": "mlops-engineer",
    "source": "db"
  },
  "dimensions": [
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Platform Operations",
        "id": 26,
        "rationale": "Uses cloud provider services to support delivery and runtime environments. The focus is on consumer-level operation of cloud services rather than deep cloud architecture ownership.",
        "slug": "cloud-platform-operations",
        "source": "db"
      },
      "input_skill": "AWS",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "DevOps Engineer",
          "id": 1,
          "rationale": null,
          "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
          "slug": "devops-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Security Platforms",
        "id": 332,
        "rationale": "Cloud-native security products used to assess posture, detect misconfigurations, and monitor workloads across AWS, Azure, and GCP. This is a distinct product family because the role often works across multiple CNAPP/CSPM/CWPP offerings and cloud-native detectors.",
        "slug": "cloud-security-platforms",
        "source": "db"
      },
      "input_skill": "AWS",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cybersecurity Engineer",
          "id": 9,
          "rationale": null,
          "role_archetype": null,
          "slug": "cybersecurity-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "AI Inference Cost, Latency, and Throughput Optimization",
        "id": 260,
        "rationale": "Improving the speed, throughput, and cost efficiency of AI and ML-powered product features without sacrificing correctness or user experience. Includes token budgeting, prompt compression, batching, caching, model selection, quantization, pruning, async inference, warm starts, streaming UX, timeout tuning, concurrency control, and profiling. Excludes infrastructure autoscaling, model serving capacity planning, generic backend performance tuning, and unrelated data/warehouse optimization.",
        "slug": "ai-inference-cost-latency-and-throughput-optimization",
        "source": "db"
      },
      "input_skill": "AI/ML",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "AI Engineer",
          "id": 12,
          "rationale": null,
          "role_archetype": null,
          "slug": "ai-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "AI Service Architecture Patterns",
        "id": 270,
        "rationale": "Structuring AI capabilities within the product and service landscape. This includes deciding whether AI logic lives in handlers, workers, gateways, or dedicated orchestration services.",
        "slug": "ai-service-architecture-patterns",
        "source": "db"
      },
      "input_skill": "AI/ML",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "AI Engineer",
          "id": 12,
          "rationale": null,
          "role_archetype": null,
          "slug": "ai-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Applied Machine Learning Toolkits",
        "id": 94,
        "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
        "slug": "applied-machine-learning-toolkits",
        "source": "db"
      },
      "input_skill": "AI/ML",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Scientist",
          "id": 7,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-scientist",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Managed ML Platform Workflows",
        "id": 367,
        "rationale": "Cloud ML platforms for building and operating models end-to-end, including notebooks, experiments, managed training jobs, and pipeline/studio workflows. Examples: SageMaker Studio, SageMaker notebooks, SageMaker Pipelines, managed training jobs.",
        "slug": "d_split_01_01",
        "source": "db"
      },
      "input_skill": "Amazon SageMaker",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Managed Model Hosting and Endpoints",
        "id": 368,
        "rationale": "Cloud-managed services for deploying trained models as online or batch inference endpoints, including endpoint provisioning, batch transform, and rollout coordination. Examples: SageMaker endpoints, SageMaker batch transform.",
        "slug": "d_split_01_02",
        "source": "db"
      },
      "input_skill": "Amazon SageMaker",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Model Serving Deployment and Runtime Packaging",
        "id": 52,
        "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
        "slug": "model-serving-deployment-and-runtime-packaging",
        "source": "db"
      },
      "input_skill": "Amazon SageMaker",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "MLOps Engineer",
          "id": 5,
          "rationale": null,
          "role_archetype": null,
          "slug": "mlops-engineer",
          "source": "db"
        },
        {
          "display_name": "Machine Learning Engineer",
          "id": 10,
          "rationale": null,
          "role_archetype": null,
          "slug": "machine-learning-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Model Runtime Services",
        "id": 121,
        "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
        "slug": "cloud-model-runtime-services",
        "source": "db"
      },
      "input_skill": "Amazon Bedrock",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Machine Learning Engineer",
          "id": 10,
          "rationale": null,
          "role_archetype": null,
          "slug": "machine-learning-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Data Platform Services",
        "id": 81,
        "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
        "slug": "cloud-data-platform-services",
        "source": "db"
      },
      "input_skill": "AWS Lambda",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Version Control Systems",
        "id": 365,
        "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
        "slug": "d_init_01",
        "source": "db"
      },
      "input_skill": "ECS",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Model Runtime Services",
        "id": 121,
        "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
        "slug": "cloud-model-runtime-services",
        "source": "db"
      },
      "input_skill": "EKS",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Machine Learning Engineer",
          "id": 10,
          "rationale": null,
          "role_archetype": null,
          "slug": "machine-learning-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Orchestration Platforms",
        "id": 25,
        "rationale": "Operates the platforms that schedule and run containerized workloads and related deployment primitives. This is separate from image delivery because it concerns runtime placement and service rollout behavior.",
        "slug": "orchestration-platforms",
        "source": "db"
      },
      "input_skill": "EKS",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cloud Engineer",
          "id": 18,
          "rationale": null,
          "role_archetype": null,
          "slug": "cloud-engineer",
          "source": "db"
        },
        {
          "display_name": "DevOps Engineer",
          "id": 1,
          "rationale": null,
          "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
          "slug": "devops-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Provider Core Services",
        "id": 290,
        "rationale": "Core managed services used to provision and operate cloud environments. This is the base cloud surface for compute, storage, networking, and platform primitives the role configures and maintains.",
        "slug": "cloud-provider-core-services",
        "source": "db"
      },
      "input_skill": "EC2",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cloud Engineer",
          "id": 18,
          "rationale": null,
          "role_archetype": null,
          "slug": "cloud-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Data Platform Services",
        "id": 81,
        "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
        "slug": "cloud-data-platform-services",
        "source": "db"
      },
      "input_skill": "AWS Glue",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Data Platform Security and Networking",
        "id": 369,
        "rationale": "Identity, access, secrets, and networking primitives used to support cloud data platforms and pipelines.",
        "slug": "d_split_01_04",
        "source": "db"
      },
      "input_skill": "Amazon Athena",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Data Platform Services",
        "id": 81,
        "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
        "slug": "cloud-data-platform-services",
        "source": "db"
      },
      "input_skill": "Amazon Athena",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Managed ML Platform Workflows",
        "id": 367,
        "rationale": "Cloud ML platforms for building and operating models end-to-end, including notebooks, experiments, managed training jobs, and pipeline/studio workflows. Examples: SageMaker Studio, SageMaker notebooks, SageMaker Pipelines, managed training jobs.",
        "slug": "d_split_01_01",
        "source": "db"
      },
      "input_skill": "Amazon Athena",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Data Warehousing Platforms",
        "id": 72,
        "rationale": "Cloud and on-prem analytical storage systems used to persist curated datasets and serve downstream consumers. This cluster is about the warehouse/lakehouse layer where transformed data is organized for access.",
        "slug": "data-warehousing-platforms",
        "source": "db"
      },
      "input_skill": "Redshift",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Data Platform Services",
        "id": 81,
        "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
        "slug": "cloud-data-platform-services",
        "source": "db"
      },
      "input_skill": "AWS Data Pipeline",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Storage Provisioning and Automation",
        "id": 311,
        "rationale": "Covers the scripts, APIs, and operational workflows used to create, resize, map, and retire storage resources. This cluster is coherent because storage engineers often automate repetitive provisioning and maintenance tasks.",
        "slug": "storage-provisioning-and-automation",
        "source": "db"
      },
      "input_skill": "S3",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Storage Engineer",
          "id": 22,
          "rationale": null,
          "role_archetype": null,
          "slug": "storage-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Streaming Data Processing",
        "id": 69,
        "rationale": "Tools and patterns for ingesting and transforming event streams with low latency. This cluster covers continuous processing, windowing, and stateful stream jobs used to keep data fresh.",
        "slug": "streaming-data-processing",
        "source": "db"
      },
      "input_skill": "Kinesis",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Web Service Frameworks",
        "id": 141,
        "rationale": "Server frameworks used to build HTTP APIs, route requests, validate inputs, and structure backend application code. This cluster is coherent because it defines how backend services expose behavior to clients and other services.",
        "slug": "web-service-frameworks",
        "source": "db"
      },
      "input_skill": "Amazon API Gateway",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Backend Engineer",
          "id": 14,
          "rationale": null,
          "role_archetype": null,
          "slug": "backend-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Analytical Programming Languages",
        "id": 82,
        "rationale": "Languages used to clean, transform, analyze, and prototype models in notebooks and scripts. This is the core coding surface for expressing statistical logic and data manipulation in a reproducible way.",
        "slug": "analytical-programming-languages",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Analyst",
          "id": 20,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-analyst",
          "source": "db"
        },
        {
          "display_name": "Data Scientist",
          "id": 7,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-scientist",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Automation Scripting and CLI",
        "id": 48,
        "rationale": "Uses scripts and command-line tooling to execute repeatable Azure operations and reduce manual work. This is a practical cluster because the role frequently automates provisioning, checks, and remediation tasks.",
        "slug": "automation-scripting-and-cli",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Azure Cloud Engineer",
          "id": 4,
          "rationale": null,
          "role_archetype": null,
          "slug": "azure-cloud-engineer",
          "source": "db"
        },
        {
          "display_name": "Cloud Engineer",
          "id": 18,
          "rationale": null,
          "role_archetype": null,
          "slug": "cloud-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Automation and Scripting for Operations",
        "id": 361,
        "rationale": "Scripts and lightweight automation used to execute repetitive virtualization tasks and enforce operational consistency. This is the practical glue that reduces manual host and VM administration.",
        "slug": "automation-and-scripting-for-operations",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Virtualization Engineer",
          "id": 26,
          "rationale": null,
          "role_archetype": null,
          "slug": "virtualization-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Network Automation and Scripting",
        "id": 285,
        "rationale": "Covers scripts and automation used to configure, validate, and audit network devices and services. This cluster is coherent because repeatable network operations increasingly depend on programmatic changes and checks.",
        "slug": "network-automation-and-scripting",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Network Engineer",
          "id": 21,
          "rationale": null,
          "role_archetype": null,
          "slug": "network-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for AI Workflows",
        "id": 261,
        "rationale": "Languages used to implement AI feature logic, orchestration, and response handling inside product code. This is the core coding surface for turning prompts and model calls into reliable application behavior.",
        "slug": "programming-languages-for-ai-workflows",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "AI Engineer",
          "id": 12,
          "rationale": null,
          "role_archetype": null,
          "slug": "ai-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for Backend Systems",
        "id": 140,
        "rationale": "Languages used to implement server-side business logic, request handlers, workers, and service integrations. This is the core coding surface for backend feature delivery and maintenance.",
        "slug": "programming-languages-for-backend-systems",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Backend Engineer",
          "id": 14,
          "rationale": null,
          "role_archetype": null,
          "slug": "backend-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for Data Work",
        "id": 67,
        "rationale": "Languages used to implement data pipelines, transformations, and operational utilities. This is the code layer for expressing extraction, parsing, validation, and orchestration logic in data engineering workflows.",
        "slug": "programming-languages-for-data-work",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for ML Systems",
        "id": 113,
        "rationale": "Languages used to implement model integration code, inference services, and feature-processing logic. This is the core coding surface for turning trained models into product-facing software components.",
        "slug": "programming-languages-for-ml-systems",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Machine Learning Engineer",
          "id": 10,
          "rationale": null,
          "role_archetype": null,
          "slug": "machine-learning-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for Security Work",
        "id": 328,
        "rationale": "Languages used to automate security tasks, write detection logic, and build analysis or remediation tooling. This is the core coding surface for a cybersecurity engineer across scripts, queries, and small utilities.",
        "slug": "programming-languages-for-security-work",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cybersecurity Engineer",
          "id": 9,
          "rationale": null,
          "role_archetype": null,
          "slug": "cybersecurity-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Programming Languages for Test Automation",
        "id": 193,
        "rationale": "Languages used to implement automated checks, helper utilities, and test harness code. This is the core coding surface for turning test ideas into maintainable automation.",
        "slug": "programming-languages-for-test-automation",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Automation Tester",
          "id": 16,
          "rationale": null,
          "role_archetype": null,
          "slug": "automation-tester",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Security Automation and Scripting",
        "id": 258,
        "rationale": "Automating repeatable security checks, enrichment, and remediation workflows. This cluster is coherent because the role often needs lightweight automation to scale analysis and response.",
        "slug": "security-automation-and-scripting",
        "source": "db"
      },
      "input_skill": "Python",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cybersecurity Engineer",
          "id": 9,
          "rationale": null,
          "role_archetype": null,
          "slug": "cybersecurity-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Applied Machine Learning Toolkits",
        "id": 94,
        "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
        "slug": "applied-machine-learning-toolkits",
        "source": "db"
      },
      "input_skill": "TensorFlow",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Scientist",
          "id": 7,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-scientist",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Applied Machine Learning Toolkits",
        "id": 94,
        "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
        "slug": "applied-machine-learning-toolkits",
        "source": "db"
      },
      "input_skill": "PyTorch",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Scientist",
          "id": 7,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-scientist",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Applied Machine Learning Toolkits",
        "id": 94,
        "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
        "slug": "applied-machine-learning-toolkits",
        "source": "db"
      },
      "input_skill": "Scikit-learn",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Scientist",
          "id": 7,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-scientist",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Continuous Integration Test Integration",
        "id": 207,
        "rationale": "Integrating automated checks into shared build and merge workflows so results are repeatable and visible. This cluster is coherent because automation testers commonly configure test execution triggers, artifacts, and reporting hooks.",
        "slug": "continuous-integration-test-integration",
        "source": "db"
      },
      "input_skill": "GitHub Actions",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Automation Tester",
          "id": 16,
          "rationale": null,
          "role_archetype": null,
          "slug": "automation-tester",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Workflow Orchestration Systems",
        "id": 64,
        "rationale": "Operational orchestration of ML jobs, dependencies, and handoffs across training, validation, deployment, and retraining. This is a useful split from training pipelines because it emphasizes the scheduler and control plane.",
        "slug": "workflow-orchestration-systems",
        "source": "db"
      },
      "input_skill": "Airflow",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        },
        {
          "display_name": "MLOps Engineer",
          "id": 5,
          "rationale": null,
          "role_archetype": null,
          "slug": "mlops-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Infrastructure Provisioning Templates",
        "id": 291,
        "rationale": "Declarative templates and modules used to create repeatable cloud resources and environments. This cluster covers the infrastructure definitions the role applies, reviews, and updates to keep environments consistent.",
        "slug": "infrastructure-provisioning-templates",
        "source": "db"
      },
      "input_skill": "Terraform",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cloud Engineer",
          "id": 18,
          "rationale": null,
          "role_archetype": null,
          "slug": "cloud-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Infrastructure as Code",
        "id": 22,
        "rationale": "Defines infrastructure and platform resources through versioned code so environments are repeatable and reviewable. This is a coherent cluster because it underpins environment consistency and change control.",
        "slug": "infrastructure-as-code",
        "source": "db"
      },
      "input_skill": "Terraform",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "DevOps Engineer",
          "id": 1,
          "rationale": null,
          "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
          "slug": "devops-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Infrastructure as Code and Declarative Provisioning",
        "id": 36,
        "rationale": "Defines cloud and platform infrastructure declaratively through versioned code so environments are repeatable, reviewable, and automatable. This includes authoring and maintaining IaC templates/modules, managing parameters and state, and using plan/apply workflows to provision and update resources across Azure and other cloud platforms.",
        "slug": "infrastructure-as-code-and-declarative-provisioning",
        "source": "db"
      },
      "input_skill": "Terraform",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Azure Cloud Engineer",
          "id": 4,
          "rationale": null,
          "role_archetype": null,
          "slug": "azure-cloud-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Containerization and Image Delivery",
        "id": 24,
        "rationale": "Builds, packages, and ships application and support workloads as container images. This cluster covers the artifact format and the mechanics of producing deployable images.",
        "slug": "containerization-and-image-delivery",
        "source": "db"
      },
      "input_skill": "Docker",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "DevOps Engineer",
          "id": 1,
          "rationale": null,
          "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
          "slug": "devops-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Model Serving Deployment and Runtime Packaging",
        "id": 52,
        "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
        "slug": "model-serving-deployment-and-runtime-packaging",
        "source": "db"
      },
      "input_skill": "Docker",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "MLOps Engineer",
          "id": 5,
          "rationale": null,
          "role_archetype": null,
          "slug": "mlops-engineer",
          "source": "db"
        },
        {
          "display_name": "Machine Learning Engineer",
          "id": 10,
          "rationale": null,
          "role_archetype": null,
          "slug": "machine-learning-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Orchestration Platforms",
        "id": 25,
        "rationale": "Operates the platforms that schedule and run containerized workloads and related deployment primitives. This is separate from image delivery because it concerns runtime placement and service rollout behavior.",
        "slug": "orchestration-platforms",
        "source": "db"
      },
      "input_skill": "Kubernetes",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Cloud Engineer",
          "id": 18,
          "rationale": null,
          "role_archetype": null,
          "slug": "cloud-engineer",
          "source": "db"
        },
        {
          "display_name": "DevOps Engineer",
          "id": 1,
          "rationale": null,
          "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
          "slug": "devops-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Model Runtime Services",
        "id": 121,
        "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
        "slug": "cloud-model-runtime-services",
        "source": "db"
      },
      "input_skill": "Pinecone",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Machine Learning Engineer",
          "id": 10,
          "rationale": null,
          "role_archetype": null,
          "slug": "machine-learning-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Cloud Data Platform Services",
        "id": 81,
        "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
        "slug": "cloud-data-platform-services",
        "source": "db"
      },
      "input_skill": "OpenSearch",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Engineer",
          "id": 6,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-engineer",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Version Control Systems",
        "id": 365,
        "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
        "slug": "d_init_01",
        "source": "db"
      },
      "input_skill": "OpenSearch",
      "llm_role": null,
      "roles_from_db": []
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Applied Machine Learning Toolkits",
        "id": 94,
        "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
        "slug": "applied-machine-learning-toolkits",
        "source": "db"
      },
      "input_skill": "FAISS",
      "llm_role": null,
      "roles_from_db": [
        {
          "display_name": "Data Scientist",
          "id": 7,
          "rationale": null,
          "role_archetype": null,
          "slug": "data-scientist",
          "source": "db"
        }
      ]
    },
    {
      "dimension": {
        "difficulty_hint": "well_known",
        "display_name": "Version Control Systems",
        "id": 365,
        "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
        "slug": "d_init_01",
        "source": "db"
      },
      "input_skill": "FAISS",
      "llm_role": null,
      "roles_from_db": []
    }
  ],
  "input_final_skills": [
    "AWS",
    "AI/ML",
    "Amazon SageMaker",
    "Amazon Bedrock",
    "AWS Lambda",
    "ECS",
    "EKS",
    "EC2",
    "AWS Glue",
    "Amazon Athena",
    "Redshift",
    "AWS Data Pipeline",
    "S3",
    "Kinesis",
    "Amazon API Gateway",
    "Python",
    "TensorFlow",
    "PyTorch",
    "Scikit-learn",
    "GitHub Actions",
    "Airflow",
    "Terraform",
    "Docker",
    "Kubernetes",
    "Pinecone",
    "OpenSearch",
    "FAISS"
  ],
  "input_llm_skills": [
    "AWS",
    "AI/ML",
    "Amazon SageMaker",
    "Amazon Bedrock",
    "AWS Lambda",
    "ECS",
    "EKS",
    "EC2",
    "AWS Glue",
    "Amazon Athena",
    "Redshift",
    "AWS Data Pipeline",
    "S3",
    "Kinesis",
    "Amazon API Gateway",
    "Python",
    "TensorFlow",
    "PyTorch",
    "Scikit-learn",
    "GitHub Actions",
    "Airflow",
    "Terraform",
    "Docker",
    "Kubernetes",
    "Pinecone",
    "OpenSearch",
    "FAISS"
  ],
  "new_aliases_persisted": 0,
  "run_id": "832dda01-52e1-4739-b122-0a9cf9bc8100",
  "skills_detail": [
    {
      "aliases_in_db": [
        {
          "alias_text": "AWS",
          "alias_type": "CANONICAL",
          "id": 348,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 13,
        "display_name": "AWS",
        "id": 163,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "aws",
        "sub_category_id": 161,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Platform Operations",
            "id": 26,
            "rationale": "Uses cloud provider services to support delivery and runtime environments. The focus is on consumer-level operation of cloud services rather than deep cloud architecture ownership.",
            "slug": "cloud-platform-operations",
            "source": "db"
          },
          "input_skill": "AWS",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "DevOps Engineer",
              "id": 1,
              "rationale": null,
              "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
              "slug": "devops-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Security Platforms",
            "id": 332,
            "rationale": "Cloud-native security products used to assess posture, detect misconfigurations, and monitor workloads across AWS, Azure, and GCP. This is a distinct product family because the role often works across multiple CNAPP/CSPM/CWPP offerings and cloud-native detectors.",
            "slug": "cloud-security-platforms",
            "source": "db"
          },
          "input_skill": "AWS",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cybersecurity Engineer",
              "id": 9,
              "rationale": null,
              "role_archetype": null,
              "slug": "cybersecurity-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "AWS",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "AI/ML",
          "alias_type": "CANONICAL",
          "id": 3408,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 16,
        "display_name": "AI/ML",
        "id": 2611,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CONCEPT",
        "slug": "ai-ml",
        "sub_category_id": 2131,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "AI Inference Cost, Latency, and Throughput Optimization",
            "id": 260,
            "rationale": "Improving the speed, throughput, and cost efficiency of AI and ML-powered product features without sacrificing correctness or user experience. Includes token budgeting, prompt compression, batching, caching, model selection, quantization, pruning, async inference, warm starts, streaming UX, timeout tuning, concurrency control, and profiling. Excludes infrastructure autoscaling, model serving capacity planning, generic backend performance tuning, and unrelated data/warehouse optimization.",
            "slug": "ai-inference-cost-latency-and-throughput-optimization",
            "source": "db"
          },
          "input_skill": "AI/ML",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "AI Engineer",
              "id": 12,
              "rationale": null,
              "role_archetype": null,
              "slug": "ai-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "AI Service Architecture Patterns",
            "id": 270,
            "rationale": "Structuring AI capabilities within the product and service landscape. This includes deciding whether AI logic lives in handlers, workers, gateways, or dedicated orchestration services.",
            "slug": "ai-service-architecture-patterns",
            "source": "db"
          },
          "input_skill": "AI/ML",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "AI Engineer",
              "id": 12,
              "rationale": null,
              "role_archetype": null,
              "slug": "ai-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Applied Machine Learning Toolkits",
            "id": 94,
            "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
            "slug": "applied-machine-learning-toolkits",
            "source": "db"
          },
          "input_skill": "AI/ML",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Scientist",
              "id": 7,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-scientist",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "AI/ML",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Amazon SageMaker",
          "alias_type": "CANONICAL",
          "id": 3409,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 13,
        "display_name": "Amazon SageMaker",
        "id": 2612,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "amazon-sagemaker",
        "sub_category_id": 326,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Managed ML Platform Workflows",
            "id": 367,
            "rationale": "Cloud ML platforms for building and operating models end-to-end, including notebooks, experiments, managed training jobs, and pipeline/studio workflows. Examples: SageMaker Studio, SageMaker notebooks, SageMaker Pipelines, managed training jobs.",
            "slug": "d_split_01_01",
            "source": "db"
          },
          "input_skill": "Amazon SageMaker",
          "llm_role": null,
          "roles_from_db": []
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Managed Model Hosting and Endpoints",
            "id": 368,
            "rationale": "Cloud-managed services for deploying trained models as online or batch inference endpoints, including endpoint provisioning, batch transform, and rollout coordination. Examples: SageMaker endpoints, SageMaker batch transform.",
            "slug": "d_split_01_02",
            "source": "db"
          },
          "input_skill": "Amazon SageMaker",
          "llm_role": null,
          "roles_from_db": []
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Model Serving Deployment and Runtime Packaging",
            "id": 52,
            "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
            "slug": "model-serving-deployment-and-runtime-packaging",
            "source": "db"
          },
          "input_skill": "Amazon SageMaker",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "MLOps Engineer",
              "id": 5,
              "rationale": null,
              "role_archetype": null,
              "slug": "mlops-engineer",
              "source": "db"
            },
            {
              "display_name": "Machine Learning Engineer",
              "id": 10,
              "rationale": null,
              "role_archetype": null,
              "slug": "machine-learning-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Amazon SageMaker",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Amazon Bedrock",
          "alias_type": "CANONICAL",
          "id": 3410,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "Amazon Bedrock",
        "id": 2613,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "amazon-bedrock",
        "sub_category_id": 2132,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Model Runtime Services",
            "id": 121,
            "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
            "slug": "cloud-model-runtime-services",
            "source": "db"
          },
          "input_skill": "Amazon Bedrock",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Machine Learning Engineer",
              "id": 10,
              "rationale": null,
              "role_archetype": null,
              "slug": "machine-learning-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Amazon Bedrock",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "AWS Lambda",
          "alias_type": "CANONICAL",
          "id": 3411,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "AWS Lambda",
        "id": 2614,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "aws-lambda",
        "sub_category_id": 262,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Data Platform Services",
            "id": 81,
            "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
            "slug": "cloud-data-platform-services",
            "source": "db"
          },
          "input_skill": "AWS Lambda",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "AWS Lambda",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "ECS",
          "alias_type": "CANONICAL",
          "id": 3412,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 2,
        "display_name": "ECS",
        "id": 2615,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CONCEPT",
        "slug": "ecs",
        "sub_category_id": 2133,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Version Control Systems",
            "id": 365,
            "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
            "slug": "d_init_01",
            "source": "db"
          },
          "input_skill": "ECS",
          "llm_role": null,
          "roles_from_db": []
        }
      ],
      "input_skill": "ECS",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "EKS",
          "alias_type": "CANONICAL",
          "id": 1093,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "EKS",
        "id": 725,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "eks",
        "sub_category_id": 251,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Model Runtime Services",
            "id": 121,
            "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
            "slug": "cloud-model-runtime-services",
            "source": "db"
          },
          "input_skill": "EKS",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Machine Learning Engineer",
              "id": 10,
              "rationale": null,
              "role_archetype": null,
              "slug": "machine-learning-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Orchestration Platforms",
            "id": 25,
            "rationale": "Operates the platforms that schedule and run containerized workloads and related deployment primitives. This is separate from image delivery because it concerns runtime placement and service rollout behavior.",
            "slug": "orchestration-platforms",
            "source": "db"
          },
          "input_skill": "EKS",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cloud Engineer",
              "id": 18,
              "rationale": null,
              "role_archetype": null,
              "slug": "cloud-engineer",
              "source": "db"
            },
            {
              "display_name": "DevOps Engineer",
              "id": 1,
              "rationale": null,
              "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
              "slug": "devops-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "EKS",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "EC2",
          "alias_type": "CANONICAL",
          "id": 2372,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "EC2",
        "id": 1773,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "ec2",
        "sub_category_id": 1544,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Provider Core Services",
            "id": 290,
            "rationale": "Core managed services used to provision and operate cloud environments. This is the base cloud surface for compute, storage, networking, and platform primitives the role configures and maintains.",
            "slug": "cloud-provider-core-services",
            "source": "db"
          },
          "input_skill": "EC2",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cloud Engineer",
              "id": 18,
              "rationale": null,
              "role_archetype": null,
              "slug": "cloud-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "EC2",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "AWS Glue",
          "alias_type": "CANONICAL",
          "id": 730,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "AWS Glue",
        "id": 466,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "aws-glue",
        "sub_category_id": 385,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Data Platform Services",
            "id": 81,
            "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
            "slug": "cloud-data-platform-services",
            "source": "db"
          },
          "input_skill": "AWS Glue",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "AWS Glue",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Amazon Athena",
          "alias_type": "CANONICAL",
          "id": 3413,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "Amazon Athena",
        "id": 2616,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "amazon-athena",
        "sub_category_id": 388,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Data Platform Security and Networking",
            "id": 369,
            "rationale": "Identity, access, secrets, and networking primitives used to support cloud data platforms and pipelines.",
            "slug": "d_split_01_04",
            "source": "db"
          },
          "input_skill": "Amazon Athena",
          "llm_role": null,
          "roles_from_db": []
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Data Platform Services",
            "id": 81,
            "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
            "slug": "cloud-data-platform-services",
            "source": "db"
          },
          "input_skill": "Amazon Athena",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Managed ML Platform Workflows",
            "id": 367,
            "rationale": "Cloud ML platforms for building and operating models end-to-end, including notebooks, experiments, managed training jobs, and pipeline/studio workflows. Examples: SageMaker Studio, SageMaker notebooks, SageMaker Pipelines, managed training jobs.",
            "slug": "d_split_01_01",
            "source": "db"
          },
          "input_skill": "Amazon Athena",
          "llm_role": null,
          "roles_from_db": []
        }
      ],
      "input_skill": "Amazon Athena",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Redshift",
          "alias_type": "CANONICAL",
          "id": 3367,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 13,
        "display_name": "Redshift",
        "id": 2570,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "redshift",
        "sub_category_id": 2098,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Data Warehousing Platforms",
            "id": 72,
            "rationale": "Cloud and on-prem analytical storage systems used to persist curated datasets and serve downstream consumers. This cluster is about the warehouse/lakehouse layer where transformed data is organized for access.",
            "slug": "data-warehousing-platforms",
            "source": "db"
          },
          "input_skill": "Redshift",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Redshift",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "AWS Data Pipeline",
          "alias_type": "CANONICAL",
          "id": 3414,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "AWS Data Pipeline",
        "id": 2617,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "aws-data-pipeline",
        "sub_category_id": 2134,
        "typical_lifespan": "SHORT_LIVED",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Data Platform Services",
            "id": 81,
            "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
            "slug": "cloud-data-platform-services",
            "source": "db"
          },
          "input_skill": "AWS Data Pipeline",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "AWS Data Pipeline",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "S3",
          "alias_type": "CANONICAL",
          "id": 3415,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 13,
        "display_name": "S3",
        "id": 2618,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "s3",
        "sub_category_id": 2135,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Storage Provisioning and Automation",
            "id": 311,
            "rationale": "Covers the scripts, APIs, and operational workflows used to create, resize, map, and retire storage resources. This cluster is coherent because storage engineers often automate repetitive provisioning and maintenance tasks.",
            "slug": "storage-provisioning-and-automation",
            "source": "db"
          },
          "input_skill": "S3",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Storage Engineer",
              "id": 22,
              "rationale": null,
              "role_archetype": null,
              "slug": "storage-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "S3",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Kinesis",
          "alias_type": "CANONICAL",
          "id": 3416,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "Kinesis",
        "id": 2619,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "kinesis",
        "sub_category_id": 2136,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Streaming Data Processing",
            "id": 69,
            "rationale": "Tools and patterns for ingesting and transforming event streams with low latency. This cluster covers continuous processing, windowing, and stateful stream jobs used to keep data fresh.",
            "slug": "streaming-data-processing",
            "source": "db"
          },
          "input_skill": "Kinesis",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Kinesis",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Amazon API Gateway",
          "alias_type": "CANONICAL",
          "id": 3417,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "Amazon API Gateway",
        "id": 2620,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "amazon-api-gateway",
        "sub_category_id": 2137,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Web Service Frameworks",
            "id": 141,
            "rationale": "Server frameworks used to build HTTP APIs, route requests, validate inputs, and structure backend application code. This cluster is coherent because it defines how backend services expose behavior to clients and other services.",
            "slug": "web-service-frameworks",
            "source": "db"
          },
          "input_skill": "Amazon API Gateway",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Backend Engineer",
              "id": 14,
              "rationale": null,
              "role_archetype": null,
              "slug": "backend-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Amazon API Gateway",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Python",
          "alias_type": "CANONICAL",
          "id": 608,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 2",
          "alias_type": "VERSION",
          "id": 611,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 2.x",
          "alias_type": "VERSION",
          "id": 613,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 3",
          "alias_type": "VERSION",
          "id": 612,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 3.10",
          "alias_type": "VERSION",
          "id": 2330,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 3.11",
          "alias_type": "VERSION",
          "id": 2331,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 3.12",
          "alias_type": "VERSION",
          "id": 2332,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Python 3.x",
          "alias_type": "VERSION",
          "id": 614,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "py2",
          "alias_type": "VERSION",
          "id": 609,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "py3",
          "alias_type": "VERSION",
          "id": 610,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 2",
          "alias_type": "VERSION",
          "id": 2152,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 2.x",
          "alias_type": "VERSION",
          "id": 2154,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 3",
          "alias_type": "VERSION",
          "id": 990,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 3.10",
          "alias_type": "VERSION",
          "id": 992,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 3.11",
          "alias_type": "VERSION",
          "id": 993,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 3.12",
          "alias_type": "VERSION",
          "id": 994,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python 3.x",
          "alias_type": "VERSION",
          "id": 991,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python2",
          "alias_type": "VERSION",
          "id": 2150,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "python3",
          "alias_type": "VERSION",
          "id": 989,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 5,
        "display_name": "Python",
        "id": 393,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LANGUAGE",
        "slug": "python",
        "sub_category_id": 54,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Analytical Programming Languages",
            "id": 82,
            "rationale": "Languages used to clean, transform, analyze, and prototype models in notebooks and scripts. This is the core coding surface for expressing statistical logic and data manipulation in a reproducible way.",
            "slug": "analytical-programming-languages",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Analyst",
              "id": 20,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-analyst",
              "source": "db"
            },
            {
              "display_name": "Data Scientist",
              "id": 7,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-scientist",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Automation Scripting and CLI",
            "id": 48,
            "rationale": "Uses scripts and command-line tooling to execute repeatable Azure operations and reduce manual work. This is a practical cluster because the role frequently automates provisioning, checks, and remediation tasks.",
            "slug": "automation-scripting-and-cli",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Azure Cloud Engineer",
              "id": 4,
              "rationale": null,
              "role_archetype": null,
              "slug": "azure-cloud-engineer",
              "source": "db"
            },
            {
              "display_name": "Cloud Engineer",
              "id": 18,
              "rationale": null,
              "role_archetype": null,
              "slug": "cloud-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Automation and Scripting for Operations",
            "id": 361,
            "rationale": "Scripts and lightweight automation used to execute repetitive virtualization tasks and enforce operational consistency. This is the practical glue that reduces manual host and VM administration.",
            "slug": "automation-and-scripting-for-operations",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Virtualization Engineer",
              "id": 26,
              "rationale": null,
              "role_archetype": null,
              "slug": "virtualization-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Network Automation and Scripting",
            "id": 285,
            "rationale": "Covers scripts and automation used to configure, validate, and audit network devices and services. This cluster is coherent because repeatable network operations increasingly depend on programmatic changes and checks.",
            "slug": "network-automation-and-scripting",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Network Engineer",
              "id": 21,
              "rationale": null,
              "role_archetype": null,
              "slug": "network-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for AI Workflows",
            "id": 261,
            "rationale": "Languages used to implement AI feature logic, orchestration, and response handling inside product code. This is the core coding surface for turning prompts and model calls into reliable application behavior.",
            "slug": "programming-languages-for-ai-workflows",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "AI Engineer",
              "id": 12,
              "rationale": null,
              "role_archetype": null,
              "slug": "ai-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for Backend Systems",
            "id": 140,
            "rationale": "Languages used to implement server-side business logic, request handlers, workers, and service integrations. This is the core coding surface for backend feature delivery and maintenance.",
            "slug": "programming-languages-for-backend-systems",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Backend Engineer",
              "id": 14,
              "rationale": null,
              "role_archetype": null,
              "slug": "backend-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for Data Work",
            "id": 67,
            "rationale": "Languages used to implement data pipelines, transformations, and operational utilities. This is the code layer for expressing extraction, parsing, validation, and orchestration logic in data engineering workflows.",
            "slug": "programming-languages-for-data-work",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for ML Systems",
            "id": 113,
            "rationale": "Languages used to implement model integration code, inference services, and feature-processing logic. This is the core coding surface for turning trained models into product-facing software components.",
            "slug": "programming-languages-for-ml-systems",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Machine Learning Engineer",
              "id": 10,
              "rationale": null,
              "role_archetype": null,
              "slug": "machine-learning-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for Security Work",
            "id": 328,
            "rationale": "Languages used to automate security tasks, write detection logic, and build analysis or remediation tooling. This is the core coding surface for a cybersecurity engineer across scripts, queries, and small utilities.",
            "slug": "programming-languages-for-security-work",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cybersecurity Engineer",
              "id": 9,
              "rationale": null,
              "role_archetype": null,
              "slug": "cybersecurity-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Programming Languages for Test Automation",
            "id": 193,
            "rationale": "Languages used to implement automated checks, helper utilities, and test harness code. This is the core coding surface for turning test ideas into maintainable automation.",
            "slug": "programming-languages-for-test-automation",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Automation Tester",
              "id": 16,
              "rationale": null,
              "role_archetype": null,
              "slug": "automation-tester",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Security Automation and Scripting",
            "id": 258,
            "rationale": "Automating repeatable security checks, enrichment, and remediation workflows. This cluster is coherent because the role often needs lightweight automation to scale analysis and response.",
            "slug": "security-automation-and-scripting",
            "source": "db"
          },
          "input_skill": "Python",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cybersecurity Engineer",
              "id": 9,
              "rationale": null,
              "role_archetype": null,
              "slug": "cybersecurity-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Python",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "TensorFlow",
          "alias_type": "CANONICAL",
          "id": 862,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TF1",
          "alias_type": "VERSION",
          "id": 863,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TF2",
          "alias_type": "VERSION",
          "id": 864,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TensorFlow 1",
          "alias_type": "VERSION",
          "id": 865,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TensorFlow 1.x",
          "alias_type": "VERSION",
          "id": 867,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TensorFlow 2",
          "alias_type": "VERSION",
          "id": 866,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "TensorFlow 2.x",
          "alias_type": "VERSION",
          "id": 868,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 6,
        "display_name": "TensorFlow",
        "id": 558,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "tensorflow",
        "sub_category_id": 456,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Applied Machine Learning Toolkits",
            "id": 94,
            "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
            "slug": "applied-machine-learning-toolkits",
            "source": "db"
          },
          "input_skill": "TensorFlow",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Scientist",
              "id": 7,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-scientist",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "TensorFlow",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "PyTorch",
          "alias_type": "CANONICAL",
          "id": 861,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 6,
        "display_name": "PyTorch",
        "id": 557,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "pytorch",
        "sub_category_id": 456,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Applied Machine Learning Toolkits",
            "id": 94,
            "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
            "slug": "applied-machine-learning-toolkits",
            "source": "db"
          },
          "input_skill": "PyTorch",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Scientist",
              "id": 7,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-scientist",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "PyTorch",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "scikit-learn",
          "alias_type": "CANONICAL",
          "id": 852,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 6,
        "display_name": "scikit-learn",
        "id": 554,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "scikit-learn",
        "sub_category_id": 458,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Applied Machine Learning Toolkits",
            "id": 94,
            "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
            "slug": "applied-machine-learning-toolkits",
            "source": "db"
          },
          "input_skill": "Scikit-learn",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Scientist",
              "id": 7,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-scientist",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Scikit-learn",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "GitHub Actions",
          "alias_type": "CANONICAL",
          "id": 1800,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 14,
        "display_name": "GitHub Actions",
        "id": 1250,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "CLOUD_SERVICE",
        "slug": "github-actions",
        "sub_category_id": 1019,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Continuous Integration Test Integration",
            "id": 207,
            "rationale": "Integrating automated checks into shared build and merge workflows so results are repeatable and visible. This cluster is coherent because automation testers commonly configure test execution triggers, artifacts, and reporting hooks.",
            "slug": "continuous-integration-test-integration",
            "source": "db"
          },
          "input_skill": "GitHub Actions",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Automation Tester",
              "id": 16,
              "rationale": null,
              "role_archetype": null,
              "slug": "automation-tester",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "GitHub Actions",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Airflow",
          "alias_type": "CANONICAL",
          "id": 540,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 11,
        "display_name": "Airflow",
        "id": 325,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "airflow",
        "sub_category_id": 335,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Workflow Orchestration Systems",
            "id": 64,
            "rationale": "Operational orchestration of ML jobs, dependencies, and handoffs across training, validation, deployment, and retraining. This is a useful split from training pipelines because it emphasizes the scheduler and control plane.",
            "slug": "workflow-orchestration-systems",
            "source": "db"
          },
          "input_skill": "Airflow",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            },
            {
              "display_name": "MLOps Engineer",
              "id": 5,
              "rationale": null,
              "role_archetype": null,
              "slug": "mlops-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Airflow",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Terraform",
          "alias_type": "CANONICAL",
          "id": 290,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 11,
        "display_name": "Terraform",
        "id": 144,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "terraform",
        "sub_category_id": 171,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Infrastructure Provisioning Templates",
            "id": 291,
            "rationale": "Declarative templates and modules used to create repeatable cloud resources and environments. This cluster covers the infrastructure definitions the role applies, reviews, and updates to keep environments consistent.",
            "slug": "infrastructure-provisioning-templates",
            "source": "db"
          },
          "input_skill": "Terraform",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cloud Engineer",
              "id": 18,
              "rationale": null,
              "role_archetype": null,
              "slug": "cloud-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Infrastructure as Code",
            "id": 22,
            "rationale": "Defines infrastructure and platform resources through versioned code so environments are repeatable and reviewable. This is a coherent cluster because it underpins environment consistency and change control.",
            "slug": "infrastructure-as-code",
            "source": "db"
          },
          "input_skill": "Terraform",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "DevOps Engineer",
              "id": 1,
              "rationale": null,
              "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
              "slug": "devops-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Infrastructure as Code and Declarative Provisioning",
            "id": 36,
            "rationale": "Defines cloud and platform infrastructure declaratively through versioned code so environments are repeatable, reviewable, and automatable. This includes authoring and maintaining IaC templates/modules, managing parameters and state, and using plan/apply workflows to provision and update resources across Azure and other cloud platforms.",
            "slug": "infrastructure-as-code-and-declarative-provisioning",
            "source": "db"
          },
          "input_skill": "Terraform",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Azure Cloud Engineer",
              "id": 4,
              "rationale": null,
              "role_archetype": null,
              "slug": "azure-cloud-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Terraform",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Docker",
          "alias_type": "CANONICAL",
          "id": 299,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 11,
        "display_name": "Docker",
        "id": 153,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "docker",
        "sub_category_id": 170,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Containerization and Image Delivery",
            "id": 24,
            "rationale": "Builds, packages, and ships application and support workloads as container images. This cluster covers the artifact format and the mechanics of producing deployable images.",
            "slug": "containerization-and-image-delivery",
            "source": "db"
          },
          "input_skill": "Docker",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "DevOps Engineer",
              "id": 1,
              "rationale": null,
              "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
              "slug": "devops-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Model Serving Deployment and Runtime Packaging",
            "id": 52,
            "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
            "slug": "model-serving-deployment-and-runtime-packaging",
            "source": "db"
          },
          "input_skill": "Docker",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "MLOps Engineer",
              "id": 5,
              "rationale": null,
              "role_archetype": null,
              "slug": "mlops-engineer",
              "source": "db"
            },
            {
              "display_name": "Machine Learning Engineer",
              "id": 10,
              "rationale": null,
              "role_archetype": null,
              "slug": "machine-learning-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Docker",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Kubernetes",
          "alias_type": "CANONICAL",
          "id": 304,
          "is_primary": true,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.0",
          "alias_type": "VERSION",
          "id": 307,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.0+",
          "alias_type": "VERSION",
          "id": 2366,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.1",
          "alias_type": "VERSION",
          "id": 308,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.10",
          "alias_type": "VERSION",
          "id": 318,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.11",
          "alias_type": "VERSION",
          "id": 319,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.12",
          "alias_type": "VERSION",
          "id": 320,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.13",
          "alias_type": "VERSION",
          "id": 321,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.14",
          "alias_type": "VERSION",
          "id": 322,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.15",
          "alias_type": "VERSION",
          "id": 323,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.16",
          "alias_type": "VERSION",
          "id": 324,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.17",
          "alias_type": "VERSION",
          "id": 325,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.18",
          "alias_type": "VERSION",
          "id": 326,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.19",
          "alias_type": "VERSION",
          "id": 327,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.2",
          "alias_type": "VERSION",
          "id": 309,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.20",
          "alias_type": "VERSION",
          "id": 328,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.21",
          "alias_type": "VERSION",
          "id": 329,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.22",
          "alias_type": "VERSION",
          "id": 330,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.23",
          "alias_type": "VERSION",
          "id": 331,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.24",
          "alias_type": "VERSION",
          "id": 332,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.25",
          "alias_type": "VERSION",
          "id": 333,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.26",
          "alias_type": "VERSION",
          "id": 334,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.27",
          "alias_type": "VERSION",
          "id": 335,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.28",
          "alias_type": "VERSION",
          "id": 336,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.29",
          "alias_type": "VERSION",
          "id": 337,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.3",
          "alias_type": "VERSION",
          "id": 310,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.30",
          "alias_type": "VERSION",
          "id": 338,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.4",
          "alias_type": "VERSION",
          "id": 311,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.5",
          "alias_type": "VERSION",
          "id": 312,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.6",
          "alias_type": "VERSION",
          "id": 313,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.7",
          "alias_type": "VERSION",
          "id": 314,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.8",
          "alias_type": "VERSION",
          "id": 315,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.9",
          "alias_type": "VERSION",
          "id": 316,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes 1.x",
          "alias_type": "VERSION",
          "id": 317,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "Kubernetes v1",
          "alias_type": "VERSION",
          "id": 306,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        },
        {
          "alias_text": "k8s",
          "alias_type": "VERSION",
          "id": 305,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 13,
        "display_name": "Kubernetes",
        "id": 158,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "kubernetes",
        "sub_category_id": 1524,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Orchestration Platforms",
            "id": 25,
            "rationale": "Operates the platforms that schedule and run containerized workloads and related deployment primitives. This is separate from image delivery because it concerns runtime placement and service rollout behavior.",
            "slug": "orchestration-platforms",
            "source": "db"
          },
          "input_skill": "Kubernetes",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Cloud Engineer",
              "id": 18,
              "rationale": null,
              "role_archetype": null,
              "slug": "cloud-engineer",
              "source": "db"
            },
            {
              "display_name": "DevOps Engineer",
              "id": 1,
              "rationale": null,
              "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
              "slug": "devops-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Kubernetes",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "Pinecone",
          "alias_type": "CANONICAL",
          "id": 3418,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 13,
        "display_name": "Pinecone",
        "id": 2621,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "PLATFORM",
        "slug": "pinecone",
        "sub_category_id": 2138,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Model Runtime Services",
            "id": 121,
            "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
            "slug": "cloud-model-runtime-services",
            "source": "db"
          },
          "input_skill": "Pinecone",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Machine Learning Engineer",
              "id": 10,
              "rationale": null,
              "role_archetype": null,
              "slug": "machine-learning-engineer",
              "source": "db"
            }
          ]
        }
      ],
      "input_skill": "Pinecone",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "OpenSearch",
          "alias_type": "CANONICAL",
          "id": 3419,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 12,
        "display_name": "OpenSearch",
        "id": 2622,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "TOOL",
        "slug": "opensearch",
        "sub_category_id": 2139,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Cloud Data Platform Services",
            "id": 81,
            "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
            "slug": "cloud-data-platform-services",
            "source": "db"
          },
          "input_skill": "OpenSearch",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Engineer",
              "id": 6,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-engineer",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Version Control Systems",
            "id": 365,
            "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
            "slug": "d_init_01",
            "source": "db"
          },
          "input_skill": "OpenSearch",
          "llm_role": null,
          "roles_from_db": []
        }
      ],
      "input_skill": "OpenSearch",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [
        {
          "alias_text": "FAISS",
          "alias_type": "CANONICAL",
          "id": 3420,
          "is_primary": false,
          "match_strategy": "CASE_INSENSITIVE"
        }
      ],
      "canonical": {
        "category_id": 6,
        "display_name": "FAISS",
        "id": 2623,
        "is_also_category": false,
        "is_extractable": true,
        "skill_nature": "LIBRARY",
        "slug": "faiss",
        "sub_category_id": 2140,
        "typical_lifespan": "EVERGREEN",
        "volatility": "STABLE"
      },
      "dimensions": [
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Applied Machine Learning Toolkits",
            "id": 94,
            "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
            "slug": "applied-machine-learning-toolkits",
            "source": "db"
          },
          "input_skill": "FAISS",
          "llm_role": null,
          "roles_from_db": [
            {
              "display_name": "Data Scientist",
              "id": 7,
              "rationale": null,
              "role_archetype": null,
              "slug": "data-scientist",
              "source": "db"
            }
          ]
        },
        {
          "dimension": {
            "difficulty_hint": "well_known",
            "display_name": "Version Control Systems",
            "id": 365,
            "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
            "slug": "d_init_01",
            "source": "db"
          },
          "input_skill": "FAISS",
          "llm_role": null,
          "roles_from_db": []
        }
      ],
      "input_skill": "FAISS",
      "matched_via": "alias",
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": null,
      "source_tag": "db",
      "was_in_llm_skills": true
    }
  ],
  "unmatched_skills": []
}
API 3 — final-role-output
{
  "chosen_role": {
    "display_name": "MLOps Engineer",
    "id": 5,
    "rationale": "The primary skills indicate a strong focus on AWS and AI/ML technologies, which align well with the responsibilities of an MLOps Engineer.",
    "role_archetype": null,
    "slug": "mlops-engineer",
    "source": "db"
  },
  "chosen_role_resolution": "in_db",
  "final_input_skills": [
    {
      "skill": "AWS",
      "tag": "in_db"
    },
    {
      "skill": "AI/ML",
      "tag": "in_db"
    },
    {
      "skill": "Amazon SageMaker",
      "tag": "in_db"
    },
    {
      "skill": "Amazon Bedrock",
      "tag": "in_db"
    },
    {
      "skill": "AWS Lambda",
      "tag": "in_db"
    },
    {
      "skill": "ECS",
      "tag": "in_db"
    },
    {
      "skill": "EKS",
      "tag": "in_db"
    },
    {
      "skill": "EC2",
      "tag": "in_db"
    },
    {
      "skill": "AWS Glue",
      "tag": "in_db"
    },
    {
      "skill": "Amazon Athena",
      "tag": "in_db"
    },
    {
      "skill": "Redshift",
      "tag": "in_db"
    },
    {
      "skill": "AWS Data Pipeline",
      "tag": "in_db"
    },
    {
      "skill": "S3",
      "tag": "in_db"
    },
    {
      "skill": "Kinesis",
      "tag": "in_db"
    },
    {
      "skill": "Amazon API Gateway",
      "tag": "in_db"
    },
    {
      "skill": "Python",
      "tag": "in_db"
    },
    {
      "skill": "TensorFlow",
      "tag": "in_db"
    },
    {
      "skill": "PyTorch",
      "tag": "in_db"
    },
    {
      "skill": "Scikit-learn",
      "tag": "in_db"
    },
    {
      "skill": "GitHub Actions",
      "tag": "in_db"
    },
    {
      "skill": "Airflow",
      "tag": "in_db"
    },
    {
      "skill": "Terraform",
      "tag": "in_db"
    },
    {
      "skill": "Docker",
      "tag": "in_db"
    },
    {
      "skill": "Kubernetes",
      "tag": "in_db"
    },
    {
      "skill": "Pinecone",
      "tag": "in_db"
    },
    {
      "skill": "OpenSearch",
      "tag": "in_db"
    },
    {
      "skill": "FAISS",
      "tag": "in_db"
    }
  ],
  "persistence": {
    "items": [
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Platform Operations",
          "id": 26,
          "rationale": "Uses cloud provider services to support delivery and runtime environments. The focus is on consumer-level operation of cloud services rather than deep cloud architecture ownership.",
          "slug": "cloud-platform-operations",
          "source": "db"
        },
        "dimension_id": 26,
        "input_skill": "AWS",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "DevOps Engineer",
            "id": 1,
            "rationale": null,
            "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
            "slug": "devops-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 163,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Security Platforms",
          "id": 332,
          "rationale": "Cloud-native security products used to assess posture, detect misconfigurations, and monitor workloads across AWS, Azure, and GCP. This is a distinct product family because the role often works across multiple CNAPP/CSPM/CWPP offerings and cloud-native detectors.",
          "slug": "cloud-security-platforms",
          "source": "db"
        },
        "dimension_id": 332,
        "input_skill": "AWS",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cybersecurity Engineer",
            "id": 9,
            "rationale": null,
            "role_archetype": null,
            "slug": "cybersecurity-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 163,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "AI Inference Cost, Latency, and Throughput Optimization",
          "id": 260,
          "rationale": "Improving the speed, throughput, and cost efficiency of AI and ML-powered product features without sacrificing correctness or user experience. Includes token budgeting, prompt compression, batching, caching, model selection, quantization, pruning, async inference, warm starts, streaming UX, timeout tuning, concurrency control, and profiling. Excludes infrastructure autoscaling, model serving capacity planning, generic backend performance tuning, and unrelated data/warehouse optimization.",
          "slug": "ai-inference-cost-latency-and-throughput-optimization",
          "source": "db"
        },
        "dimension_id": 260,
        "input_skill": "AI/ML",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "AI Engineer",
            "id": 12,
            "rationale": null,
            "role_archetype": null,
            "slug": "ai-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2611,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "AI Service Architecture Patterns",
          "id": 270,
          "rationale": "Structuring AI capabilities within the product and service landscape. This includes deciding whether AI logic lives in handlers, workers, gateways, or dedicated orchestration services.",
          "slug": "ai-service-architecture-patterns",
          "source": "db"
        },
        "dimension_id": 270,
        "input_skill": "AI/ML",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "AI Engineer",
            "id": 12,
            "rationale": null,
            "role_archetype": null,
            "slug": "ai-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2611,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Applied Machine Learning Toolkits",
          "id": 94,
          "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
          "slug": "applied-machine-learning-toolkits",
          "source": "db"
        },
        "dimension_id": 94,
        "input_skill": "AI/ML",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Scientist",
            "id": 7,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-scientist",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2611,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Managed ML Platform Workflows",
          "id": 367,
          "rationale": "Cloud ML platforms for building and operating models end-to-end, including notebooks, experiments, managed training jobs, and pipeline/studio workflows. Examples: SageMaker Studio, SageMaker notebooks, SageMaker Pipelines, managed training jobs.",
          "slug": "d_split_01_01",
          "source": "db"
        },
        "dimension_id": 367,
        "input_skill": "Amazon SageMaker",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2612,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Managed Model Hosting and Endpoints",
          "id": 368,
          "rationale": "Cloud-managed services for deploying trained models as online or batch inference endpoints, including endpoint provisioning, batch transform, and rollout coordination. Examples: SageMaker endpoints, SageMaker batch transform.",
          "slug": "d_split_01_02",
          "source": "db"
        },
        "dimension_id": 368,
        "input_skill": "Amazon SageMaker",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2612,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Model Serving Deployment and Runtime Packaging",
          "id": 52,
          "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
          "slug": "model-serving-deployment-and-runtime-packaging",
          "source": "db"
        },
        "dimension_id": 52,
        "input_skill": "Amazon SageMaker",
        "llm_role": null,
        "matched_chosen_role": true,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension saved",
        "role_dimension_saved": true,
        "roles_from_db": [
          {
            "display_name": "MLOps Engineer",
            "id": 5,
            "rationale": null,
            "role_archetype": null,
            "slug": "mlops-engineer",
            "source": "db"
          },
          {
            "display_name": "Machine Learning Engineer",
            "id": 10,
            "rationale": null,
            "role_archetype": null,
            "slug": "machine-learning-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2612,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Model Runtime Services",
          "id": 121,
          "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
          "slug": "cloud-model-runtime-services",
          "source": "db"
        },
        "dimension_id": 121,
        "input_skill": "Amazon Bedrock",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Machine Learning Engineer",
            "id": 10,
            "rationale": null,
            "role_archetype": null,
            "slug": "machine-learning-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2613,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Data Platform Services",
          "id": 81,
          "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
          "slug": "cloud-data-platform-services",
          "source": "db"
        },
        "dimension_id": 81,
        "input_skill": "AWS Lambda",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2614,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Version Control Systems",
          "id": 365,
          "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
          "slug": "d_init_01",
          "source": "db"
        },
        "dimension_id": 365,
        "input_skill": "ECS",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2615,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Model Runtime Services",
          "id": 121,
          "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
          "slug": "cloud-model-runtime-services",
          "source": "db"
        },
        "dimension_id": 121,
        "input_skill": "EKS",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Machine Learning Engineer",
            "id": 10,
            "rationale": null,
            "role_archetype": null,
            "slug": "machine-learning-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 725,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Orchestration Platforms",
          "id": 25,
          "rationale": "Operates the platforms that schedule and run containerized workloads and related deployment primitives. This is separate from image delivery because it concerns runtime placement and service rollout behavior.",
          "slug": "orchestration-platforms",
          "source": "db"
        },
        "dimension_id": 25,
        "input_skill": "EKS",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cloud Engineer",
            "id": 18,
            "rationale": null,
            "role_archetype": null,
            "slug": "cloud-engineer",
            "source": "db"
          },
          {
            "display_name": "DevOps Engineer",
            "id": 1,
            "rationale": null,
            "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
            "slug": "devops-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 725,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Provider Core Services",
          "id": 290,
          "rationale": "Core managed services used to provision and operate cloud environments. This is the base cloud surface for compute, storage, networking, and platform primitives the role configures and maintains.",
          "slug": "cloud-provider-core-services",
          "source": "db"
        },
        "dimension_id": 290,
        "input_skill": "EC2",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cloud Engineer",
            "id": 18,
            "rationale": null,
            "role_archetype": null,
            "slug": "cloud-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 1773,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Data Platform Services",
          "id": 81,
          "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
          "slug": "cloud-data-platform-services",
          "source": "db"
        },
        "dimension_id": 81,
        "input_skill": "AWS Glue",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 466,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Data Platform Security and Networking",
          "id": 369,
          "rationale": "Identity, access, secrets, and networking primitives used to support cloud data platforms and pipelines.",
          "slug": "d_split_01_04",
          "source": "db"
        },
        "dimension_id": 369,
        "input_skill": "Amazon Athena",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2616,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Data Platform Services",
          "id": 81,
          "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
          "slug": "cloud-data-platform-services",
          "source": "db"
        },
        "dimension_id": 81,
        "input_skill": "Amazon Athena",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2616,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Managed ML Platform Workflows",
          "id": 367,
          "rationale": "Cloud ML platforms for building and operating models end-to-end, including notebooks, experiments, managed training jobs, and pipeline/studio workflows. Examples: SageMaker Studio, SageMaker notebooks, SageMaker Pipelines, managed training jobs.",
          "slug": "d_split_01_01",
          "source": "db"
        },
        "dimension_id": 367,
        "input_skill": "Amazon Athena",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2616,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Data Warehousing Platforms",
          "id": 72,
          "rationale": "Cloud and on-prem analytical storage systems used to persist curated datasets and serve downstream consumers. This cluster is about the warehouse/lakehouse layer where transformed data is organized for access.",
          "slug": "data-warehousing-platforms",
          "source": "db"
        },
        "dimension_id": 72,
        "input_skill": "Redshift",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2570,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Data Platform Services",
          "id": 81,
          "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
          "slug": "cloud-data-platform-services",
          "source": "db"
        },
        "dimension_id": 81,
        "input_skill": "AWS Data Pipeline",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2617,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Storage Provisioning and Automation",
          "id": 311,
          "rationale": "Covers the scripts, APIs, and operational workflows used to create, resize, map, and retire storage resources. This cluster is coherent because storage engineers often automate repetitive provisioning and maintenance tasks.",
          "slug": "storage-provisioning-and-automation",
          "source": "db"
        },
        "dimension_id": 311,
        "input_skill": "S3",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Storage Engineer",
            "id": 22,
            "rationale": null,
            "role_archetype": null,
            "slug": "storage-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2618,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Streaming Data Processing",
          "id": 69,
          "rationale": "Tools and patterns for ingesting and transforming event streams with low latency. This cluster covers continuous processing, windowing, and stateful stream jobs used to keep data fresh.",
          "slug": "streaming-data-processing",
          "source": "db"
        },
        "dimension_id": 69,
        "input_skill": "Kinesis",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2619,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Web Service Frameworks",
          "id": 141,
          "rationale": "Server frameworks used to build HTTP APIs, route requests, validate inputs, and structure backend application code. This cluster is coherent because it defines how backend services expose behavior to clients and other services.",
          "slug": "web-service-frameworks",
          "source": "db"
        },
        "dimension_id": 141,
        "input_skill": "Amazon API Gateway",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Backend Engineer",
            "id": 14,
            "rationale": null,
            "role_archetype": null,
            "slug": "backend-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2620,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Analytical Programming Languages",
          "id": 82,
          "rationale": "Languages used to clean, transform, analyze, and prototype models in notebooks and scripts. This is the core coding surface for expressing statistical logic and data manipulation in a reproducible way.",
          "slug": "analytical-programming-languages",
          "source": "db"
        },
        "dimension_id": 82,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Analyst",
            "id": 20,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-analyst",
            "source": "db"
          },
          {
            "display_name": "Data Scientist",
            "id": 7,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-scientist",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Automation Scripting and CLI",
          "id": 48,
          "rationale": "Uses scripts and command-line tooling to execute repeatable Azure operations and reduce manual work. This is a practical cluster because the role frequently automates provisioning, checks, and remediation tasks.",
          "slug": "automation-scripting-and-cli",
          "source": "db"
        },
        "dimension_id": 48,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Azure Cloud Engineer",
            "id": 4,
            "rationale": null,
            "role_archetype": null,
            "slug": "azure-cloud-engineer",
            "source": "db"
          },
          {
            "display_name": "Cloud Engineer",
            "id": 18,
            "rationale": null,
            "role_archetype": null,
            "slug": "cloud-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Automation and Scripting for Operations",
          "id": 361,
          "rationale": "Scripts and lightweight automation used to execute repetitive virtualization tasks and enforce operational consistency. This is the practical glue that reduces manual host and VM administration.",
          "slug": "automation-and-scripting-for-operations",
          "source": "db"
        },
        "dimension_id": 361,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Virtualization Engineer",
            "id": 26,
            "rationale": null,
            "role_archetype": null,
            "slug": "virtualization-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Network Automation and Scripting",
          "id": 285,
          "rationale": "Covers scripts and automation used to configure, validate, and audit network devices and services. This cluster is coherent because repeatable network operations increasingly depend on programmatic changes and checks.",
          "slug": "network-automation-and-scripting",
          "source": "db"
        },
        "dimension_id": 285,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Network Engineer",
            "id": 21,
            "rationale": null,
            "role_archetype": null,
            "slug": "network-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for AI Workflows",
          "id": 261,
          "rationale": "Languages used to implement AI feature logic, orchestration, and response handling inside product code. This is the core coding surface for turning prompts and model calls into reliable application behavior.",
          "slug": "programming-languages-for-ai-workflows",
          "source": "db"
        },
        "dimension_id": 261,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "AI Engineer",
            "id": 12,
            "rationale": null,
            "role_archetype": null,
            "slug": "ai-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for Backend Systems",
          "id": 140,
          "rationale": "Languages used to implement server-side business logic, request handlers, workers, and service integrations. This is the core coding surface for backend feature delivery and maintenance.",
          "slug": "programming-languages-for-backend-systems",
          "source": "db"
        },
        "dimension_id": 140,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Backend Engineer",
            "id": 14,
            "rationale": null,
            "role_archetype": null,
            "slug": "backend-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for Data Work",
          "id": 67,
          "rationale": "Languages used to implement data pipelines, transformations, and operational utilities. This is the code layer for expressing extraction, parsing, validation, and orchestration logic in data engineering workflows.",
          "slug": "programming-languages-for-data-work",
          "source": "db"
        },
        "dimension_id": 67,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for ML Systems",
          "id": 113,
          "rationale": "Languages used to implement model integration code, inference services, and feature-processing logic. This is the core coding surface for turning trained models into product-facing software components.",
          "slug": "programming-languages-for-ml-systems",
          "source": "db"
        },
        "dimension_id": 113,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Machine Learning Engineer",
            "id": 10,
            "rationale": null,
            "role_archetype": null,
            "slug": "machine-learning-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for Security Work",
          "id": 328,
          "rationale": "Languages used to automate security tasks, write detection logic, and build analysis or remediation tooling. This is the core coding surface for a cybersecurity engineer across scripts, queries, and small utilities.",
          "slug": "programming-languages-for-security-work",
          "source": "db"
        },
        "dimension_id": 328,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cybersecurity Engineer",
            "id": 9,
            "rationale": null,
            "role_archetype": null,
            "slug": "cybersecurity-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Programming Languages for Test Automation",
          "id": 193,
          "rationale": "Languages used to implement automated checks, helper utilities, and test harness code. This is the core coding surface for turning test ideas into maintainable automation.",
          "slug": "programming-languages-for-test-automation",
          "source": "db"
        },
        "dimension_id": 193,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Automation Tester",
            "id": 16,
            "rationale": null,
            "role_archetype": null,
            "slug": "automation-tester",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Security Automation and Scripting",
          "id": 258,
          "rationale": "Automating repeatable security checks, enrichment, and remediation workflows. This cluster is coherent because the role often needs lightweight automation to scale analysis and response.",
          "slug": "security-automation-and-scripting",
          "source": "db"
        },
        "dimension_id": 258,
        "input_skill": "Python",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cybersecurity Engineer",
            "id": 9,
            "rationale": null,
            "role_archetype": null,
            "slug": "cybersecurity-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 393,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Applied Machine Learning Toolkits",
          "id": 94,
          "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
          "slug": "applied-machine-learning-toolkits",
          "source": "db"
        },
        "dimension_id": 94,
        "input_skill": "TensorFlow",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Scientist",
            "id": 7,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-scientist",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 558,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Applied Machine Learning Toolkits",
          "id": 94,
          "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
          "slug": "applied-machine-learning-toolkits",
          "source": "db"
        },
        "dimension_id": 94,
        "input_skill": "PyTorch",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Scientist",
            "id": 7,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-scientist",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 557,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Applied Machine Learning Toolkits",
          "id": 94,
          "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
          "slug": "applied-machine-learning-toolkits",
          "source": "db"
        },
        "dimension_id": 94,
        "input_skill": "Scikit-learn",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Scientist",
            "id": 7,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-scientist",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 554,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Continuous Integration Test Integration",
          "id": 207,
          "rationale": "Integrating automated checks into shared build and merge workflows so results are repeatable and visible. This cluster is coherent because automation testers commonly configure test execution triggers, artifacts, and reporting hooks.",
          "slug": "continuous-integration-test-integration",
          "source": "db"
        },
        "dimension_id": 207,
        "input_skill": "GitHub Actions",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Automation Tester",
            "id": 16,
            "rationale": null,
            "role_archetype": null,
            "slug": "automation-tester",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 1250,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Workflow Orchestration Systems",
          "id": 64,
          "rationale": "Operational orchestration of ML jobs, dependencies, and handoffs across training, validation, deployment, and retraining. This is a useful split from training pipelines because it emphasizes the scheduler and control plane.",
          "slug": "workflow-orchestration-systems",
          "source": "db"
        },
        "dimension_id": 64,
        "input_skill": "Airflow",
        "llm_role": null,
        "matched_chosen_role": true,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension saved",
        "role_dimension_saved": true,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          },
          {
            "display_name": "MLOps Engineer",
            "id": 5,
            "rationale": null,
            "role_archetype": null,
            "slug": "mlops-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 325,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Infrastructure Provisioning Templates",
          "id": 291,
          "rationale": "Declarative templates and modules used to create repeatable cloud resources and environments. This cluster covers the infrastructure definitions the role applies, reviews, and updates to keep environments consistent.",
          "slug": "infrastructure-provisioning-templates",
          "source": "db"
        },
        "dimension_id": 291,
        "input_skill": "Terraform",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cloud Engineer",
            "id": 18,
            "rationale": null,
            "role_archetype": null,
            "slug": "cloud-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 144,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Infrastructure as Code",
          "id": 22,
          "rationale": "Defines infrastructure and platform resources through versioned code so environments are repeatable and reviewable. This is a coherent cluster because it underpins environment consistency and change control.",
          "slug": "infrastructure-as-code",
          "source": "db"
        },
        "dimension_id": 22,
        "input_skill": "Terraform",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "DevOps Engineer",
            "id": 1,
            "rationale": null,
            "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
            "slug": "devops-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 144,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Infrastructure as Code and Declarative Provisioning",
          "id": 36,
          "rationale": "Defines cloud and platform infrastructure declaratively through versioned code so environments are repeatable, reviewable, and automatable. This includes authoring and maintaining IaC templates/modules, managing parameters and state, and using plan/apply workflows to provision and update resources across Azure and other cloud platforms.",
          "slug": "infrastructure-as-code-and-declarative-provisioning",
          "source": "db"
        },
        "dimension_id": 36,
        "input_skill": "Terraform",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Azure Cloud Engineer",
            "id": 4,
            "rationale": null,
            "role_archetype": null,
            "slug": "azure-cloud-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 144,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Containerization and Image Delivery",
          "id": 24,
          "rationale": "Builds, packages, and ships application and support workloads as container images. This cluster covers the artifact format and the mechanics of producing deployable images.",
          "slug": "containerization-and-image-delivery",
          "source": "db"
        },
        "dimension_id": 24,
        "input_skill": "Docker",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "DevOps Engineer",
            "id": 1,
            "rationale": null,
            "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
            "slug": "devops-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 153,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Model Serving Deployment and Runtime Packaging",
          "id": 52,
          "rationale": "Operational deployment of trained models into online, batch, or streaming serving environments, including packaging models and model servers into containers or managed inference runtimes, coordinating rollout, and handing off to inference systems. Covers serving frameworks and platforms such as TensorFlow Serving, TorchServe, Triton Inference Server, BentoML, KServe, and Seldon Core, plus container/runtime concerns like Docker images, GPU-enabled containers, base image selection, container entrypoints, runtime dependencies, and image scanning for model services.",
          "slug": "model-serving-deployment-and-runtime-packaging",
          "source": "db"
        },
        "dimension_id": 52,
        "input_skill": "Docker",
        "llm_role": null,
        "matched_chosen_role": true,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension saved",
        "role_dimension_saved": true,
        "roles_from_db": [
          {
            "display_name": "MLOps Engineer",
            "id": 5,
            "rationale": null,
            "role_archetype": null,
            "slug": "mlops-engineer",
            "source": "db"
          },
          {
            "display_name": "Machine Learning Engineer",
            "id": 10,
            "rationale": null,
            "role_archetype": null,
            "slug": "machine-learning-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 153,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Orchestration Platforms",
          "id": 25,
          "rationale": "Operates the platforms that schedule and run containerized workloads and related deployment primitives. This is separate from image delivery because it concerns runtime placement and service rollout behavior.",
          "slug": "orchestration-platforms",
          "source": "db"
        },
        "dimension_id": 25,
        "input_skill": "Kubernetes",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Cloud Engineer",
            "id": 18,
            "rationale": null,
            "role_archetype": null,
            "slug": "cloud-engineer",
            "source": "db"
          },
          {
            "display_name": "DevOps Engineer",
            "id": 1,
            "rationale": null,
            "role_archetype": "A DevOps Engineer enables reliable, repeatable delivery of software by designing and operating the processes that connect development and production. They focus on improving deployment flow, operational stability, and collaboration between teams through automation, standardization, and monitoring of delivery and runtime practices.",
            "slug": "devops-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 158,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Model Runtime Services",
          "id": 121,
          "rationale": "Consumer-level use of cloud services that host or support model inference applications. This cluster is coherent because MLEs often deploy and tune services on managed cloud compute, networking, and storage primitives.",
          "slug": "cloud-model-runtime-services",
          "source": "db"
        },
        "dimension_id": 121,
        "input_skill": "Pinecone",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Machine Learning Engineer",
            "id": 10,
            "rationale": null,
            "role_archetype": null,
            "slug": "machine-learning-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2621,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Cloud Data Platform Services",
          "id": 81,
          "rationale": "Consumer-level use of cloud services that support data engineering workloads. This includes managed compute, storage, networking-adjacent services, and security primitives used to run pipelines and data platforms.",
          "slug": "cloud-data-platform-services",
          "source": "db"
        },
        "dimension_id": 81,
        "input_skill": "OpenSearch",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Engineer",
            "id": 6,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-engineer",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2622,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Version Control Systems",
          "id": 365,
          "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
          "slug": "d_init_01",
          "source": "db"
        },
        "dimension_id": 365,
        "input_skill": "OpenSearch",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2622,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Applied Machine Learning Toolkits",
          "id": 94,
          "rationale": "Libraries and frameworks used to prototype and compare models quickly. This dimension captures the concrete tooling layer beneath modeling methods and evaluation.",
          "slug": "applied-machine-learning-toolkits",
          "source": "db"
        },
        "dimension_id": 94,
        "input_skill": "FAISS",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [
          {
            "display_name": "Data Scientist",
            "id": 7,
            "rationale": null,
            "role_archetype": null,
            "slug": "data-scientist",
            "source": "db"
          }
        ],
        "skill_dimension_saved": true,
        "skill_id": 2623,
        "skill_tag": "in_db",
        "skipped_reason": null
      },
      {
        "chosen_role_id": 5,
        "dimension": {
          "difficulty_hint": "well_known",
          "display_name": "Version Control Systems",
          "id": 365,
          "rationale": "Tools and workflows for tracking source changes, branching, merging, and collaborating on code history. Git belongs here because it is the canonical distributed version control system used to manage revisions and coordinate team development.",
          "slug": "d_init_01",
          "source": "db"
        },
        "dimension_id": 365,
        "input_skill": "FAISS",
        "llm_role": null,
        "matched_chosen_role": false,
        "outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
        "role_dimension_saved": false,
        "roles_from_db": [],
        "skill_dimension_saved": true,
        "skill_id": 2623,
        "skill_tag": "in_db",
        "skipped_reason": null
      }
    ],
    "new_skills_created": 0,
    "role_dimension_saved": 0,
    "skill_dimension_saved": 0,
    "skipped": 0
  },
  "planner_output": null,
  "run_id": "832dda01-52e1-4739-b122-0a9cf9bc8100"
}

LLM Calls

Every model call made for this run, in pipeline order. Click a card to see the model's response.

Loading…