Pipeline run
48b52573-a42b-4e34-9481-72d73c5404f8
Client output enrichment
v2 Skill cluster · Nature of work · AI index · Tech stack maturity · Evidence · KRA descriptionvocab breakdown (legacy)
Signals
Post-classification
Captured for admin review
1 POST /skills/extract-from-jd
2 POST /skills/extract-details
3 POST /skills/final-role-output
ML Engineer
CASE Aslug: ml-engineer · id: 3 · source: db
The role aligns with primary skills focused on Python and various ML frameworks.
Resolution:
in_db
— role exists in library; skill↔dim and role↔dim links saved when applicable.
Job description
Job Title: Senior Machine Learning Engineer About the role: We are looking for a Senior ML Engineer to own the lifecycle of production machine-learning systems at our platform. You will work with applied scientists to take research models from notebook to production, build retrieval-augmented generation (RAG) pipelines on top of our internal document corpus, and own MLOps infrastructure end-to-end. Responsibilities: - Design and deploy production-grade ML services serving 50M+ predictions/day. - Build feature stores, training pipelines, and online inference infrastructure with PyTorch and Hugging Face Transformers. - Lead our LLM efforts: fine-tune open-source models (Llama 3, Mistral) and integrate with our RAG retrieval layer (FAISS, pgvector embeddings). - Set up evaluation harnesses, A/B testing frameworks, and model monitoring/drift detection. - Mentor junior ML engineers on best practices for reproducible experiments, model versioning (MLflow), and CI/CD for ML. Requirements: - 5+ years of hands-on ML engineering experience, with at least 2 years productionising deep-learning models. - Strong Python; production code with PyTorch, TensorFlow, or JAX. - Experience with vector databases (Pinecone, Weaviate, pgvector) and embedding models (text-embedding-3, BGE, etc.). - LLM serving experience: vLLM, TGI, or Triton. - Familiar with MLOps tooling (MLflow, Kubeflow, Weights & Biases) and orchestration (Airflow, Prefect). - Comfort with Kubernetes for ML workloads; experience with GPU scheduling preferred. Nice to have: - Open-source contributions to ML frameworks or popular LLM tooling. - Experience with retrieval-augmented generation, sparse retrieval (BM25), and hybrid search. - Familiarity with vector index tuning, quantization (INT8/FP16), and inference optimisation. Location: Bengaluru (Hybrid) | Comp: competitive.
Skills from this JD
Each row merges API 1 extraction, API 2 library match / v3 orchestration (dimensions + locked dims), and API 3 persistence tags.
Aliases — catalog
- Python (CANONICAL) primary
- Python 2 (VERSION)
- Python 2.x (VERSION)
- Python 3 (VERSION)
- Python 3.10 (VERSION)
- Python 3.11 (VERSION)
- Python 3.12 (VERSION)
- Python 3.x (VERSION)
- py (VERSION)
- py2 (VERSION)
- py3 (VERSION)
- python 3 (VERSION)
- python 3.x (VERSION)
- python2 (VERSION)
- python3 (VERSION)
- python3.x (VERSION)
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Language
- Sub-category
- Programming Language
- Vendor
- PSF
- License
- mit
- Year introduced
- 1991
- Confidence
- 0.99
- Version strategy
- SEPARATE_ENTITY
- Version tag
- 3
Maturity reasoning: Python appears in a very high volume of job descriptions across data, backend, automation, and ML roles, and remains a default hiring-pipeline language on major job boards and tech stacks.
Skill profile (library / DB)
- Skill nature
- LANGUAGE
- Volatility
- STABLE
- Typical lifespan
- EVERGREEN
- Category id
- 6
- Sub-category id
- 96
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
Cloud Security Scripting & DSL Languages Catalog dimension db id 248
Library dimension (catalog)
Roles linked in library: Cloud Security Engineer
-
Programming Languages Catalog dimension db id 1
Library dimension (catalog)
Roles linked in library: Backend Developer, Fullstack Developer
-
Programming Languages and Scripting Catalog dimension db id 59
Library dimension (catalog)
Roles linked in library: Cybersecurity Engineer
-
Programming Languages for Data Work Catalog dimension db id 21
Library dimension (catalog)
Roles linked in library: Data Engineer
-
Programming Languages for ML Systems Catalog dimension db id 39
Library dimension (catalog)
Roles linked in library: ML Engineer, ML Ops Engineer
-
Programming Languages for XR Catalog dimension db id 97
Library dimension (catalog)
Roles linked in library: AR/VR Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
Cloud Security Scripting & DSL Languages
cloud-security-scripting-dsl-languages
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
|
Programming Languages
programming-languages
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
|
Programming Languages and Scripting
programming-languages-and-scripting
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
|
Programming Languages for Data Work
programming-languages-for-data-work
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
|
Programming Languages for ML Systems
programming-languages-for-ml-systems
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
|
Programming Languages for XR
programming-languages-for-xr
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Aliases — catalog
- PyTorch (CANONICAL) primary
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Library
- Sub-category
- Machine Learning Library
- Vendor
- Meta
- License
- bsd
- Year introduced
- 2016
- Confidence
- 0.90
- Version strategy
- NOT_APPLICABLE
Maturity reasoning: PyTorch appears in a large volume of ML/AI job descriptions and is a standard framework in research and production, alongside TensorFlow and CUDA ecosystems.
Skill profile (library / DB)
- Skill nature
- LIBRARY
- Volatility
- STABLE
- Typical lifespan
- EVERGREEN
- Category id
- 7
- Sub-category id
- 156
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
ML Frameworks and Libraries Catalog dimension db id 40
Library dimension (catalog)
Roles linked in library: ML Engineer, ML Ops Engineer
-
Model Fine-Tuning & Adaptation Catalog dimension db id 212
Library dimension (catalog)
Roles linked in library: AI Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
ML Frameworks and Libraries
ml-frameworks-and-libraries
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
|
Model Fine-Tuning & Adaptation
model-fine-tuning-adaptation
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category
- Machine Learning Frameworks
- Sub-category
- general
- Skill nature
- TOOL
- Volatility
- MEDIUM
- Typical lifespan
- MULTI_YEAR
- Version strategy
- UNVERSIONED
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category
- Machine Learning Frameworks
- Sub-category
- general
- Skill nature
- TOOL
- Volatility
- FAST
- Typical lifespan
- SHORT_LIVED
- Version strategy
- VERSIONED
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category
- Machine Learning Frameworks
- Sub-category
- general
- Skill nature
- TOOL
- Volatility
- FAST
- Typical lifespan
- SHORT_LIVED
- Version strategy
- VERSIONED
Aliases — catalog
- FAISS (CANONICAL)
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Library
- Sub-category
- Vector Search Library
- Vendor
- License
- apache_2
- Year introduced
- 2017
- Confidence
- 0.93
- Version strategy
- NOT_APPLICABLE
Maturity reasoning: FAISS appears in growing number of AI/vector-search job descriptions and is widely used in RAG stacks, but it is still less universal than PostgreSQL or Kubernetes and often paired with managed vector DBs.
Skill profile (library / DB)
- Skill nature
- LIBRARY
- Volatility
- EMERGING
- Typical lifespan
- EVERGREEN
- Category id
- 7
- Sub-category id
- 894
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
ML Frameworks and Libraries Catalog dimension db id 40
Library dimension (catalog)
Roles linked in library: ML Engineer, ML Ops Engineer
-
React Frontend Development Catalog dimension db id 96
Library dimension (catalog)
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
ML Frameworks and Libraries
ml-frameworks-and-libraries
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
|
React Frontend Development
d_init_01
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Aliases — catalog
- pgvector (CANONICAL) primary
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Library
- Sub-category
- Database Extension Library
- Vendor
- ZomboDB
- License
- mit
- Year introduced
- 2021
- Confidence
- 0.90
- Version strategy
- NOT_APPLICABLE
Maturity reasoning: Appears in growing numbers of JDs for AI search/RAG roles, but remains a PostgreSQL extension rather than a universal database skill; GitHub adoption is rising yet still far below core DB tech.
Skill profile (library / DB)
- Skill nature
- LIBRARY
- Volatility
- EMERGING
- Typical lifespan
- EVERGREEN
- Category id
- 7
- Sub-category id
- 972
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
Vector Databases Catalog dimension db id 198
Library dimension (catalog)
Roles linked in library: AI Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
Vector Databases
vector-databases
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Aliases — catalog
- A/B Testing (CANONICAL)
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Methodology
- Sub-category
- Experiment Design Methodology
- Confidence
- 0.97
- Version strategy
- NOT_APPLICABLE
Maturity reasoning: Commonly listed in product, growth, and analytics job descriptions; major platforms like Optimizely and Google Optimize popularized it, and it remains a standard experimentation practice across SaaS and e-commerce.
Skill profile (library / DB)
- Skill nature
- METHODOLOGY
- Volatility
- STABLE
- Typical lifespan
- EVERGREEN
- Category id
- 8
- Sub-category id
- 1214
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
React Frontend Development Catalog dimension db id 96
Library dimension (catalog)
-
Systems Programming Catalog dimension db id 166
Library dimension (catalog)
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
React Frontend Development
d_init_01
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
|
Systems Programming
d_init_02
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Aliases — catalog
- MLflow (CANONICAL) primary
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Tool
- Sub-category
- Mlops Tool
- Vendor
- Databricks
- License
- apache_2
- Year introduced
- 2018
- Confidence
- 0.90
- Version strategy
- NOT_APPLICABLE
Maturity reasoning: MLflow appears frequently in MLOps job descriptions and is a standard open-source model tracking/registry tool; Databricks continues to invest in it, signaling broad adoption rather than niche use.
Skill profile (library / DB)
- Skill nature
- TOOL
- Volatility
- STABLE
- Typical lifespan
- EVERGREEN
- Category id
- 13
- Sub-category id
- 1168
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
MLOps Platforms and Lifecycle Catalog dimension db id 43
Library dimension (catalog)
Roles linked in library: ML Engineer, ML Ops Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
MLOps Platforms and Lifecycle
mlops-platforms-and-lifecycle
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Aliases — catalog
- TensorFlow (CANONICAL) primary
- TF1 (VERSION)
- TF2 (VERSION)
- TensorFlow 1 (VERSION)
- TensorFlow 1.x (VERSION)
- TensorFlow 2 (VERSION)
- TensorFlow 2.x (VERSION)
- tensorflow 1 (VERSION)
- tensorflow 1.x (VERSION)
- tensorflow 2 (VERSION)
- tensorflow 2.x (VERSION)
- tensorflow v1 (VERSION)
- tensorflow v2 (VERSION)
- tf (VERSION)
- tf1 (VERSION)
- tf2 (VERSION)
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Library
- Sub-category
- Machine Learning Library
- Vendor
- License
- apache_2
- Year introduced
- 2015
- Confidence
- 0.90
- Version strategy
- SEPARATE_ENTITY
- Version tag
- 2.x
Maturity reasoning: TensorFlow appears in many ML/AI job descriptions and remains a standard production framework, with strong GitHub activity and broad vendor support from Google and cloud platforms.
Skill profile (library / DB)
- Skill nature
- LIBRARY
- Volatility
- STABLE
- Typical lifespan
- EVERGREEN
- Category id
- 7
- Sub-category id
- 156
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
ML Frameworks and Libraries Catalog dimension db id 40
Library dimension (catalog)
Roles linked in library: ML Engineer, ML Ops Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
ML Frameworks and Libraries
ml-frameworks-and-libraries
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Aliases — catalog
- JAX (CANONICAL) primary
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Library
- Sub-category
- Machine Learning Library
- Vendor
- License
- apache_2
- Year introduced
- 2018
- Confidence
- 0.90
- Version strategy
- NOT_APPLICABLE
Maturity reasoning: JAX appears increasingly in ML/AI job postings and research codebases, especially for high-performance training and TPU/GPU workloads, but it is not yet a universal hiring staple like PyTorch or TensorFlow.
Skill profile (library / DB)
- Skill nature
- LIBRARY
- Volatility
- EMERGING
- Typical lifespan
- EVERGREEN
- Category id
- 7
- Sub-category id
- 156
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
ML Frameworks and Libraries Catalog dimension db id 40
Library dimension (catalog)
Roles linked in library: ML Engineer, ML Ops Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
ML Frameworks and Libraries
ml-frameworks-and-libraries
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Aliases — catalog
- Pinecone (CANONICAL) primary
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Platform
- Sub-category
- Vector Database Platform
- Vendor
- Pinecone
- License
- unknown
- Year introduced
- 2021
- Confidence
- 0.95
- Version strategy
- NOT_APPLICABLE
Maturity reasoning: Pinecone appears in a growing number of AI/vector-search job postings and vendor docs, but it is still far from universal compared with PostgreSQL or AWS.
Skill profile (library / DB)
- Skill nature
- PLATFORM
- Volatility
- EMERGING
- Typical lifespan
- EVERGREEN
- Category id
- 9
- Sub-category id
- 177
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
LLM Operations and Orchestration Catalog dimension db id 49
Library dimension (catalog)
Roles linked in library: AI Engineer, ML Engineer, ML Ops Engineer
-
Vector Databases Catalog dimension db id 198
Library dimension (catalog)
Roles linked in library: AI Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
LLM Operations and Orchestration
llm-operations-and-orchestration
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
|
Vector Databases
vector-databases
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Aliases — catalog
- Weaviate (CANONICAL) primary
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Platform
- Sub-category
- Vector Database Platform
- Vendor
- Weaviate
- License
- apache_2
- Year introduced
- 2018
- Confidence
- 0.95
- Version strategy
- NOT_APPLICABLE
Maturity reasoning: Weaviate appears in a growing number of AI/vector-search job postings and vendor docs, but JD volume is still far below PostgreSQL/AWS-level staples; GitHub activity and ecosystem integrations are rising.
Skill profile (library / DB)
- Skill nature
- PLATFORM
- Volatility
- EMERGING
- Typical lifespan
- EVERGREEN
- Category id
- 9
- Sub-category id
- 177
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
Vector Databases Catalog dimension db id 198
Library dimension (catalog)
Roles linked in library: AI Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
Vector Databases
vector-databases
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Aliases — catalog
- text-embedding-3-large (CANONICAL) primary
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Service
- Sub-category
- Embedding Model Service
- Vendor
- OpenAI
- License
- proprietary
- Year introduced
- 2023
- Confidence
- 0.94
- Version strategy
- SEPARATE_ENTITY
- Version tag
- text-embedding-3-large
Maturity reasoning: OpenAI’s text-embedding-3-large is appearing in RAG/vector-search job posts and docs, but JD volume is still far below core skills like Python/AWS and it’s a newer model family rather than a universal standard.
Skill profile (library / DB)
- Skill nature
- CLOUD_SERVICE
- Volatility
- EMERGING
- Typical lifespan
- EVERGREEN
- Category id
- 11
- Sub-category id
- 1005
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
Embedding Models Catalog dimension db id 199
Library dimension (catalog)
Roles linked in library: AI Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
Embedding Models
embedding-models
|
— | — |
Skipped — no persistable v3 meta for new skill
skill_not_in_db_v3_proposed
|
Aliases — catalog
- BGE (CANONICAL) primary
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Library
- Sub-category
- Embedding Model Library
- Vendor
- Hugging Face
- License
- apache_2
- Year introduced
- 2020
- Confidence
- 0.82
- Version strategy
- NOT_APPLICABLE
Maturity reasoning: BGE is increasingly listed in LLM/RAG job descriptions and widely used in GitHub examples for embeddings, but it is not yet a universal hiring staple like AWS or PostgreSQL.
Skill profile (library / DB)
- Skill nature
- LIBRARY
- Volatility
- EMERGING
- Typical lifespan
- EVERGREEN
- Category id
- 7
- Sub-category id
- 973
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
Embedding Models Catalog dimension db id 199
Library dimension (catalog)
Roles linked in library: AI Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
Embedding Models
embedding-models
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Aliases — catalog
- vLLM (CANONICAL) primary
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Tool
- Sub-category
- Llm Inference Server
- Vendor
- vllm
- License
- apache_2
- Year introduced
- 2023
- Confidence
- 0.90
- Version strategy
- NOT_APPLICABLE
Maturity reasoning: vLLM appears in growing numbers of LLM infra job postings and is widely referenced in GitHub/OSS deployments, but it is not yet a universal hiring staple like Kubernetes or AWS.
Skill profile (library / DB)
- Skill nature
- TOOL
- Volatility
- EMERGING
- Typical lifespan
- EVERGREEN
- Category id
- 13
- Sub-category id
- 1167
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
LLM Operations and Orchestration Catalog dimension db id 49
Library dimension (catalog)
Roles linked in library: AI Engineer, ML Engineer, ML Ops Engineer
-
LLM Serving & Deployment Catalog dimension db id 209
Library dimension (catalog)
Roles linked in library: AI Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
LLM Operations and Orchestration
llm-operations-and-orchestration
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
|
LLM Serving & Deployment
llm-serving-deployment
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category
- Machine Learning Frameworks
- Sub-category
- general
- Skill nature
- TOOL
- Volatility
- FAST
- Typical lifespan
- SHORT_LIVED
- Version strategy
- VERSIONED
Aliases — catalog
- Triton (CANONICAL)
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Language
- Sub-category
- Gpu Kernel Language
- Vendor
- NVIDIA
- License
- other_open
- Year introduced
- 2015
- Confidence
- 0.90
- Version strategy
- NOT_APPLICABLE
Maturity reasoning: Triton appears in growing ML systems job postings and is increasingly used in open-source GPU kernel repos, but it is still far from a universal hiring staple like CUDA.
Skill profile (library / DB)
- Skill nature
- LANGUAGE
- Volatility
- EMERGING
- Typical lifespan
- EVERGREEN
- Category id
- 6
- Sub-category id
- 1033
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
Model Optimization and Acceleration Catalog dimension db id 53
Library dimension (catalog)
Roles linked in library: ML Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
Model Optimization and Acceleration
model-optimization-and-acceleration
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Aliases — catalog
- Kubeflow (CANONICAL) primary
- Kubeflow 1.x (VERSION)
- Kubeflow 2.x (VERSION)
- Kubeflow v1 (VERSION)
- Kubeflow v2 (VERSION)
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Framework
- Sub-category
- Mlops Framework
- Vendor
- License
- apache_2
- Year introduced
- 2017
- Confidence
- 0.90
- Version strategy
- NOT_APPLICABLE
Maturity reasoning: Kubeflow appears in some MLOps/ML platform JDs, but far less often than Kubernetes or managed ML platforms; GitHub activity is steady yet adoption remains specialized to ML infrastructure teams.
Skill profile (library / DB)
- Skill nature
- FRAMEWORK
- Volatility
- STABLE
- Typical lifespan
- EVERGREEN
- Category id
- 5
- Sub-category id
- 1127
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
MLOps Platforms and Lifecycle Catalog dimension db id 43
Library dimension (catalog)
Roles linked in library: ML Engineer, ML Ops Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
MLOps Platforms and Lifecycle
mlops-platforms-and-lifecycle
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Aliases — catalog
- Weights & Biases (CANONICAL) primary
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Platform
- Sub-category
- Experiment Tracking Platform
- Vendor
- Weights & Biases, Inc.
- License
- other_open
- Year introduced
- 2018
- Confidence
- 0.97
- Version strategy
- NOT_APPLICABLE
Maturity reasoning: Frequently appears in ML/LLM job descriptions and vendor docs, but is still less universal than core cloud/ML stack tools; GitHub and ecosystem adoption are growing, not yet hiring-pipeline staple.
Skill profile (library / DB)
- Skill nature
- PLATFORM
- Volatility
- EMERGING
- Typical lifespan
- EVERGREEN
- Category id
- 9
- Sub-category id
- 171
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
Experiment Tracking and Evaluation Catalog dimension db id 44
Library dimension (catalog)
Roles linked in library: ML Engineer, ML Ops Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
Experiment Tracking and Evaluation
experiment-tracking-and-evaluation
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Aliases — catalog
- Airflow (CANONICAL) primary
- airflow 2 (VERSION)
- airflow-2 (VERSION)
- airflow2 (VERSION)
- airflow2.x (VERSION)
- apache airflow 2 (VERSION)
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Tool
- Sub-category
- Workflow Orchestration Tool
- Vendor
- Apache Software Foundation
- License
- apache_2
- Year introduced
- 2014
- Confidence
- 0.95
- Version strategy
- SEPARATE_ENTITY
- Version tag
- 2.x
Maturity reasoning: Apache Airflow appears in many data engineering job postings and is a common orchestration choice in production stacks; its GitHub activity and ecosystem remain strong, with no vendor sunset or clear replacement dominating JDs.
Skill profile (library / DB)
- Skill nature
- TOOL
- Volatility
- STABLE
- Typical lifespan
- EVERGREEN
- Category id
- 13
- Sub-category id
- 130
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
Workflow Orchestration for ML Pipelines Catalog dimension db id 54
Library dimension (catalog)
Roles linked in library: ML Engineer, ML Ops Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
Workflow Orchestration for ML Pipelines
workflow-orchestration-for-ml-pipelines
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Aliases — catalog
- Prefect (CANONICAL) primary
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Tool
- Sub-category
- Workflow Orchestration Tool
- Vendor
- Prefect Technologies, Inc.
- License
- apache_2
- Year introduced
- 2018
- Confidence
- 0.95
- Version strategy
- NOT_APPLICABLE
Maturity reasoning: Prefect appears in growing numbers of data/ML workflow orchestration JDs, but Airflow remains the more common baseline; GitHub activity and vendor docs show active adoption rather than sunset.
Skill profile (library / DB)
- Skill nature
- TOOL
- Volatility
- EMERGING
- Typical lifespan
- EVERGREEN
- Category id
- 13
- Sub-category id
- 130
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
Data Pipeline Orchestration Catalog dimension db id 23
Library dimension (catalog)
Roles linked in library: Data Engineer
-
Workflow Orchestration for ML Pipelines Catalog dimension db id 54
Library dimension (catalog)
Roles linked in library: ML Engineer, ML Ops Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
Data Pipeline Orchestration
data-pipeline-orchestration
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
|
Workflow Orchestration for ML Pipelines
workflow-orchestration-for-ml-pipelines
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Aliases — catalog
- Kubernetes (CANONICAL) primary
- Kubernetes 1.0+ (VERSION)
- Kubernetes 1.x (VERSION)
- Kubernetes v1 (VERSION)
- k8s (VERSION)
- kubernetes 1.x (VERSION)
- kubernetes latest (VERSION)
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Platform
- Sub-category
- Container Orchestration Platform
- Vendor
- Cloud Native Computing Foundation
- License
- apache_2
- Year introduced
- 2014
- Confidence
- 0.90
- Version strategy
- SEPARATE_ENTITY
- Version tag
- 1.30
Maturity reasoning: Broadly adopted in cloud-native stacks; Kubernetes appears in a large share of DevOps/SRE job descriptions and is the default orchestration platform across major cloud vendors.
Skill profile (library / DB)
- Skill nature
- PLATFORM
- Volatility
- STABLE
- Typical lifespan
- EVERGREEN
- Category id
- 9
- Sub-category id
- 557
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
Container Orchestration Platforms Catalog dimension db id 134
Library dimension (catalog)
Roles linked in library: Cloud Architect, DevOps Engineer
-
Kubernetes for ML Workloads Catalog dimension db id 47
Library dimension (catalog)
Roles linked in library: ML Engineer, ML Ops Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
Container Orchestration Platforms
container-orchestration-platforms
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
|
Kubernetes for ML Workloads
kubernetes-for-ml-workloads
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category
- Infrastructure Tools
- Sub-category
- general
- Skill nature
- CONCEPT
- Volatility
- MEDIUM
- Typical lifespan
- MULTI_YEAR
- Version strategy
- UNVERSIONED
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category
- Search Algorithms
- Sub-category
- general
- Skill nature
- CONCEPT
- Volatility
- STABLE
- Typical lifespan
- EVERGREEN
- Version strategy
- UNVERSIONED
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category
- Data Types
- Sub-category
- general
- Skill nature
- CONCEPT
- Volatility
- MEDIUM
- Typical lifespan
- MULTI_YEAR
- Version strategy
- UNVERSIONED
Skill enrichment (orchestrator / LLM)
No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).
- Category
- Data Types
- Sub-category
- general
- Skill nature
- CONCEPT
- Volatility
- MEDIUM
- Typical lifespan
- MULTI_YEAR
- Version strategy
- UNVERSIONED
Aliases — catalog
- CI/CD (CANONICAL)
Context tags (catalog)
Stored enrichment (catalog DB)
- Category
- Methodology
- Sub-category
- Ci Cd Process
- Confidence
- 0.93
- Version strategy
- NOT_APPLICABLE
Maturity reasoning: CI/CD appears in a large share of software engineering JDs and is a standard requirement across DevOps, platform, and backend roles; major vendors like GitHub, GitLab, and AWS all center product roadmaps on CI/CD pipelines.
Skill profile (library / DB)
- Skill nature
- METHODOLOGY
- Volatility
- STABLE
- Typical lifespan
- EVERGREEN
- Category id
- 8
- Sub-category id
- 900
- Extractable
- True
- Also category
- False
Dimensions (API 2 worklist)
-
CI/CD Pipeline Platforms Catalog dimension db id 150
Library dimension (catalog)
Roles linked in library: DevOps Engineer
-
CI/CD for Machine Learning Catalog dimension db id 56
Library dimension (catalog)
Roles linked in library: ML Engineer
API 3 link attempts (this skill)
| Dimension | Skill↔dim | Role↔dim | Outcome |
|---|---|---|---|
|
CI/CD Pipeline Platforms
ci-cd-pipeline-platforms
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
|
CI/CD for Machine Learning
ci-cd-for-machine-learning
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
All API 3 persistence rows
Same grid as the skill-extractor “Persistence items” table: one row per (skill × dimension) work item.
| Skill | Tag | Dimension | Skill↔dim | Role↔dim | Outcome | Notes |
|---|---|---|---|---|---|---|
| Python | in_db |
Cloud Security Scripting & DSL Languages
cloud-security-scripting-dsl-languages
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| Python | in_db |
Programming Languages
programming-languages
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| Python | in_db |
Programming Languages and Scripting
programming-languages-and-scripting
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| Python | in_db |
Programming Languages for Data Work
programming-languages-for-data-work
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| Python | in_db |
Programming Languages for ML Systems
programming-languages-for-ml-systems
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| Python | in_db |
Programming Languages for XR
programming-languages-for-xr
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| PyTorch | in_db |
ML Frameworks and Libraries
ml-frameworks-and-libraries
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| PyTorch | in_db |
Model Fine-Tuning & Adaptation
model-fine-tuning-adaptation
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| FAISS | in_db |
ML Frameworks and Libraries
ml-frameworks-and-libraries
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| FAISS | in_db |
React Frontend Development
d_init_01
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| pgvector | in_db |
Vector Databases
vector-databases
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| A/B Testing | in_db |
React Frontend Development
d_init_01
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| A/B Testing | in_db |
Systems Programming
d_init_02
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| MLflow | in_db |
MLOps Platforms and Lifecycle
mlops-platforms-and-lifecycle
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| TensorFlow | in_db |
ML Frameworks and Libraries
ml-frameworks-and-libraries
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| JAX | in_db |
ML Frameworks and Libraries
ml-frameworks-and-libraries
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| Pinecone | in_db |
LLM Operations and Orchestration
llm-operations-and-orchestration
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| Pinecone | in_db |
Vector Databases
vector-databases
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| Weaviate | in_db |
Vector Databases
vector-databases
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| text-embedding-3 | new |
Embedding Models
embedding-models
|
— | — | Skipped — no persistable v3 meta for new skill | skill_not_in_db_v3_proposed |
| BGE | in_db |
Embedding Models
embedding-models
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| vLLM | in_db |
LLM Operations and Orchestration
llm-operations-and-orchestration
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| vLLM | in_db |
LLM Serving & Deployment
llm-serving-deployment
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| Triton | in_db |
Model Optimization and Acceleration
model-optimization-and-acceleration
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| Kubeflow | in_db |
MLOps Platforms and Lifecycle
mlops-platforms-and-lifecycle
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| Weights & Biases | in_db |
Experiment Tracking and Evaluation
experiment-tracking-and-evaluation
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| Airflow | in_db |
Workflow Orchestration for ML Pipelines
workflow-orchestration-for-ml-pipelines
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| Prefect | in_db |
Data Pipeline Orchestration
data-pipeline-orchestration
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| Prefect | in_db |
Workflow Orchestration for ML Pipelines
workflow-orchestration-for-ml-pipelines
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| Kubernetes | in_db |
Container Orchestration Platforms
container-orchestration-platforms
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| Kubernetes | in_db |
Kubernetes for ML Workloads
kubernetes-for-ml-workloads
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| CI/CD | in_db |
CI/CD Pipeline Platforms
ci-cd-pipeline-platforms
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) | |
| CI/CD | in_db |
CI/CD for Machine Learning
ci-cd-for-machine-learning
|
✓ | — | Existing dimension (library) · Role↔dimension skipped (dimension not under chosen role) |
Library artifacts (this run)
| Kind | Detail | DB id |
|---|---|---|
| canonical_skill_proposed | Hugging Face Transformers | type=Machine Learning Frameworks subtype=general nature=TOOL lifespan=MULTI_YEAR | |
| canonical_skill_proposed | Llama 3 | type=Machine Learning Frameworks subtype=general nature=TOOL lifespan=SHORT_LIVED | |
| canonical_skill_proposed | Mistral | type=Machine Learning Frameworks subtype=general nature=TOOL lifespan=SHORT_LIVED | |
| canonical_skill_proposed | TGI | type=Machine Learning Frameworks subtype=general nature=TOOL lifespan=SHORT_LIVED | |
| canonical_skill_proposed | GPU scheduling | type=Infrastructure Tools subtype=general nature=CONCEPT lifespan=MULTI_YEAR | |
| canonical_skill_proposed | BM25 | type=Search Algorithms subtype=general nature=CONCEPT lifespan=EVERGREEN | |
| canonical_skill_proposed | INT8 | type=Data Types subtype=general nature=CONCEPT lifespan=MULTI_YEAR | |
| canonical_skill_proposed | FP16 | type=Data Types subtype=general nature=CONCEPT lifespan=MULTI_YEAR | |
| dimension_skill_link_proposed | text-embedding-3 ↔ Embedding Models |
nano JD Parser — gpt-4.1-nano click to toggle
Show raw JSON
{
"JD_type": "pass",
"about_company": null,
"certifications": [],
"company_name": null,
"ctc": {
"currency": null,
"max": null,
"min": null,
"period": null,
"raw": "competitive"
},
"domain": {
"primary": {
"aliases": [],
"domain": "Software \u0026 SaaS Products"
},
"secondary": null
},
"education": [],
"experience": {
"max": null,
"min": 5,
"raw": "5+ years of hands-on ML engineering experience, with at least 2 years productionising deep-learning models."
},
"job_locations": [
{
"aliases": [
"Bangalore"
],
"city": "Bengaluru",
"country": "India",
"state": null,
"work_mode": "hybrid"
}
],
"role": "Senior Machine Learning Engineer",
"role_aliases": [
"ML Engineer",
"Machine Learning Engineer",
"Senior ML Engineer"
],
"role_archetype": "Data",
"roles_and_responsibilities": [
{
"bullet_count": 0,
"heading": "About the role",
"heading_was_present": true,
"source_marker": {
"first_5_words": "We are looking for a",
"last_5_words": "infrastructure end-to-end."
},
"text": "We are looking for a Senior ML Engineer to own the lifecycle of production machine-learning systems at our platform. You will work with applied scientists to take research models from notebook to production, build retrieval-augmented generation (RAG) pipelines on top of our internal document corpus, and own MLOps infrastructure end-to-end.",
"word_count": 49
},
{
"bullet_count": 5,
"heading": "Responsibilities",
"heading_was_present": true,
"source_marker": {
"first_5_words": "Design and deploy production-grade",
"last_5_words": "versioning (MLflow), and CI/CD for ML."
},
"text": "- Design and deploy production-grade ML services serving 50M+ predictions/day.\n- Build feature stores, training pipelines, and online inference infrastructure with PyTorch and Hugging Face Transformers.\n- Lead our LLM efforts: fine-tune open-source models (Llama 3, Mistral) and integrate with our RAG retrieval layer (FAISS, pgvector embeddings).\n- Set up evaluation harnesses, A/B testing frameworks, and model monitoring/drift detection.\n- Mentor junior ML engineers on best practices for reproducible experiments, model versioning (MLflow), and CI/CD for ML.",
"word_count": 102
},
{
"bullet_count": 6,
"heading": "Requirements",
"heading_was_present": true,
"source_marker": {
"first_5_words": "5+ years of hands-on ML",
"last_5_words": "experience with GPU scheduling preferred."
},
"text": "- 5+ years of hands-on ML engineering experience, with at least 2 years productionising deep-learning models.\n- Strong Python; production code with PyTorch, TensorFlow, or JAX.\n- Experience with vector databases (Pinecone, Weaviate, pgvector) and embedding models (text-embedding-3, BGE, etc.).\n- LLM serving experience: vLLM, TGI, or Triton.\n- Familiar with MLOps tooling (MLflow, Kubeflow, Weights \u0026 Biases) and orchestration (Airflow, Prefect).\n- Comfort with Kubernetes for ML workloads; experience with GPU scheduling preferred.",
"word_count": 91
},
{
"bullet_count": 3,
"heading": "Nice to have",
"heading_was_present": true,
"source_marker": {
"first_5_words": "Open-source contributions to ML",
"last_5_words": "quantization (INT8/FP16), and inference optimisation."
},
"text": "- Open-source contributions to ML frameworks or popular LLM tooling.\n- Experience with retrieval-augmented generation, sparse retrieval (BM25), and hybrid search.\n- Familiarity with vector index tuning, quantization (INT8/FP16), and inference optimisation.",
"word_count": 42
}
],
"urls": []
}
API 1 — extract-from-jd click to toggle
{
"final_skills": [
{
"is_primary": true,
"skill_name": "Python"
},
{
"is_primary": true,
"skill_name": "PyTorch"
},
{
"is_primary": true,
"skill_name": "Hugging Face Transformers"
},
{
"is_primary": true,
"skill_name": "Llama 3"
},
{
"is_primary": true,
"skill_name": "Mistral"
},
{
"is_primary": true,
"skill_name": "FAISS"
},
{
"is_primary": true,
"skill_name": "pgvector"
},
{
"is_primary": true,
"skill_name": "A/B Testing"
},
{
"is_primary": true,
"skill_name": "MLflow"
},
{
"is_primary": false,
"skill_name": "TensorFlow"
},
{
"is_primary": false,
"skill_name": "JAX"
},
{
"is_primary": false,
"skill_name": "Pinecone"
},
{
"is_primary": false,
"skill_name": "Weaviate"
},
{
"is_primary": false,
"skill_name": "text-embedding-3"
},
{
"is_primary": false,
"skill_name": "BGE"
},
{
"is_primary": false,
"skill_name": "vLLM"
},
{
"is_primary": false,
"skill_name": "TGI"
},
{
"is_primary": false,
"skill_name": "Triton"
},
{
"is_primary": false,
"skill_name": "Kubeflow"
},
{
"is_primary": false,
"skill_name": "Weights \u0026 Biases"
},
{
"is_primary": false,
"skill_name": "Airflow"
},
{
"is_primary": false,
"skill_name": "Prefect"
},
{
"is_primary": false,
"skill_name": "Kubernetes"
},
{
"is_primary": false,
"skill_name": "GPU scheduling"
},
{
"is_primary": false,
"skill_name": "BM25"
},
{
"is_primary": false,
"skill_name": "INT8"
},
{
"is_primary": false,
"skill_name": "FP16"
},
{
"is_primary": false,
"skill_name": "CI/CD"
}
],
"jd_role": {
"display_name": "Senior Machine Learning Engineer",
"rationale": null,
"role_aliases": [
"ML Engineer",
"Machine Learning Engineer",
"Senior ML Engineer"
],
"role_archetype": "Data",
"slug": ""
},
"nano_parsed": {
"JD_type": "pass",
"about_company": null,
"certifications": [],
"company_name": null,
"ctc": {
"currency": null,
"max": null,
"min": null,
"period": null,
"raw": "competitive"
},
"domain": {
"primary": {
"aliases": [],
"domain": "Software \u0026 SaaS Products"
},
"secondary": null
},
"education": [],
"experience": {
"max": null,
"min": 5,
"raw": "5+ years of hands-on ML engineering experience, with at least 2 years productionising deep-learning models."
},
"job_locations": [
{
"aliases": [
"Bangalore"
],
"city": "Bengaluru",
"country": "India",
"state": null,
"work_mode": "hybrid"
}
],
"role": "Senior Machine Learning Engineer",
"role_aliases": [
"ML Engineer",
"Machine Learning Engineer",
"Senior ML Engineer"
],
"role_archetype": "Data",
"roles_and_responsibilities": [
{
"bullet_count": 0,
"heading": "About the role",
"heading_was_present": true,
"source_marker": {
"first_5_words": "We are looking for a",
"last_5_words": "infrastructure end-to-end."
},
"text": "We are looking for a Senior ML Engineer to own the lifecycle of production machine-learning systems at our platform. You will work with applied scientists to take research models from notebook to production, build retrieval-augmented generation (RAG) pipelines on top of our internal document corpus, and own MLOps infrastructure end-to-end.",
"word_count": 49
},
{
"bullet_count": 5,
"heading": "Responsibilities",
"heading_was_present": true,
"source_marker": {
"first_5_words": "Design and deploy production-grade",
"last_5_words": "versioning (MLflow), and CI/CD for ML."
},
"text": "- Design and deploy production-grade ML services serving 50M+ predictions/day.\n- Build feature stores, training pipelines, and online inference infrastructure with PyTorch and Hugging Face Transformers.\n- Lead our LLM efforts: fine-tune open-source models (Llama 3, Mistral) and integrate with our RAG retrieval layer (FAISS, pgvector embeddings).\n- Set up evaluation harnesses, A/B testing frameworks, and model monitoring/drift detection.\n- Mentor junior ML engineers on best practices for reproducible experiments, model versioning (MLflow), and CI/CD for ML.",
"word_count": 102
},
{
"bullet_count": 6,
"heading": "Requirements",
"heading_was_present": true,
"source_marker": {
"first_5_words": "5+ years of hands-on ML",
"last_5_words": "experience with GPU scheduling preferred."
},
"text": "- 5+ years of hands-on ML engineering experience, with at least 2 years productionising deep-learning models.\n- Strong Python; production code with PyTorch, TensorFlow, or JAX.\n- Experience with vector databases (Pinecone, Weaviate, pgvector) and embedding models (text-embedding-3, BGE, etc.).\n- LLM serving experience: vLLM, TGI, or Triton.\n- Familiar with MLOps tooling (MLflow, Kubeflow, Weights \u0026 Biases) and orchestration (Airflow, Prefect).\n- Comfort with Kubernetes for ML workloads; experience with GPU scheduling preferred.",
"word_count": 91
},
{
"bullet_count": 3,
"heading": "Nice to have",
"heading_was_present": true,
"source_marker": {
"first_5_words": "Open-source contributions to ML",
"last_5_words": "quantization (INT8/FP16), and inference optimisation."
},
"text": "- Open-source contributions to ML frameworks or popular LLM tooling.\n- Experience with retrieval-augmented generation, sparse retrieval (BM25), and hybrid search.\n- Familiarity with vector index tuning, quantization (INT8/FP16), and inference optimisation.",
"word_count": 42
}
],
"urls": []
},
"rejected": false,
"rejection_reason": null,
"run_id": "48b52573-a42b-4e34-9481-72d73c5404f8",
"stage3_signals": {
"alias_found": true,
"alias_match_roles": [
{
"display_name": "ML Engineer",
"kra_matches": null,
"matched_count": null,
"matched_skills": null,
"role_id": 3,
"score": 1.0,
"slug": "ml-engineer",
"total_count": null
}
],
"kra_match_roles": [
{
"display_name": "ML Ops Engineer",
"kra_matches": [
{
"kra_text": "Maintains model versioning, experiment lineage, and artifact tracking using MLflow, DVC, or Weights \u0026 Biases for reproducibility and auditability.",
"sentence": "Mentor junior ML engineers on best practices for reproducible experiments, model versioning (MLflow), and CI/CD for ML.",
"similarity": 0.6578
},
{
"kra_text": "Sets up model monitoring dashboards, data drift detection, prediction performance tracking, and alert routing for production ML systems.",
"sentence": "Set up evaluation harnesses, A/B testing frameworks, and model monitoring/drift detection.",
"similarity": 0.6211
},
{
"kra_text": "Sets up model monitoring dashboards, data drift detection, prediction performance tracking, and alert routing for production ML systems.",
"sentence": "Design and deploy production-grade ML services serving 50M+ predictions/day.",
"similarity": 0.6049
}
],
"matched_count": null,
"matched_skills": null,
"role_id": 16,
"score": 0.6279,
"slug": "ml-ops-engineer",
"total_count": null
},
{
"display_name": "ML Engineer",
"kra_matches": [
{
"kra_text": "Builds model serving infrastructure to deploy trained models as real-time prediction APIs or batch inference jobs using TorchServe, TensorFlow Serving, or SageMaker.",
"sentence": "Design and deploy production-grade ML services serving 50M+ predictions/day.",
"similarity": 0.6091
},
{
"kra_text": "Prepares, cleans, and transforms training datasets, manages feature stores, and builds feature engineering pipelines for model training.",
"sentence": "Build feature stores, training pipelines, and online inference infrastructure with PyTorch and Hugging Face Transformers.",
"similarity": 0.607
},
{
"kra_text": "Monitors production model behavior for data drift, concept drift, and prediction performance degradation using monitoring dashboards and alerting.",
"sentence": "Set up evaluation harnesses, A/B testing frameworks, and model monitoring/drift detection.",
"similarity": 0.5963
}
],
"matched_count": null,
"matched_skills": null,
"role_id": 3,
"score": 0.6041,
"slug": "ml-engineer",
"total_count": null
},
{
"display_name": "AI Engineer",
"kra_matches": [
{
"kra_text": "Defines evaluation frameworks, automated test suites, and human feedback loops to measure AI feature quality, accuracy, and consistency.",
"sentence": "Set up evaluation harnesses, A/B testing frameworks, and model monitoring/drift detection.",
"similarity": 0.5699
},
{
"kra_text": "Designs and implements prompt engineering workflows, few-shot examples, chain-of-thought patterns, and structured output parsing for AI feature pipelines.",
"sentence": "Build feature stores, training pipelines, and online inference infrastructure with PyTorch and Hugging Face Transformers.",
"similarity": 0.5055
},
{
"kra_text": "Designs and implements prompt engineering workflows, few-shot examples, chain-of-thought patterns, and structured output parsing for AI feature pipelines.",
"sentence": "You will work with applied scientists to take research models from notebook to production, build retrieval-augmented generation (RAG) pipelines on top of our internal document corpus, and own MLOps infrastructure end-to-end.",
"similarity": 0.4857
}
],
"matched_count": null,
"matched_skills": null,
"role_id": 13,
"score": 0.5204,
"slug": "ai-engineer",
"total_count": null
},
{
"display_name": "DevOps Engineer",
"kra_matches": [
{
"kra_text": "Collaborates with development teams to improve build processes, reduce deployment friction, containerize applications, and adopt DevOps best practices.",
"sentence": "Mentor junior ML engineers on best practices for reproducible experiments, model versioning (MLflow), and CI/CD for ML.",
"similarity": 0.4746
},
{
"kra_text": "Manages container orchestration with Kubernetes and Docker, deploying applications as pods, managing namespaces, and configuring auto-scaling across cloud environments.",
"sentence": "Comfort with Kubernetes for ML workloads; experience with GPU scheduling preferred.",
"similarity": 0.459
},
{
"kra_text": "Monitors CI/CD pipeline reliability, identifies bottlenecks in delivery workflows, and improves deployment frequency, lead time, and failure recovery rate.",
"sentence": "Set up evaluation harnesses, A/B testing frameworks, and model monitoring/drift detection.",
"similarity": 0.4483
}
],
"matched_count": null,
"matched_skills": null,
"role_id": 10,
"score": 0.4607,
"slug": "devops-engineer",
"total_count": null
},
{
"display_name": "AI Compliance Officer",
"kra_matches": [
{
"kra_text": "Monitors deployed AI systems for compliance policy drift, regulatory changes, and emerging requirements affecting existing AI deployments.",
"sentence": "Set up evaluation harnesses, A/B testing frameworks, and model monitoring/drift detection.",
"similarity": 0.5023
},
{
"kra_text": "Reviews AI use cases and model deployments against applicable regulations, internal ethics policies, and governance guidelines prior to production approval.",
"sentence": "Design and deploy production-grade ML services serving 50M+ predictions/day.",
"similarity": 0.4208
},
{
"kra_text": "Monitors deployed AI systems for compliance policy drift, regulatory changes, and emerging requirements affecting existing AI deployments.",
"sentence": "We are looking for a Senior ML Engineer to own the lifecycle of production machine-learning systems at our platform.",
"similarity": 0.4038
}
],
"matched_count": null,
"matched_skills": null,
"role_id": 12,
"score": 0.4423,
"slug": "ai-compliance-officer",
"total_count": null
}
],
"skill_match_roles": [
{
"display_name": "ML Engineer",
"kra_matches": null,
"matched_count": 4,
"matched_skills": [
"FAISS",
"MLflow",
"PyTorch",
"Python"
],
"role_id": 3,
"score": 0.4444,
"slug": "ml-engineer",
"total_count": 9
},
{
"display_name": "ML Ops Engineer",
"kra_matches": null,
"matched_count": 4,
"matched_skills": [
"FAISS",
"MLflow",
"PyTorch",
"Python"
],
"role_id": 16,
"score": 0.4444,
"slug": "ml-ops-engineer",
"total_count": 9
},
{
"display_name": "AI Engineer",
"kra_matches": null,
"matched_count": 2,
"matched_skills": [
"PyTorch",
"pgvector"
],
"role_id": 13,
"score": 0.2222,
"slug": "ai-engineer",
"total_count": 9
},
{
"display_name": "Fullstack Developer",
"kra_matches": null,
"matched_count": 1,
"matched_skills": [
"Python"
],
"role_id": 15,
"score": 0.1111,
"slug": "full-stack-engineer",
"total_count": 9
},
{
"display_name": "AR/VR Engineer",
"kra_matches": null,
"matched_count": 1,
"matched_skills": [
"Python"
],
"role_id": 8,
"score": 0.1111,
"slug": "ar-vr-engineer",
"total_count": 9
}
]
},
"stage4_decision": {
"alias_collision_detected": false,
"case": "A",
"chosen_role": {
"display_name": "ML Engineer",
"kra_matches": null,
"matched_count": null,
"matched_skills": null,
"role_id": 3,
"score": 1.0,
"slug": "ml-engineer",
"total_count": null
},
"confidence": 1.0,
"is_new_role": false,
"llm2_fired": false,
"llm2_reasoning": null,
"matched_dimensions": [],
"matched_kras": [],
"matched_skills": [],
"new_role_display_name": null,
"new_role_slug": null,
"queued": false,
"reasoning": "Exact alias hit on ml-engineer (1.0) \u2014 no other alias at this confidence; skill_top ml-engineer 0.44 does not contradict",
"sub_role": null
},
"stage5_updates": {
"centroid_n_after": 16,
"centroid_updated": true,
"collision_log_id": null,
"new_kra_attached": null,
"new_skills_attached": [
{
"is_primary": true,
"queue_id": 3243,
"role_display_name": "ML Engineer",
"role_slug": "ml-engineer",
"skill_name": "Hugging Face Transformers",
"status": "pending"
},
{
"is_primary": true,
"queue_id": 3244,
"role_display_name": "ML Engineer",
"role_slug": "ml-engineer",
"skill_name": "Llama 3",
"status": "pending"
},
{
"is_primary": true,
"queue_id": 3245,
"role_display_name": "ML Engineer",
"role_slug": "ml-engineer",
"skill_name": "Mistral",
"status": "pending"
},
{
"is_primary": false,
"queue_id": 3246,
"role_display_name": "ML Engineer",
"role_slug": "ml-engineer",
"skill_name": "text-embedding-3",
"status": "pending"
},
{
"is_primary": false,
"queue_id": 3247,
"role_display_name": "ML Engineer",
"role_slug": "ml-engineer",
"skill_name": "TGI",
"status": "pending"
},
{
"is_primary": false,
"queue_id": 3248,
"role_display_name": "ML Engineer",
"role_slug": "ml-engineer",
"skill_name": "GPU scheduling",
"status": "pending"
},
{
"is_primary": false,
"queue_id": 3249,
"role_display_name": "ML Engineer",
"role_slug": "ml-engineer",
"skill_name": "BM25",
"status": "pending"
},
{
"is_primary": false,
"queue_id": 3250,
"role_display_name": "ML Engineer",
"role_slug": "ml-engineer",
"skill_name": "INT8",
"status": "pending"
},
{
"is_primary": false,
"queue_id": 3251,
"role_display_name": "ML Engineer",
"role_slug": "ml-engineer",
"skill_name": "FP16",
"status": "pending"
}
],
"queue_entry_id": null,
"v3_pipeline_triggered": false,
"v3_role_slug": null,
"v3_run_id": null
}
}
API 2 — extract-details
{
"alias_matches": [
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 67,
"existing_alias_text": "Python",
"input_term": "Python",
"matched_canonical": {
"category_id": 6,
"display_name": "Python",
"id": 5,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "LANGUAGE",
"slug": "python",
"sub_category_id": 96,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 441,
"existing_alias_text": "PyTorch",
"input_term": "PyTorch",
"matched_canonical": {
"category_id": 7,
"display_name": "PyTorch",
"id": 195,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "LIBRARY",
"slug": "pytorch",
"sub_category_id": 156,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 1820,
"existing_alias_text": "FAISS",
"input_term": "FAISS",
"matched_canonical": {
"category_id": 7,
"display_name": "FAISS",
"id": 1184,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "LIBRARY",
"slug": "faiss",
"sub_category_id": 894,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 1882,
"existing_alias_text": "pgvector",
"input_term": "pgvector",
"matched_canonical": {
"category_id": 7,
"display_name": "pgvector",
"id": 1246,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "LIBRARY",
"slug": "pgvector",
"sub_category_id": 972,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 2565,
"existing_alias_text": "A/B Testing",
"input_term": "A/B Testing",
"matched_canonical": {
"category_id": 8,
"display_name": "A/B Testing",
"id": 1613,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "METHODOLOGY",
"slug": "a-b-testing",
"sub_category_id": 1214,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 470,
"existing_alias_text": "MLflow",
"input_term": "MLflow",
"matched_canonical": {
"category_id": 13,
"display_name": "MLflow",
"id": 214,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "TOOL",
"slug": "mlflow",
"sub_category_id": 1168,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 442,
"existing_alias_text": "TensorFlow",
"input_term": "TensorFlow",
"matched_canonical": {
"category_id": 7,
"display_name": "TensorFlow",
"id": 196,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "LIBRARY",
"slug": "tensorflow",
"sub_category_id": 156,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 451,
"existing_alias_text": "JAX",
"input_term": "JAX",
"matched_canonical": {
"category_id": 7,
"display_name": "JAX",
"id": 199,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "LIBRARY",
"slug": "jax",
"sub_category_id": 156,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 503,
"existing_alias_text": "Pinecone",
"input_term": "Pinecone",
"matched_canonical": {
"category_id": 9,
"display_name": "Pinecone",
"id": 242,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "PLATFORM",
"slug": "pinecone",
"sub_category_id": 177,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 1878,
"existing_alias_text": "Weaviate",
"input_term": "Weaviate",
"matched_canonical": {
"category_id": 9,
"display_name": "Weaviate",
"id": 1242,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "PLATFORM",
"slug": "weaviate",
"sub_category_id": 177,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "TODO: REMOVE AFTER TESTING \u2014 alias DB write disabled",
"alias_persisted": false,
"existing_alias_id": 1888,
"existing_alias_text": "text-embedding-3-large",
"input_term": "text-embedding-3",
"matched_canonical": {
"category_id": 11,
"display_name": "text-embedding-3-large",
"id": 1252,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "CLOUD_SERVICE",
"slug": "text-embedding-3-large",
"sub_category_id": 1005,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"matched_via": "embedding_alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 1885,
"existing_alias_text": "BGE",
"input_term": "BGE",
"matched_canonical": {
"category_id": 7,
"display_name": "BGE",
"id": 1249,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "LIBRARY",
"slug": "bge",
"sub_category_id": 973,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 502,
"existing_alias_text": "vLLM",
"input_term": "vLLM",
"matched_canonical": {
"category_id": 13,
"display_name": "vLLM",
"id": 241,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "TOOL",
"slug": "vllm",
"sub_category_id": 1167,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 2026,
"existing_alias_text": "Triton",
"input_term": "Triton",
"matched_canonical": {
"category_id": 6,
"display_name": "Triton",
"id": 1367,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "LANGUAGE",
"slug": "triton",
"sub_category_id": 1033,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 465,
"existing_alias_text": "Kubeflow",
"input_term": "Kubeflow",
"matched_canonical": {
"category_id": 5,
"display_name": "Kubeflow",
"id": 213,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "FRAMEWORK",
"slug": "kubeflow",
"sub_category_id": 1127,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 471,
"existing_alias_text": "Weights \u0026 Biases",
"input_term": "Weights \u0026 Biases",
"matched_canonical": {
"category_id": 9,
"display_name": "Weights \u0026 Biases",
"id": 215,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "PLATFORM",
"slug": "weights-biases",
"sub_category_id": 171,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 526,
"existing_alias_text": "Airflow",
"input_term": "Airflow",
"matched_canonical": {
"category_id": 13,
"display_name": "Airflow",
"id": 265,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "TOOL",
"slug": "airflow",
"sub_category_id": 130,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 306,
"existing_alias_text": "Prefect",
"input_term": "Prefect",
"matched_canonical": {
"category_id": 13,
"display_name": "Prefect",
"id": 112,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "TOOL",
"slug": "prefect",
"sub_category_id": 130,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 1267,
"existing_alias_text": "Kubernetes",
"input_term": "Kubernetes",
"matched_canonical": {
"category_id": 9,
"display_name": "Kubernetes",
"id": 726,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "PLATFORM",
"slug": "kubernetes",
"sub_category_id": 557,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"matched_via": "alias"
},
{
"alias_persist_skipped_reason": "alias_text already exists for this canonical skill",
"alias_persisted": false,
"existing_alias_id": 1826,
"existing_alias_text": "CI/CD",
"input_term": "CI/CD",
"matched_canonical": {
"category_id": 8,
"display_name": "CI/CD",
"id": 1190,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "METHODOLOGY",
"slug": "ci-cd",
"sub_category_id": 900,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"matched_via": "alias"
}
],
"candidate_roles": [
{
"display_name": "Cloud Security Engineer",
"id": 23,
"rationale": null,
"role_archetype": null,
"slug": "cloud-security-engineer",
"source": "db"
},
{
"display_name": "Backend Developer",
"id": 1,
"rationale": null,
"role_archetype": "A Backend Engineer designs, builds, and maintains the server-side logic and data handling that power applications and services. They focus on implementing reliable business functionality, integrating with other systems, and ensuring the backend is scalable, maintainable, and observable.",
"slug": "backend-engineer",
"source": "db"
},
{
"display_name": "Fullstack Developer",
"id": 15,
"rationale": null,
"role_archetype": null,
"slug": "full-stack-engineer",
"source": "db"
},
{
"display_name": "Cybersecurity Engineer",
"id": 5,
"rationale": null,
"role_archetype": null,
"slug": "cybersecurity-engineer",
"source": "db"
},
{
"display_name": "Data Engineer",
"id": 2,
"rationale": null,
"role_archetype": null,
"slug": "data-engineer",
"source": "db"
},
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
},
{
"display_name": "AR/VR Engineer",
"id": 8,
"rationale": null,
"role_archetype": null,
"slug": "ar-vr-engineer",
"source": "db"
},
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
},
{
"display_name": "Cloud Architect",
"id": 9,
"rationale": null,
"role_archetype": null,
"slug": "cloud-architect",
"source": "db"
},
{
"display_name": "DevOps Engineer",
"id": 10,
"rationale": null,
"role_archetype": null,
"slug": "devops-engineer",
"source": "db"
}
],
"chosen_role": {
"display_name": "Machine Learning Engineer",
"id": null,
"rationale": "The role aligns with primary skills focused on Python and various ML frameworks.",
"role_archetype": "A specialist in developing and deploying machine learning models.",
"slug": "machine-learning-engineer",
"source": "llm"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Cloud Security Scripting \u0026 DSL Languages",
"id": 248,
"rationale": "Proficiency in programming and domain-specific languages used to automate and script cloud security controls.",
"slug": "cloud-security-scripting-dsl-languages",
"source": "db"
},
"input_skill": "Python",
"llm_role": null,
"roles_from_db": [
{
"display_name": "Cloud Security Engineer",
"id": 23,
"rationale": null,
"role_archetype": null,
"slug": "cloud-security-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Programming Languages",
"id": 1,
"rationale": "Primary implementation languages used to build client and server feature code. Full stack engineers need enough fluency to move across layers and implement product behavior end to end.",
"slug": "programming-languages",
"source": "db"
},
"input_skill": "Python",
"llm_role": null,
"roles_from_db": [
{
"display_name": "Backend Developer",
"id": 1,
"rationale": null,
"role_archetype": "A Backend Engineer designs, builds, and maintains the server-side logic and data handling that power applications and services. They focus on implementing reliable business functionality, integrating with other systems, and ensuring the backend is scalable, maintainable, and observable.",
"slug": "backend-engineer",
"source": "db"
},
{
"display_name": "Fullstack Developer",
"id": 15,
"rationale": null,
"role_archetype": null,
"slug": "full-stack-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Programming Languages and Scripting",
"id": 59,
"rationale": "Languages used to write security automation, analysis scripts, detection logic, and remediation helpers. This is the primary implementation surface for a cybersecurity engineer across tooling and response workflows.",
"slug": "programming-languages-and-scripting",
"source": "db"
},
"input_skill": "Python",
"llm_role": null,
"roles_from_db": [
{
"display_name": "Cybersecurity Engineer",
"id": 5,
"rationale": null,
"role_archetype": null,
"slug": "cybersecurity-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Programming Languages for Data Work",
"id": 21,
"rationale": "Languages used to implement data pipelines, transformations, and operational glue. This is the primary coding surface for building ingestion, enrichment, and automation logic in data engineering.",
"slug": "programming-languages-for-data-work",
"source": "db"
},
"input_skill": "Python",
"llm_role": null,
"roles_from_db": [
{
"display_name": "Data Engineer",
"id": 2,
"rationale": null,
"role_archetype": null,
"slug": "data-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Programming Languages for ML Systems",
"id": 39,
"rationale": "Languages used to build training code, inference services, evaluation jobs, and ML glue code. This is the primary implementation surface for ML engineers across experimentation and productionization.",
"slug": "programming-languages-for-ml-systems",
"source": "db"
},
"input_skill": "Python",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Programming Languages for XR",
"id": 97,
"rationale": "Primary implementation languages used to build immersive client features, interaction logic, and device-specific runtime behavior. This is the core coding surface for AR/VR experiences.",
"slug": "programming-languages-for-xr",
"source": "db"
},
"input_skill": "Python",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AR/VR Engineer",
"id": 8,
"rationale": null,
"role_archetype": null,
"slug": "ar-vr-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "ML Frameworks and Libraries",
"id": 40,
"rationale": "Core libraries used to define models, train them, run inference, and evaluate predictive performance. These frameworks shape how ML engineers express model architectures and training loops.",
"slug": "ml-frameworks-and-libraries",
"source": "db"
},
"input_skill": "PyTorch",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Model Fine-Tuning \u0026 Adaptation",
"id": 212,
"rationale": "Techniques and libraries for adapting pre-trained language models to specific tasks or domains.",
"slug": "model-fine-tuning-adaptation",
"source": "db"
},
"input_skill": "PyTorch",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "ML Frameworks and Libraries",
"id": 40,
"rationale": "Core libraries used to define models, train them, run inference, and evaluate predictive performance. These frameworks shape how ML engineers express model architectures and training loops.",
"slug": "ml-frameworks-and-libraries",
"source": "db"
},
"input_skill": "FAISS",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "React Frontend Development",
"id": 96,
"rationale": "Building interactive web user interfaces with React.js, including component composition, state management, hooks, and rendering patterns. React.js belongs here because it is a core library for client-side UI development in modern web applications.",
"slug": "d_init_01",
"source": "db"
},
"input_skill": "FAISS",
"llm_role": null,
"roles_from_db": []
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Vector Databases",
"id": 198,
"rationale": "Specialized storage and indexing systems used to persist embeddings and support similarity search. This is a distinct vendor-family cluster because AI features often depend on a concrete vector store choice and its operational behavior.",
"slug": "vector-databases",
"source": "db"
},
"input_skill": "pgvector",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "React Frontend Development",
"id": 96,
"rationale": "Building interactive web user interfaces with React.js, including component composition, state management, hooks, and rendering patterns. React.js belongs here because it is a core library for client-side UI development in modern web applications.",
"slug": "d_init_01",
"source": "db"
},
"input_skill": "A/B Testing",
"llm_role": null,
"roles_from_db": []
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Systems Programming",
"id": 166,
"rationale": "Systems programming covers low-level software development where performance, memory safety, and direct control over resources matter. Rust fits here because it is commonly used for OS-adjacent services, infrastructure components, and other performance-sensitive systems code.",
"slug": "d_init_02",
"source": "db"
},
"input_skill": "A/B Testing",
"llm_role": null,
"roles_from_db": []
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "MLOps Platforms and Lifecycle",
"id": 43,
"rationale": "End-to-end managed platforms used to train, deploy, register, and govern models across their lifecycle. This is the operational control plane for production ML workflows.",
"slug": "mlops-platforms-and-lifecycle",
"source": "db"
},
"input_skill": "MLflow",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "ML Frameworks and Libraries",
"id": 40,
"rationale": "Core libraries used to define models, train them, run inference, and evaluate predictive performance. These frameworks shape how ML engineers express model architectures and training loops.",
"slug": "ml-frameworks-and-libraries",
"source": "db"
},
"input_skill": "TensorFlow",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "ML Frameworks and Libraries",
"id": 40,
"rationale": "Core libraries used to define models, train them, run inference, and evaluate predictive performance. These frameworks shape how ML engineers express model architectures and training loops.",
"slug": "ml-frameworks-and-libraries",
"source": "db"
},
"input_skill": "JAX",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "LLM Operations and Orchestration",
"id": 49,
"rationale": "Operational stack for building, serving, evaluating, and orchestrating LLM-based systems. This includes vector retrieval, prompt workflows, LLM serving, and observability for generative applications.",
"slug": "llm-operations-and-orchestration",
"source": "db"
},
"input_skill": "Pinecone",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
},
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Vector Databases",
"id": 198,
"rationale": "Specialized storage and indexing systems used to persist embeddings and support similarity search. This is a distinct vendor-family cluster because AI features often depend on a concrete vector store choice and its operational behavior.",
"slug": "vector-databases",
"source": "db"
},
"input_skill": "Pinecone",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Vector Databases",
"id": 198,
"rationale": "Specialized storage and indexing systems used to persist embeddings and support similarity search. This is a distinct vendor-family cluster because AI features often depend on a concrete vector store choice and its operational behavior.",
"slug": "vector-databases",
"source": "db"
},
"input_skill": "Weaviate",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Embedding Models",
"id": 199,
"rationale": "Models and APIs used to convert text or other content into vector representations for retrieval, clustering, and semantic matching. This cluster is coherent because embedding choice affects retrieval quality, cost, and latency in AI features.",
"slug": "embedding-models",
"source": "db"
},
"input_skill": "text-embedding-3",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Embedding Models",
"id": 199,
"rationale": "Models and APIs used to convert text or other content into vector representations for retrieval, clustering, and semantic matching. This cluster is coherent because embedding choice affects retrieval quality, cost, and latency in AI features.",
"slug": "embedding-models",
"source": "db"
},
"input_skill": "BGE",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "LLM Operations and Orchestration",
"id": 49,
"rationale": "Operational stack for building, serving, evaluating, and orchestrating LLM-based systems. This includes vector retrieval, prompt workflows, LLM serving, and observability for generative applications.",
"slug": "llm-operations-and-orchestration",
"source": "db"
},
"input_skill": "vLLM",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
},
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "LLM Serving \u0026 Deployment",
"id": 209,
"rationale": "Tools and frameworks for hosting and serving LLM models in production environments.",
"slug": "llm-serving-deployment",
"source": "db"
},
"input_skill": "vLLM",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Model Optimization and Acceleration",
"id": 53,
"rationale": "Techniques for improving inference latency, throughput, memory use, and training efficiency. ML engineers use these methods to meet production constraints without sacrificing too much quality.",
"slug": "model-optimization-and-acceleration",
"source": "db"
},
"input_skill": "Triton",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "MLOps Platforms and Lifecycle",
"id": 43,
"rationale": "End-to-end managed platforms used to train, deploy, register, and govern models across their lifecycle. This is the operational control plane for production ML workflows.",
"slug": "mlops-platforms-and-lifecycle",
"source": "db"
},
"input_skill": "Kubeflow",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Experiment Tracking and Evaluation",
"id": 44,
"rationale": "Tools and practices for recording experiments, comparing runs, and assessing model quality before release. This dimension focuses on reproducibility, metrics, artifacts, and offline evaluation workflows.",
"slug": "experiment-tracking-and-evaluation",
"source": "db"
},
"input_skill": "Weights \u0026 Biases",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Workflow Orchestration for ML Pipelines",
"id": 54,
"rationale": "Workflow engines used to coordinate training, evaluation, deployment, and retraining jobs. This cluster covers dependencies, retries, scheduling, and pipeline composition for ML lifecycle automation.",
"slug": "workflow-orchestration-for-ml-pipelines",
"source": "db"
},
"input_skill": "Airflow",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Data Pipeline Orchestration",
"id": 23,
"rationale": "Workflow engines that schedule, coordinate, and recover batch data jobs. This cluster covers dependency management, retries, backfills, sensors, and operational control of pipeline DAGs.",
"slug": "data-pipeline-orchestration",
"source": "db"
},
"input_skill": "Prefect",
"llm_role": null,
"roles_from_db": [
{
"display_name": "Data Engineer",
"id": 2,
"rationale": null,
"role_archetype": null,
"slug": "data-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Workflow Orchestration for ML Pipelines",
"id": 54,
"rationale": "Workflow engines used to coordinate training, evaluation, deployment, and retraining jobs. This cluster covers dependencies, retries, scheduling, and pipeline composition for ML lifecycle automation.",
"slug": "workflow-orchestration-for-ml-pipelines",
"source": "db"
},
"input_skill": "Prefect",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Container Orchestration Platforms",
"id": 134,
"rationale": "Platforms that schedule and manage containerized workloads across clusters and environments. Cloud Architects need these to define workload placement standards, cluster boundaries, and platform capabilities.",
"slug": "container-orchestration-platforms",
"source": "db"
},
"input_skill": "Kubernetes",
"llm_role": null,
"roles_from_db": [
{
"display_name": "Cloud Architect",
"id": 9,
"rationale": null,
"role_archetype": null,
"slug": "cloud-architect",
"source": "db"
},
{
"display_name": "DevOps Engineer",
"id": 10,
"rationale": null,
"role_archetype": null,
"slug": "devops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Kubernetes for ML Workloads",
"id": 47,
"rationale": "Kubernetes-native components used to schedule, accelerate, and isolate ML training and serving workloads. This includes GPU enablement and ML-specific controllers rather than generic cluster administration.",
"slug": "kubernetes-for-ml-workloads",
"source": "db"
},
"input_skill": "Kubernetes",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "CI/CD Pipeline Platforms",
"id": 150,
"rationale": "Systems used to define, run, and maintain automated build and deployment workflows. This cluster is coherent because the role owns delivery automation end to end, including pipeline reliability and promotion logic.",
"slug": "ci-cd-pipeline-platforms",
"source": "db"
},
"input_skill": "CI/CD",
"llm_role": null,
"roles_from_db": [
{
"display_name": "DevOps Engineer",
"id": 10,
"rationale": null,
"role_archetype": null,
"slug": "devops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "CI/CD for Machine Learning",
"id": 56,
"rationale": "Tools and platforms for automating ML model integration, testing, and deployment pipelines.",
"slug": "ci-cd-for-machine-learning",
"source": "db"
},
"input_skill": "CI/CD",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
}
]
}
],
"input_final_skills": [
"Python",
"PyTorch",
"Hugging Face Transformers",
"Llama 3",
"Mistral",
"FAISS",
"pgvector",
"A/B Testing",
"MLflow",
"TensorFlow",
"JAX",
"Pinecone",
"Weaviate",
"text-embedding-3",
"BGE",
"vLLM",
"TGI",
"Triton",
"Kubeflow",
"Weights \u0026 Biases",
"Airflow",
"Prefect",
"Kubernetes",
"GPU scheduling",
"BM25",
"INT8",
"FP16",
"CI/CD"
],
"input_llm_skills": [
"Python",
"PyTorch",
"Hugging Face Transformers",
"Llama 3",
"Mistral",
"FAISS",
"pgvector",
"A/B Testing",
"MLflow",
"TensorFlow",
"JAX",
"Pinecone",
"Weaviate",
"text-embedding-3",
"BGE",
"vLLM",
"TGI",
"Triton",
"Kubeflow",
"Weights \u0026 Biases",
"Airflow",
"Prefect",
"Kubernetes",
"GPU scheduling",
"BM25",
"INT8",
"FP16",
"CI/CD"
],
"new_aliases_persisted": 0,
"run_id": "48b52573-a42b-4e34-9481-72d73c5404f8",
"skills_detail": [
{
"aliases_in_db": [
{
"alias_text": "Python",
"alias_type": "CANONICAL",
"id": 67,
"is_primary": true,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "Python 2",
"alias_type": "VERSION",
"id": 72,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "Python 2.x",
"alias_type": "VERSION",
"id": 74,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "Python 3",
"alias_type": "VERSION",
"id": 73,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "Python 3.10",
"alias_type": "VERSION",
"id": 76,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "Python 3.11",
"alias_type": "VERSION",
"id": 77,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "Python 3.12",
"alias_type": "VERSION",
"id": 78,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "Python 3.x",
"alias_type": "VERSION",
"id": 75,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "py",
"alias_type": "VERSION",
"id": 2183,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "py2",
"alias_type": "VERSION",
"id": 68,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "py3",
"alias_type": "VERSION",
"id": 69,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "python 3",
"alias_type": "VERSION",
"id": 2186,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "python 3.x",
"alias_type": "VERSION",
"id": 2849,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "python2",
"alias_type": "VERSION",
"id": 70,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "python3",
"alias_type": "VERSION",
"id": 71,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "python3.x",
"alias_type": "VERSION",
"id": 2848,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 6,
"display_name": "Python",
"id": 5,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "LANGUAGE",
"slug": "python",
"sub_category_id": 96,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Cloud Security Scripting \u0026 DSL Languages",
"id": 248,
"rationale": "Proficiency in programming and domain-specific languages used to automate and script cloud security controls.",
"slug": "cloud-security-scripting-dsl-languages",
"source": "db"
},
"input_skill": "Python",
"llm_role": null,
"roles_from_db": [
{
"display_name": "Cloud Security Engineer",
"id": 23,
"rationale": null,
"role_archetype": null,
"slug": "cloud-security-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Programming Languages",
"id": 1,
"rationale": "Primary implementation languages used to build client and server feature code. Full stack engineers need enough fluency to move across layers and implement product behavior end to end.",
"slug": "programming-languages",
"source": "db"
},
"input_skill": "Python",
"llm_role": null,
"roles_from_db": [
{
"display_name": "Backend Developer",
"id": 1,
"rationale": null,
"role_archetype": "A Backend Engineer designs, builds, and maintains the server-side logic and data handling that power applications and services. They focus on implementing reliable business functionality, integrating with other systems, and ensuring the backend is scalable, maintainable, and observable.",
"slug": "backend-engineer",
"source": "db"
},
{
"display_name": "Fullstack Developer",
"id": 15,
"rationale": null,
"role_archetype": null,
"slug": "full-stack-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Programming Languages and Scripting",
"id": 59,
"rationale": "Languages used to write security automation, analysis scripts, detection logic, and remediation helpers. This is the primary implementation surface for a cybersecurity engineer across tooling and response workflows.",
"slug": "programming-languages-and-scripting",
"source": "db"
},
"input_skill": "Python",
"llm_role": null,
"roles_from_db": [
{
"display_name": "Cybersecurity Engineer",
"id": 5,
"rationale": null,
"role_archetype": null,
"slug": "cybersecurity-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Programming Languages for Data Work",
"id": 21,
"rationale": "Languages used to implement data pipelines, transformations, and operational glue. This is the primary coding surface for building ingestion, enrichment, and automation logic in data engineering.",
"slug": "programming-languages-for-data-work",
"source": "db"
},
"input_skill": "Python",
"llm_role": null,
"roles_from_db": [
{
"display_name": "Data Engineer",
"id": 2,
"rationale": null,
"role_archetype": null,
"slug": "data-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Programming Languages for ML Systems",
"id": 39,
"rationale": "Languages used to build training code, inference services, evaluation jobs, and ML glue code. This is the primary implementation surface for ML engineers across experimentation and productionization.",
"slug": "programming-languages-for-ml-systems",
"source": "db"
},
"input_skill": "Python",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Programming Languages for XR",
"id": 97,
"rationale": "Primary implementation languages used to build immersive client features, interaction logic, and device-specific runtime behavior. This is the core coding surface for AR/VR experiences.",
"slug": "programming-languages-for-xr",
"source": "db"
},
"input_skill": "Python",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AR/VR Engineer",
"id": 8,
"rationale": null,
"role_archetype": null,
"slug": "ar-vr-engineer",
"source": "db"
}
]
}
],
"input_skill": "Python",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "PyTorch",
"alias_type": "CANONICAL",
"id": 441,
"is_primary": true,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 7,
"display_name": "PyTorch",
"id": 195,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "LIBRARY",
"slug": "pytorch",
"sub_category_id": 156,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "ML Frameworks and Libraries",
"id": 40,
"rationale": "Core libraries used to define models, train them, run inference, and evaluate predictive performance. These frameworks shape how ML engineers express model architectures and training loops.",
"slug": "ml-frameworks-and-libraries",
"source": "db"
},
"input_skill": "PyTorch",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Model Fine-Tuning \u0026 Adaptation",
"id": 212,
"rationale": "Techniques and libraries for adapting pre-trained language models to specific tasks or domains.",
"slug": "model-fine-tuning-adaptation",
"source": "db"
},
"input_skill": "PyTorch",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
]
}
],
"input_skill": "PyTorch",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "Hugging Face Transformers",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Machine Learning Frameworks",
"skill_nature": "TOOL",
"sub_category": "general",
"typical_lifespan": "MULTI_YEAR",
"version_strategy": "UNVERSIONED",
"volatility": "MEDIUM"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "hugging-face-transformers",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
},
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "Llama 3",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Machine Learning Frameworks",
"skill_nature": "TOOL",
"sub_category": "general",
"typical_lifespan": "SHORT_LIVED",
"version_strategy": "VERSIONED",
"volatility": "FAST"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "llama-3",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
},
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "Mistral",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Machine Learning Frameworks",
"skill_nature": "TOOL",
"sub_category": "general",
"typical_lifespan": "SHORT_LIVED",
"version_strategy": "VERSIONED",
"volatility": "FAST"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "mistral",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "FAISS",
"alias_type": "CANONICAL",
"id": 1820,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 7,
"display_name": "FAISS",
"id": 1184,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "LIBRARY",
"slug": "faiss",
"sub_category_id": 894,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "ML Frameworks and Libraries",
"id": 40,
"rationale": "Core libraries used to define models, train them, run inference, and evaluate predictive performance. These frameworks shape how ML engineers express model architectures and training loops.",
"slug": "ml-frameworks-and-libraries",
"source": "db"
},
"input_skill": "FAISS",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "React Frontend Development",
"id": 96,
"rationale": "Building interactive web user interfaces with React.js, including component composition, state management, hooks, and rendering patterns. React.js belongs here because it is a core library for client-side UI development in modern web applications.",
"slug": "d_init_01",
"source": "db"
},
"input_skill": "FAISS",
"llm_role": null,
"roles_from_db": []
}
],
"input_skill": "FAISS",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "pgvector",
"alias_type": "CANONICAL",
"id": 1882,
"is_primary": true,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 7,
"display_name": "pgvector",
"id": 1246,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "LIBRARY",
"slug": "pgvector",
"sub_category_id": 972,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Vector Databases",
"id": 198,
"rationale": "Specialized storage and indexing systems used to persist embeddings and support similarity search. This is a distinct vendor-family cluster because AI features often depend on a concrete vector store choice and its operational behavior.",
"slug": "vector-databases",
"source": "db"
},
"input_skill": "pgvector",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
]
}
],
"input_skill": "pgvector",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "A/B Testing",
"alias_type": "CANONICAL",
"id": 2565,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 8,
"display_name": "A/B Testing",
"id": 1613,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "METHODOLOGY",
"slug": "a-b-testing",
"sub_category_id": 1214,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "React Frontend Development",
"id": 96,
"rationale": "Building interactive web user interfaces with React.js, including component composition, state management, hooks, and rendering patterns. React.js belongs here because it is a core library for client-side UI development in modern web applications.",
"slug": "d_init_01",
"source": "db"
},
"input_skill": "A/B Testing",
"llm_role": null,
"roles_from_db": []
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Systems Programming",
"id": 166,
"rationale": "Systems programming covers low-level software development where performance, memory safety, and direct control over resources matter. Rust fits here because it is commonly used for OS-adjacent services, infrastructure components, and other performance-sensitive systems code.",
"slug": "d_init_02",
"source": "db"
},
"input_skill": "A/B Testing",
"llm_role": null,
"roles_from_db": []
}
],
"input_skill": "A/B Testing",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "MLflow",
"alias_type": "CANONICAL",
"id": 470,
"is_primary": true,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 13,
"display_name": "MLflow",
"id": 214,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "TOOL",
"slug": "mlflow",
"sub_category_id": 1168,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "MLOps Platforms and Lifecycle",
"id": 43,
"rationale": "End-to-end managed platforms used to train, deploy, register, and govern models across their lifecycle. This is the operational control plane for production ML workflows.",
"slug": "mlops-platforms-and-lifecycle",
"source": "db"
},
"input_skill": "MLflow",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
}
],
"input_skill": "MLflow",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "TensorFlow",
"alias_type": "CANONICAL",
"id": 442,
"is_primary": true,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "TF1",
"alias_type": "VERSION",
"id": 443,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "TF2",
"alias_type": "VERSION",
"id": 444,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "TensorFlow 1",
"alias_type": "VERSION",
"id": 445,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "TensorFlow 1.x",
"alias_type": "VERSION",
"id": 447,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "TensorFlow 2",
"alias_type": "VERSION",
"id": 446,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "TensorFlow 2.x",
"alias_type": "VERSION",
"id": 448,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "tensorflow 1",
"alias_type": "VERSION",
"id": 2490,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "tensorflow 1.x",
"alias_type": "VERSION",
"id": 2494,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "tensorflow 2",
"alias_type": "VERSION",
"id": 2491,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "tensorflow 2.x",
"alias_type": "VERSION",
"id": 2495,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "tensorflow v1",
"alias_type": "VERSION",
"id": 2492,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "tensorflow v2",
"alias_type": "VERSION",
"id": 2493,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "tf",
"alias_type": "VERSION",
"id": 2487,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "tf1",
"alias_type": "VERSION",
"id": 2488,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "tf2",
"alias_type": "VERSION",
"id": 2489,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 7,
"display_name": "TensorFlow",
"id": 196,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "LIBRARY",
"slug": "tensorflow",
"sub_category_id": 156,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "ML Frameworks and Libraries",
"id": 40,
"rationale": "Core libraries used to define models, train them, run inference, and evaluate predictive performance. These frameworks shape how ML engineers express model architectures and training loops.",
"slug": "ml-frameworks-and-libraries",
"source": "db"
},
"input_skill": "TensorFlow",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
}
],
"input_skill": "TensorFlow",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "JAX",
"alias_type": "CANONICAL",
"id": 451,
"is_primary": true,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 7,
"display_name": "JAX",
"id": 199,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "LIBRARY",
"slug": "jax",
"sub_category_id": 156,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "ML Frameworks and Libraries",
"id": 40,
"rationale": "Core libraries used to define models, train them, run inference, and evaluate predictive performance. These frameworks shape how ML engineers express model architectures and training loops.",
"slug": "ml-frameworks-and-libraries",
"source": "db"
},
"input_skill": "JAX",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
}
],
"input_skill": "JAX",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "Pinecone",
"alias_type": "CANONICAL",
"id": 503,
"is_primary": true,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 9,
"display_name": "Pinecone",
"id": 242,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "PLATFORM",
"slug": "pinecone",
"sub_category_id": 177,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "LLM Operations and Orchestration",
"id": 49,
"rationale": "Operational stack for building, serving, evaluating, and orchestrating LLM-based systems. This includes vector retrieval, prompt workflows, LLM serving, and observability for generative applications.",
"slug": "llm-operations-and-orchestration",
"source": "db"
},
"input_skill": "Pinecone",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
},
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Vector Databases",
"id": 198,
"rationale": "Specialized storage and indexing systems used to persist embeddings and support similarity search. This is a distinct vendor-family cluster because AI features often depend on a concrete vector store choice and its operational behavior.",
"slug": "vector-databases",
"source": "db"
},
"input_skill": "Pinecone",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
]
}
],
"input_skill": "Pinecone",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "Weaviate",
"alias_type": "CANONICAL",
"id": 1878,
"is_primary": true,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 9,
"display_name": "Weaviate",
"id": 1242,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "PLATFORM",
"slug": "weaviate",
"sub_category_id": 177,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Vector Databases",
"id": 198,
"rationale": "Specialized storage and indexing systems used to persist embeddings and support similarity search. This is a distinct vendor-family cluster because AI features often depend on a concrete vector store choice and its operational behavior.",
"slug": "vector-databases",
"source": "db"
},
"input_skill": "Weaviate",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
]
}
],
"input_skill": "Weaviate",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "text-embedding-3-large",
"alias_type": "CANONICAL",
"id": 1888,
"is_primary": true,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 11,
"display_name": "text-embedding-3-large",
"id": 1252,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "CLOUD_SERVICE",
"slug": "text-embedding-3-large",
"sub_category_id": 1005,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Embedding Models",
"id": 199,
"rationale": "Models and APIs used to convert text or other content into vector representations for retrieval, clustering, and semantic matching. This cluster is coherent because embedding choice affects retrieval quality, cost, and latency in AI features.",
"slug": "embedding-models",
"source": "db"
},
"input_skill": "text-embedding-3",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
]
}
],
"input_skill": "text-embedding-3",
"matched_via": "embedding_alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "BGE",
"alias_type": "CANONICAL",
"id": 1885,
"is_primary": true,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 7,
"display_name": "BGE",
"id": 1249,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "LIBRARY",
"slug": "bge",
"sub_category_id": 973,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Embedding Models",
"id": 199,
"rationale": "Models and APIs used to convert text or other content into vector representations for retrieval, clustering, and semantic matching. This cluster is coherent because embedding choice affects retrieval quality, cost, and latency in AI features.",
"slug": "embedding-models",
"source": "db"
},
"input_skill": "BGE",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
]
}
],
"input_skill": "BGE",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "vLLM",
"alias_type": "CANONICAL",
"id": 502,
"is_primary": true,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 13,
"display_name": "vLLM",
"id": 241,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "TOOL",
"slug": "vllm",
"sub_category_id": 1167,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "LLM Operations and Orchestration",
"id": 49,
"rationale": "Operational stack for building, serving, evaluating, and orchestrating LLM-based systems. This includes vector retrieval, prompt workflows, LLM serving, and observability for generative applications.",
"slug": "llm-operations-and-orchestration",
"source": "db"
},
"input_skill": "vLLM",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
},
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "LLM Serving \u0026 Deployment",
"id": 209,
"rationale": "Tools and frameworks for hosting and serving LLM models in production environments.",
"slug": "llm-serving-deployment",
"source": "db"
},
"input_skill": "vLLM",
"llm_role": null,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
]
}
],
"input_skill": "vLLM",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "TGI",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Machine Learning Frameworks",
"skill_nature": "TOOL",
"sub_category": "general",
"typical_lifespan": "SHORT_LIVED",
"version_strategy": "VERSIONED",
"volatility": "FAST"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "tgi",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "Triton",
"alias_type": "CANONICAL",
"id": 2026,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 6,
"display_name": "Triton",
"id": 1367,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "LANGUAGE",
"slug": "triton",
"sub_category_id": 1033,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Model Optimization and Acceleration",
"id": 53,
"rationale": "Techniques for improving inference latency, throughput, memory use, and training efficiency. ML engineers use these methods to meet production constraints without sacrificing too much quality.",
"slug": "model-optimization-and-acceleration",
"source": "db"
},
"input_skill": "Triton",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
}
]
}
],
"input_skill": "Triton",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "Kubeflow",
"alias_type": "CANONICAL",
"id": 465,
"is_primary": true,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "Kubeflow 1.x",
"alias_type": "VERSION",
"id": 468,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "Kubeflow 2.x",
"alias_type": "VERSION",
"id": 469,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "Kubeflow v1",
"alias_type": "VERSION",
"id": 466,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "Kubeflow v2",
"alias_type": "VERSION",
"id": 467,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 5,
"display_name": "Kubeflow",
"id": 213,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "FRAMEWORK",
"slug": "kubeflow",
"sub_category_id": 1127,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "MLOps Platforms and Lifecycle",
"id": 43,
"rationale": "End-to-end managed platforms used to train, deploy, register, and govern models across their lifecycle. This is the operational control plane for production ML workflows.",
"slug": "mlops-platforms-and-lifecycle",
"source": "db"
},
"input_skill": "Kubeflow",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
}
],
"input_skill": "Kubeflow",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "Weights \u0026 Biases",
"alias_type": "CANONICAL",
"id": 471,
"is_primary": true,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 9,
"display_name": "Weights \u0026 Biases",
"id": 215,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "PLATFORM",
"slug": "weights-biases",
"sub_category_id": 171,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Experiment Tracking and Evaluation",
"id": 44,
"rationale": "Tools and practices for recording experiments, comparing runs, and assessing model quality before release. This dimension focuses on reproducibility, metrics, artifacts, and offline evaluation workflows.",
"slug": "experiment-tracking-and-evaluation",
"source": "db"
},
"input_skill": "Weights \u0026 Biases",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
}
],
"input_skill": "Weights \u0026 Biases",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "Airflow",
"alias_type": "CANONICAL",
"id": 526,
"is_primary": true,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "airflow 2",
"alias_type": "VERSION",
"id": 2477,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "airflow-2",
"alias_type": "VERSION",
"id": 2478,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "airflow2",
"alias_type": "VERSION",
"id": 2476,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "airflow2.x",
"alias_type": "VERSION",
"id": 2479,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "apache airflow 2",
"alias_type": "VERSION",
"id": 2480,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 13,
"display_name": "Airflow",
"id": 265,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "TOOL",
"slug": "airflow",
"sub_category_id": 130,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Workflow Orchestration for ML Pipelines",
"id": 54,
"rationale": "Workflow engines used to coordinate training, evaluation, deployment, and retraining jobs. This cluster covers dependencies, retries, scheduling, and pipeline composition for ML lifecycle automation.",
"slug": "workflow-orchestration-for-ml-pipelines",
"source": "db"
},
"input_skill": "Airflow",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
}
],
"input_skill": "Airflow",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "Prefect",
"alias_type": "CANONICAL",
"id": 306,
"is_primary": true,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 13,
"display_name": "Prefect",
"id": 112,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "TOOL",
"slug": "prefect",
"sub_category_id": 130,
"typical_lifespan": "EVERGREEN",
"volatility": "EMERGING"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Data Pipeline Orchestration",
"id": 23,
"rationale": "Workflow engines that schedule, coordinate, and recover batch data jobs. This cluster covers dependency management, retries, backfills, sensors, and operational control of pipeline DAGs.",
"slug": "data-pipeline-orchestration",
"source": "db"
},
"input_skill": "Prefect",
"llm_role": null,
"roles_from_db": [
{
"display_name": "Data Engineer",
"id": 2,
"rationale": null,
"role_archetype": null,
"slug": "data-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Workflow Orchestration for ML Pipelines",
"id": 54,
"rationale": "Workflow engines used to coordinate training, evaluation, deployment, and retraining jobs. This cluster covers dependencies, retries, scheduling, and pipeline composition for ML lifecycle automation.",
"slug": "workflow-orchestration-for-ml-pipelines",
"source": "db"
},
"input_skill": "Prefect",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
}
],
"input_skill": "Prefect",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "Kubernetes",
"alias_type": "CANONICAL",
"id": 1267,
"is_primary": true,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "Kubernetes 1.0+",
"alias_type": "VERSION",
"id": 1271,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "Kubernetes 1.x",
"alias_type": "VERSION",
"id": 1270,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "Kubernetes v1",
"alias_type": "VERSION",
"id": 1269,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "k8s",
"alias_type": "VERSION",
"id": 1268,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "kubernetes 1.x",
"alias_type": "VERSION",
"id": 1400,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
},
{
"alias_text": "kubernetes latest",
"alias_type": "VERSION",
"id": 1401,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 9,
"display_name": "Kubernetes",
"id": 726,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "PLATFORM",
"slug": "kubernetes",
"sub_category_id": 557,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Container Orchestration Platforms",
"id": 134,
"rationale": "Platforms that schedule and manage containerized workloads across clusters and environments. Cloud Architects need these to define workload placement standards, cluster boundaries, and platform capabilities.",
"slug": "container-orchestration-platforms",
"source": "db"
},
"input_skill": "Kubernetes",
"llm_role": null,
"roles_from_db": [
{
"display_name": "Cloud Architect",
"id": 9,
"rationale": null,
"role_archetype": null,
"slug": "cloud-architect",
"source": "db"
},
{
"display_name": "DevOps Engineer",
"id": 10,
"rationale": null,
"role_archetype": null,
"slug": "devops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Kubernetes for ML Workloads",
"id": 47,
"rationale": "Kubernetes-native components used to schedule, accelerate, and isolate ML training and serving workloads. This includes GPU enablement and ML-specific controllers rather than generic cluster administration.",
"slug": "kubernetes-for-ml-workloads",
"source": "db"
},
"input_skill": "Kubernetes",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
]
}
],
"input_skill": "Kubernetes",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
},
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "GPU scheduling",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Infrastructure Tools",
"skill_nature": "CONCEPT",
"sub_category": "general",
"typical_lifespan": "MULTI_YEAR",
"version_strategy": "UNVERSIONED",
"volatility": "MEDIUM"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "gpu-scheduling",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
},
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "BM25",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Search Algorithms",
"skill_nature": "CONCEPT",
"sub_category": "general",
"typical_lifespan": "EVERGREEN",
"version_strategy": "UNVERSIONED",
"volatility": "STABLE"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "bm25",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
},
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "INT8",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Data Types",
"skill_nature": "CONCEPT",
"sub_category": "general",
"typical_lifespan": "MULTI_YEAR",
"version_strategy": "UNVERSIONED",
"volatility": "MEDIUM"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "int8",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
},
{
"aliases_in_db": [],
"canonical": null,
"dimensions": [],
"input_skill": "FP16",
"matched_via": null,
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": {
"derived": {
"category": "Data Types",
"skill_nature": "CONCEPT",
"sub_category": "general",
"typical_lifespan": "MULTI_YEAR",
"version_strategy": "UNVERSIONED",
"volatility": "MEDIUM"
},
"enrichment": null,
"keep_log": [],
"locked_dimensions": [],
"merge_log": [],
"placed": null,
"relationships": null,
"skill_id": "fp16",
"split_log": [],
"typed": null,
"warnings": []
},
"source_tag": "llm",
"was_in_llm_skills": true
},
{
"aliases_in_db": [
{
"alias_text": "CI/CD",
"alias_type": "CANONICAL",
"id": 1826,
"is_primary": false,
"match_strategy": "CASE_INSENSITIVE"
}
],
"canonical": {
"category_id": 8,
"display_name": "CI/CD",
"id": 1190,
"is_also_category": false,
"is_extractable": true,
"skill_nature": "METHODOLOGY",
"slug": "ci-cd",
"sub_category_id": 900,
"typical_lifespan": "EVERGREEN",
"volatility": "STABLE"
},
"dimensions": [
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "CI/CD Pipeline Platforms",
"id": 150,
"rationale": "Systems used to define, run, and maintain automated build and deployment workflows. This cluster is coherent because the role owns delivery automation end to end, including pipeline reliability and promotion logic.",
"slug": "ci-cd-pipeline-platforms",
"source": "db"
},
"input_skill": "CI/CD",
"llm_role": null,
"roles_from_db": [
{
"display_name": "DevOps Engineer",
"id": 10,
"rationale": null,
"role_archetype": null,
"slug": "devops-engineer",
"source": "db"
}
]
},
{
"dimension": {
"difficulty_hint": "well_known",
"display_name": "CI/CD for Machine Learning",
"id": 56,
"rationale": "Tools and platforms for automating ML model integration, testing, and deployment pipelines.",
"slug": "ci-cd-for-machine-learning",
"source": "db"
},
"input_skill": "CI/CD",
"llm_role": null,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
}
]
}
],
"input_skill": "CI/CD",
"matched_via": "alias",
"new_alias_persisted": false,
"new_alias_text": null,
"new_skill_meta": null,
"source_tag": "db",
"was_in_llm_skills": true
}
],
"unmatched_skills": [
"Hugging Face Transformers",
"Llama 3",
"Mistral",
"TGI",
"GPU scheduling",
"BM25",
"INT8",
"FP16"
]
}
API 3 — final-role-output
{
"chosen_role": {
"display_name": "ML Engineer",
"id": 3,
"rationale": "The role aligns with primary skills focused on Python and various ML frameworks.",
"role_archetype": "A specialist in developing and deploying machine learning models.",
"slug": "ml-engineer",
"source": "db"
},
"chosen_role_resolution": "in_db",
"final_input_skills": [
{
"skill": "Python",
"tag": "in_db"
},
{
"skill": "PyTorch",
"tag": "in_db"
},
{
"skill": "Hugging Face Transformers",
"tag": "new"
},
{
"skill": "Llama 3",
"tag": "new"
},
{
"skill": "Mistral",
"tag": "new"
},
{
"skill": "FAISS",
"tag": "in_db"
},
{
"skill": "pgvector",
"tag": "in_db"
},
{
"skill": "A/B Testing",
"tag": "in_db"
},
{
"skill": "MLflow",
"tag": "in_db"
},
{
"skill": "TensorFlow",
"tag": "in_db"
},
{
"skill": "JAX",
"tag": "in_db"
},
{
"skill": "Pinecone",
"tag": "in_db"
},
{
"skill": "Weaviate",
"tag": "in_db"
},
{
"skill": "text-embedding-3",
"tag": "in_db"
},
{
"skill": "BGE",
"tag": "in_db"
},
{
"skill": "vLLM",
"tag": "in_db"
},
{
"skill": "TGI",
"tag": "new"
},
{
"skill": "Triton",
"tag": "in_db"
},
{
"skill": "Kubeflow",
"tag": "in_db"
},
{
"skill": "Weights \u0026 Biases",
"tag": "in_db"
},
{
"skill": "Airflow",
"tag": "in_db"
},
{
"skill": "Prefect",
"tag": "in_db"
},
{
"skill": "Kubernetes",
"tag": "in_db"
},
{
"skill": "GPU scheduling",
"tag": "new"
},
{
"skill": "BM25",
"tag": "new"
},
{
"skill": "INT8",
"tag": "new"
},
{
"skill": "FP16",
"tag": "new"
},
{
"skill": "CI/CD",
"tag": "in_db"
}
],
"llm_cost_api1_usd": null,
"llm_cost_api2_usd": null,
"llm_cost_api3_usd": null,
"llm_cost_total_usd": null,
"persistence": {
"items": [
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Cloud Security Scripting \u0026 DSL Languages",
"id": 248,
"rationale": "Proficiency in programming and domain-specific languages used to automate and script cloud security controls.",
"slug": "cloud-security-scripting-dsl-languages",
"source": "db"
},
"dimension_id": 248,
"input_skill": "Python",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "Cloud Security Engineer",
"id": 23,
"rationale": null,
"role_archetype": null,
"slug": "cloud-security-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 5,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Programming Languages",
"id": 1,
"rationale": "Primary implementation languages used to build client and server feature code. Full stack engineers need enough fluency to move across layers and implement product behavior end to end.",
"slug": "programming-languages",
"source": "db"
},
"dimension_id": 1,
"input_skill": "Python",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "Backend Developer",
"id": 1,
"rationale": null,
"role_archetype": "A Backend Engineer designs, builds, and maintains the server-side logic and data handling that power applications and services. They focus on implementing reliable business functionality, integrating with other systems, and ensuring the backend is scalable, maintainable, and observable.",
"slug": "backend-engineer",
"source": "db"
},
{
"display_name": "Fullstack Developer",
"id": 15,
"rationale": null,
"role_archetype": null,
"slug": "full-stack-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 5,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Programming Languages and Scripting",
"id": 59,
"rationale": "Languages used to write security automation, analysis scripts, detection logic, and remediation helpers. This is the primary implementation surface for a cybersecurity engineer across tooling and response workflows.",
"slug": "programming-languages-and-scripting",
"source": "db"
},
"dimension_id": 59,
"input_skill": "Python",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "Cybersecurity Engineer",
"id": 5,
"rationale": null,
"role_archetype": null,
"slug": "cybersecurity-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 5,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Programming Languages for Data Work",
"id": 21,
"rationale": "Languages used to implement data pipelines, transformations, and operational glue. This is the primary coding surface for building ingestion, enrichment, and automation logic in data engineering.",
"slug": "programming-languages-for-data-work",
"source": "db"
},
"dimension_id": 21,
"input_skill": "Python",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "Data Engineer",
"id": 2,
"rationale": null,
"role_archetype": null,
"slug": "data-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 5,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Programming Languages for ML Systems",
"id": 39,
"rationale": "Languages used to build training code, inference services, evaluation jobs, and ML glue code. This is the primary implementation surface for ML engineers across experimentation and productionization.",
"slug": "programming-languages-for-ml-systems",
"source": "db"
},
"dimension_id": 39,
"input_skill": "Python",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 5,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Programming Languages for XR",
"id": 97,
"rationale": "Primary implementation languages used to build immersive client features, interaction logic, and device-specific runtime behavior. This is the core coding surface for AR/VR experiences.",
"slug": "programming-languages-for-xr",
"source": "db"
},
"dimension_id": 97,
"input_skill": "Python",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "AR/VR Engineer",
"id": 8,
"rationale": null,
"role_archetype": null,
"slug": "ar-vr-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 5,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "ML Frameworks and Libraries",
"id": 40,
"rationale": "Core libraries used to define models, train them, run inference, and evaluate predictive performance. These frameworks shape how ML engineers express model architectures and training loops.",
"slug": "ml-frameworks-and-libraries",
"source": "db"
},
"dimension_id": 40,
"input_skill": "PyTorch",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 195,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Model Fine-Tuning \u0026 Adaptation",
"id": 212,
"rationale": "Techniques and libraries for adapting pre-trained language models to specific tasks or domains.",
"slug": "model-fine-tuning-adaptation",
"source": "db"
},
"dimension_id": 212,
"input_skill": "PyTorch",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 195,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "ML Frameworks and Libraries",
"id": 40,
"rationale": "Core libraries used to define models, train them, run inference, and evaluate predictive performance. These frameworks shape how ML engineers express model architectures and training loops.",
"slug": "ml-frameworks-and-libraries",
"source": "db"
},
"dimension_id": 40,
"input_skill": "FAISS",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 1184,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "React Frontend Development",
"id": 96,
"rationale": "Building interactive web user interfaces with React.js, including component composition, state management, hooks, and rendering patterns. React.js belongs here because it is a core library for client-side UI development in modern web applications.",
"slug": "d_init_01",
"source": "db"
},
"dimension_id": 96,
"input_skill": "FAISS",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [],
"skill_dimension_saved": true,
"skill_id": 1184,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Vector Databases",
"id": 198,
"rationale": "Specialized storage and indexing systems used to persist embeddings and support similarity search. This is a distinct vendor-family cluster because AI features often depend on a concrete vector store choice and its operational behavior.",
"slug": "vector-databases",
"source": "db"
},
"dimension_id": 198,
"input_skill": "pgvector",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 1246,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "React Frontend Development",
"id": 96,
"rationale": "Building interactive web user interfaces with React.js, including component composition, state management, hooks, and rendering patterns. React.js belongs here because it is a core library for client-side UI development in modern web applications.",
"slug": "d_init_01",
"source": "db"
},
"dimension_id": 96,
"input_skill": "A/B Testing",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [],
"skill_dimension_saved": true,
"skill_id": 1613,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Systems Programming",
"id": 166,
"rationale": "Systems programming covers low-level software development where performance, memory safety, and direct control over resources matter. Rust fits here because it is commonly used for OS-adjacent services, infrastructure components, and other performance-sensitive systems code.",
"slug": "d_init_02",
"source": "db"
},
"dimension_id": 166,
"input_skill": "A/B Testing",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [],
"skill_dimension_saved": true,
"skill_id": 1613,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "MLOps Platforms and Lifecycle",
"id": 43,
"rationale": "End-to-end managed platforms used to train, deploy, register, and govern models across their lifecycle. This is the operational control plane for production ML workflows.",
"slug": "mlops-platforms-and-lifecycle",
"source": "db"
},
"dimension_id": 43,
"input_skill": "MLflow",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 214,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "ML Frameworks and Libraries",
"id": 40,
"rationale": "Core libraries used to define models, train them, run inference, and evaluate predictive performance. These frameworks shape how ML engineers express model architectures and training loops.",
"slug": "ml-frameworks-and-libraries",
"source": "db"
},
"dimension_id": 40,
"input_skill": "TensorFlow",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 196,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "ML Frameworks and Libraries",
"id": 40,
"rationale": "Core libraries used to define models, train them, run inference, and evaluate predictive performance. These frameworks shape how ML engineers express model architectures and training loops.",
"slug": "ml-frameworks-and-libraries",
"source": "db"
},
"dimension_id": 40,
"input_skill": "JAX",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 199,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "LLM Operations and Orchestration",
"id": 49,
"rationale": "Operational stack for building, serving, evaluating, and orchestrating LLM-based systems. This includes vector retrieval, prompt workflows, LLM serving, and observability for generative applications.",
"slug": "llm-operations-and-orchestration",
"source": "db"
},
"dimension_id": 49,
"input_skill": "Pinecone",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
},
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 242,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Vector Databases",
"id": 198,
"rationale": "Specialized storage and indexing systems used to persist embeddings and support similarity search. This is a distinct vendor-family cluster because AI features often depend on a concrete vector store choice and its operational behavior.",
"slug": "vector-databases",
"source": "db"
},
"dimension_id": 198,
"input_skill": "Pinecone",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 242,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Vector Databases",
"id": 198,
"rationale": "Specialized storage and indexing systems used to persist embeddings and support similarity search. This is a distinct vendor-family cluster because AI features often depend on a concrete vector store choice and its operational behavior.",
"slug": "vector-databases",
"source": "db"
},
"dimension_id": 198,
"input_skill": "Weaviate",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 1242,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Embedding Models",
"id": 199,
"rationale": "Models and APIs used to convert text or other content into vector representations for retrieval, clustering, and semantic matching. This cluster is coherent because embedding choice affects retrieval quality, cost, and latency in AI features.",
"slug": "embedding-models",
"source": "db"
},
"dimension_id": 199,
"input_skill": "text-embedding-3",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Skipped \u2014 no persistable v3 meta for new skill",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
],
"skill_dimension_saved": false,
"skill_id": null,
"skill_tag": "new",
"skipped_reason": "skill_not_in_db_v3_proposed"
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Embedding Models",
"id": 199,
"rationale": "Models and APIs used to convert text or other content into vector representations for retrieval, clustering, and semantic matching. This cluster is coherent because embedding choice affects retrieval quality, cost, and latency in AI features.",
"slug": "embedding-models",
"source": "db"
},
"dimension_id": 199,
"input_skill": "BGE",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 1249,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "LLM Operations and Orchestration",
"id": 49,
"rationale": "Operational stack for building, serving, evaluating, and orchestrating LLM-based systems. This includes vector retrieval, prompt workflows, LLM serving, and observability for generative applications.",
"slug": "llm-operations-and-orchestration",
"source": "db"
},
"dimension_id": 49,
"input_skill": "vLLM",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
},
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 241,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "LLM Serving \u0026 Deployment",
"id": 209,
"rationale": "Tools and frameworks for hosting and serving LLM models in production environments.",
"slug": "llm-serving-deployment",
"source": "db"
},
"dimension_id": 209,
"input_skill": "vLLM",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "AI Engineer",
"id": 13,
"rationale": null,
"role_archetype": null,
"slug": "ai-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 241,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Model Optimization and Acceleration",
"id": 53,
"rationale": "Techniques for improving inference latency, throughput, memory use, and training efficiency. ML engineers use these methods to meet production constraints without sacrificing too much quality.",
"slug": "model-optimization-and-acceleration",
"source": "db"
},
"dimension_id": 53,
"input_skill": "Triton",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 1367,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "MLOps Platforms and Lifecycle",
"id": 43,
"rationale": "End-to-end managed platforms used to train, deploy, register, and govern models across their lifecycle. This is the operational control plane for production ML workflows.",
"slug": "mlops-platforms-and-lifecycle",
"source": "db"
},
"dimension_id": 43,
"input_skill": "Kubeflow",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 213,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Experiment Tracking and Evaluation",
"id": 44,
"rationale": "Tools and practices for recording experiments, comparing runs, and assessing model quality before release. This dimension focuses on reproducibility, metrics, artifacts, and offline evaluation workflows.",
"slug": "experiment-tracking-and-evaluation",
"source": "db"
},
"dimension_id": 44,
"input_skill": "Weights \u0026 Biases",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 215,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Workflow Orchestration for ML Pipelines",
"id": 54,
"rationale": "Workflow engines used to coordinate training, evaluation, deployment, and retraining jobs. This cluster covers dependencies, retries, scheduling, and pipeline composition for ML lifecycle automation.",
"slug": "workflow-orchestration-for-ml-pipelines",
"source": "db"
},
"dimension_id": 54,
"input_skill": "Airflow",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 265,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Data Pipeline Orchestration",
"id": 23,
"rationale": "Workflow engines that schedule, coordinate, and recover batch data jobs. This cluster covers dependency management, retries, backfills, sensors, and operational control of pipeline DAGs.",
"slug": "data-pipeline-orchestration",
"source": "db"
},
"dimension_id": 23,
"input_skill": "Prefect",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "Data Engineer",
"id": 2,
"rationale": null,
"role_archetype": null,
"slug": "data-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 112,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Workflow Orchestration for ML Pipelines",
"id": 54,
"rationale": "Workflow engines used to coordinate training, evaluation, deployment, and retraining jobs. This cluster covers dependencies, retries, scheduling, and pipeline composition for ML lifecycle automation.",
"slug": "workflow-orchestration-for-ml-pipelines",
"source": "db"
},
"dimension_id": 54,
"input_skill": "Prefect",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 112,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Container Orchestration Platforms",
"id": 134,
"rationale": "Platforms that schedule and manage containerized workloads across clusters and environments. Cloud Architects need these to define workload placement standards, cluster boundaries, and platform capabilities.",
"slug": "container-orchestration-platforms",
"source": "db"
},
"dimension_id": 134,
"input_skill": "Kubernetes",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "Cloud Architect",
"id": 9,
"rationale": null,
"role_archetype": null,
"slug": "cloud-architect",
"source": "db"
},
{
"display_name": "DevOps Engineer",
"id": 10,
"rationale": null,
"role_archetype": null,
"slug": "devops-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 726,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "Kubernetes for ML Workloads",
"id": 47,
"rationale": "Kubernetes-native components used to schedule, accelerate, and isolate ML training and serving workloads. This includes GPU enablement and ML-specific controllers rather than generic cluster administration.",
"slug": "kubernetes-for-ml-workloads",
"source": "db"
},
"dimension_id": 47,
"input_skill": "Kubernetes",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
},
{
"display_name": "ML Ops Engineer",
"id": 16,
"rationale": null,
"role_archetype": null,
"slug": "ml-ops-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 726,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "CI/CD Pipeline Platforms",
"id": 150,
"rationale": "Systems used to define, run, and maintain automated build and deployment workflows. This cluster is coherent because the role owns delivery automation end to end, including pipeline reliability and promotion logic.",
"slug": "ci-cd-pipeline-platforms",
"source": "db"
},
"dimension_id": 150,
"input_skill": "CI/CD",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "DevOps Engineer",
"id": 10,
"rationale": null,
"role_archetype": null,
"slug": "devops-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 1190,
"skill_tag": "in_db",
"skipped_reason": null
},
{
"chosen_role_id": 3,
"dimension": {
"difficulty_hint": "well_known",
"display_name": "CI/CD for Machine Learning",
"id": 56,
"rationale": "Tools and platforms for automating ML model integration, testing, and deployment pipelines.",
"slug": "ci-cd-for-machine-learning",
"source": "db"
},
"dimension_id": 56,
"input_skill": "CI/CD",
"llm_role": null,
"matched_chosen_role": false,
"outcome_line": "Existing dimension (library) \u00b7 Role\u2194dimension skipped (dimension not under chosen role)",
"role_dimension_saved": false,
"roles_from_db": [
{
"display_name": "ML Engineer",
"id": 3,
"rationale": null,
"role_archetype": null,
"slug": "ml-engineer",
"source": "db"
}
],
"skill_dimension_saved": true,
"skill_id": 1190,
"skill_tag": "in_db",
"skipped_reason": null
}
],
"new_skills_created": 0,
"role_dimension_saved": 0,
"skill_dimension_saved": 0,
"skipped": 1
},
"planner_output": null,
"run_id": "48b52573-a42b-4e34-9481-72d73c5404f8"
}
LLM Calls
Every model call made for this run, in pipeline order. Click a card to see the model's response.