← Back to history

Pipeline run

fc829bda-6a1e-4086-befc-133024825a6d

Pipeline LLM cost (USD)
API 1: $0.0034 API 2: $0.0003 API 3: $0.0000 Total: $0.0037

Client output enrichment

v2 Skill cluster · Nature of work · AI index · Tech stack maturity · Evidence · KRA description
role baseline loaded sources · ai_index: jd · nature_of_work: jd · tech_stack_maturity: jd
Nature of work · Data pipeline development
Build and maintain cloud-to-warehouse data pipelines, keep ETL/BI and warehouse tooling running, and document lineage while partnering with data/analytics teams to deliver trusted self-service datasets and improve standards.
"Design and build data pipelines from various cloud data sources for the enterprise data warehouse."
Tech stack maturity
Mainstream Modern
AI index (0 = no AI use, 5 = totally AI-dependent · v2.1)
0.00 / 5
· Title match
· Has AI skill
· AI skill (primary)
· AI skill (secondary)
· On AI team
· Builds AI products
vocab breakdown (legacy)
Assistants (×1):
Frameworks (×2):
Models / concepts (×3):
Evidence — skills matched in JD (8)
Data Pipelines Cloud Data Sources Data Warehouse ETL BI Tools Data Lineage Standards Best Practices
Skill cluster (1 dimension groups, role-scoped)
Cross-cutting / unaligned
Data Pipelines Cloud Data Sources Data Warehouse ETL BI Tools Data Lineage Standards Best Practices
Show KRA description ↓
Design and build data pipelines from various cloud data sources for the enterprise data warehouse. Partner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service. Own and document data pipelines and data lineage. Support and Maintain analytics tech ecosystem (data warehouse, ETL and BI tools). Continuously identify areas of improvement and ensure application of standards and best practices.

Signals

Skill
Alias data-engineer
1.00
KRA data-engineer
0.66

Post-classification

Centroidupdated · n=361
Alias collision log
New-role queue
New skills captured8
New KRA captured

Captured for admin review

Data Pipelines primary Data Engineer pending
Cloud Data Sources primary Data Engineer pending
Data Warehouse primary Data Engineer pending
ETL primary Data Engineer pending
BI Tools primary Data Engineer pending
Data Lineage Data Engineer pending
Standards Data Engineer pending
Best Practices Data Engineer pending
Status: completed Created: 2026-05-27T15:54:12.473633Z Updated: 2026-06-12T15:47:14.033112Z API 3 duration: 2063 ms
Flow Current 3-step pipeline

1 POST /skills/extract-from-jd

2 POST /skills/extract-details

3 POST /skills/final-role-output

Role Chosen role & resolution

Data Engineer

CASE A

slug: data-engineer · id: 2 · source: db

Exact alias hit on data-engineer (1.0) — no other alias at this confidence; skill_top absent does not contradict

Resolution: in_db — role exists in library; skill↔dim and role↔dim links saved when applicable.

0
New skills
0
Skill↔dim saved
0
Role↔dim saved
0
Skipped

Job description

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.

 Job Category Products and Technology Job Details 

Senior Data Engineer (Sales & CS)

As a Data Engineer, working in our Business Technology - Data and Analytics team, you will be working cross-functionally with business domain experts, analytics, and engineering teams to design and build data pipelines for our Data Warehouse. The data pipelines you build and maintain will enable our business partners and decision makers to get actionable insights from our Product and Corporate Systems. You would also be responsible for enhancements and maintaining our data warehouse.

This is a great role for people passionate about working with data and data systems, and who love solving problems. It is for people who love technical challenges and are always looking for ways to improve existing software, processes and infrastructure. They are self-starters, detail and quality oriented.

In short, we are looking for someone who takes pride in the craft and wants to be part of a team of talented data engineers and architects working to further Slack’s growth. If this role has your name written all over it, please contact us with a resume so that we can explore further.

Responsibilities

 Design and build data pipelines from various cloud data sources for the enterprise data warehouse  Partner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service  Own and document data pipelines and data lineage  Support and Maintain analytics tech ecosystem (data warehouse, ETL and BI tools)  Continuously identify areas of improvement and ensure application of standards and best practices 

Requirements

 BS degree in Computer Science or other engineering discipline.  5+ years of experience working as a Data Engineer or a similar role.  2+ years of experience working with Sales data analytics or sourcing data from sales systems  Very strong experience in writing complex SQLs and dimensional modeling  Hands-on experience working with data warehouse technologies (Snowflake, Redshift) and Big Data technologies (e.g Hadoop, Hive, Spark)  Hands on experience building data pipelines using ETL tools (e.g. Informatica, Matillion, Snaplogic) sourcing data from SaaS applications  Proficiency with Python  Ability to work on multiple areas like Data pipeline ETL, Data modeling & design, writing complex SQL queries etc.  Ability to build the automation processes for the data quality and data reconciliation  Understanding of CRM systems such as Salesforce is a big plus.  Understanding of sales metrics for SaaS companies is a big plus.  Proficiency in Airflow is a big plus.  Passionate about various technologies including but not limited to SQL/No SQL/MPP databases etc.  Excellent written and verbal communication and interpersonal skills, able to effectively collaborate with technical and business partners 

Slack is the collaboration hub of choice for companies of all sizes, all across the world. By using Slack, they ensure that the right people are always in the loop, that key information is always at their fingertips, and new team members can get up to speed easily. With Slack, teams are better connected.

Launched in February 2014, Slack is the fastest growing business application ever and is used by thousands of teams and millions of users every day. We currently have many offices worldwide, including in San Francisco, Vancouver, Dublin, Melbourne, New York, London, Tokyo, Toronto, Denver and Pune.

Ensuring a diverse and inclusive workplace where we learn from each other is core to Slack's values. We welcome people of different backgrounds, experiences, abilities and perspectives. We are an equal opportunity employer and a pleasant and supportive place to work. Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

 Come do the best work of your life here at Slack. 

 Accommodations 

If you require assistance due to a disability applying for open positions please submit a request via this Accommodations Request Form .

 Posting Statement 

At Salesforce we believe that the business of business is to improve the state of our world. Each of us has a responsibility to drive Equality in our communities and workplaces. We are committed to creating a workforce that reflects society through inclusive programs and initiatives such as equal pay, employee resource groups, inclusive benefits, and more. Learn more about Equality at Salesforce and explore our benefits.

Salesforce.com and Salesforce.org are Equal Employment Opportunity and Affirmative Action Employers. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status. Salesforce.com and Salesforce.org do not accept unsolicited headhunter and agency resumes. Salesforce.com and Salesforce.org will not pay any third-party agency or company that does not have a signed agreement with Salesforce.com or Salesforce.org .

Salesforce welcomes all.

Skills from this JD

Each row merges API 1 extraction, API 2 library match / v3 orchestration (dimensions + locked dims), and API 3 persistence tags.

Data Pipelines Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields
Category
Data Engineering Tools
Sub-category
general
Skill nature
CONCEPT
Volatility
MEDIUM
Typical lifespan
MULTI_YEAR
Version strategy
UNVERSIONED
Cloud Data Sources Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields
Category
Cloud Platforms
Sub-category
general
Skill nature
PLATFORM
Volatility
MEDIUM
Typical lifespan
MULTI_YEAR
Version strategy
UNVERSIONED
Data Warehouse Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields
Category
Databases
Sub-category
general
Skill nature
CONCEPT
Volatility
MEDIUM
Typical lifespan
MULTI_YEAR
Version strategy
UNVERSIONED
ETL Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields
Category
Data Engineering Tools
Sub-category
general
Skill nature
PRACTICE
Volatility
MEDIUM
Typical lifespan
MULTI_YEAR
Version strategy
UNVERSIONED
BI Tools Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields
Category
Data Engineering Tools
Sub-category
general
Skill nature
TOOL
Volatility
MEDIUM
Typical lifespan
MULTI_YEAR
Version strategy
UNVERSIONED
Data Lineage Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields
Category
Data Engineering Tools
Sub-category
general
Skill nature
CONCEPT
Volatility
MEDIUM
Typical lifespan
MULTI_YEAR
Version strategy
UNVERSIONED
Standards Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields
Category
Best Practices
Sub-category
general
Skill nature
CONCEPT
Volatility
STABLE
Typical lifespan
EVERGREEN
Version strategy
UNVERSIONED
Best Practices Secondary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields
Category
Best Practices
Sub-category
general
Skill nature
PRACTICE
Volatility
STABLE
Typical lifespan
EVERGREEN
Version strategy
UNVERSIONED

Library artifacts (this run)

Kind Detail DB id
canonical_skill_proposed Data Pipelines | type=Data Engineering Tools subtype=general nature=CONCEPT lifespan=MULTI_YEAR
canonical_skill_proposed Cloud Data Sources | type=Cloud Platforms subtype=general nature=PLATFORM lifespan=MULTI_YEAR
canonical_skill_proposed Data Warehouse | type=Databases subtype=general nature=CONCEPT lifespan=MULTI_YEAR
canonical_skill_proposed ETL | type=Data Engineering Tools subtype=general nature=PRACTICE lifespan=MULTI_YEAR
canonical_skill_proposed BI Tools | type=Data Engineering Tools subtype=general nature=TOOL lifespan=MULTI_YEAR
canonical_skill_proposed Data Lineage | type=Data Engineering Tools subtype=general nature=CONCEPT lifespan=MULTI_YEAR
canonical_skill_proposed Standards | type=Best Practices subtype=general nature=CONCEPT lifespan=EVERGREEN
canonical_skill_proposed Best Practices | type=Best Practices subtype=general nature=PRACTICE lifespan=EVERGREEN
nano JD Parser — gpt-4.1-nano click to toggle
RoleSenior Data Engineer (Sales & CS)
CompanySalesforce
Experience5+ years of experience working as a Data Engineer or a similar role.
DomainSoftware & SaaS Products
Location San Francisco, United States (null)
JD type pass
Show raw JSON
{
  "JD_type": "pass",
  "about_company": {
    "source_marker": {
      "first_5_words": "Slack is the collaboration hub",
      "last_5_words": "teams are better connected."
    },
    "text": "Slack is the collaboration hub of choice for companies of all sizes, all across the world. By using Slack, they ensure that the right people are always in the loop, that key information is always at their fingertips, and new team members can get up to speed easily. With Slack, teams are better connected.",
    "word_count": 64
  },
  "certifications": [],
  "company_name": "Salesforce",
  "ctc": null,
  "domain": {
    "primary": {
      "aliases": [
        "SaaS",
        "Cloud Computing"
      ],
      "domain": "Software \u0026 SaaS Products"
    },
    "secondary": null
  },
  "education": [
    {
      "level": "Bachelor\u0027s",
      "qualification": "BTECH/BE/BSC - Computer Science (or other engineering discipline)",
      "raw": "BS degree in Computer Science or other engineering discipline.",
      "requirement": "required"
    }
  ],
  "experience": {
    "max": null,
    "min": 5,
    "raw": "5+ years of experience working as a Data Engineer or a similar role."
  },
  "job_locations": [
    {
      "aliases": [
        "SF"
      ],
      "city": "San Francisco",
      "country": "United States",
      "state": "California",
      "work_mode": "null"
    },
    {
      "aliases": [],
      "city": "Vancouver",
      "country": "Canada",
      "state": "British Columbia",
      "work_mode": "null"
    },
    {
      "aliases": [],
      "city": "Dublin",
      "country": "Ireland",
      "state": "Dublin",
      "work_mode": "null"
    },
    {
      "aliases": [],
      "city": "Melbourne",
      "country": "Australia",
      "state": "Victoria",
      "work_mode": "null"
    },
    {
      "aliases": [],
      "city": "New York",
      "country": "United States",
      "state": "New York",
      "work_mode": "null"
    },
    {
      "aliases": [],
      "city": "London",
      "country": "United Kingdom",
      "state": "England",
      "work_mode": "null"
    },
    {
      "aliases": [],
      "city": "Tokyo",
      "country": "Japan",
      "state": "Tokyo",
      "work_mode": "null"
    },
    {
      "aliases": [],
      "city": "Toronto",
      "country": "Canada",
      "state": "Ontario",
      "work_mode": "null"
    },
    {
      "aliases": [],
      "city": "Denver",
      "country": "United States",
      "state": "Colorado",
      "work_mode": "null"
    },
    {
      "aliases": [],
      "city": "Pune",
      "country": "India",
      "state": "Maharashtra",
      "work_mode": "null"
    }
  ],
  "role": "Senior Data Engineer (Sales \u0026 CS)",
  "role_aliases": [
    "Data Engineer",
    "Senior Data Engineer",
    "Data Pipeline Engineer"
  ],
  "role_archetype": "Data",
  "roles_and_responsibilities": [
    {
      "bullet_count": 5,
      "heading": "Responsibilities",
      "heading_was_present": true,
      "source_marker": {
        "first_5_words": "Design and build data pipelines",
        "last_5_words": "standards and best practices."
      },
      "text": "Design and build data pipelines from various cloud data sources for the enterprise data warehouse.\nPartner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service.\nOwn and document data pipelines and data lineage.\nSupport and Maintain analytics tech ecosystem (data warehouse, ETL and BI tools).\nContinuously identify areas of improvement and ensure application of standards and best practices.",
      "word_count": 64
    }
  ],
  "urls": []
}
API 1 — extract-from-jd click to toggle
{
  "final_skills": [
    {
      "is_primary": true,
      "skill_name": "Data Pipelines"
    },
    {
      "is_primary": true,
      "skill_name": "Cloud Data Sources"
    },
    {
      "is_primary": true,
      "skill_name": "Data Warehouse"
    },
    {
      "is_primary": true,
      "skill_name": "ETL"
    },
    {
      "is_primary": true,
      "skill_name": "BI Tools"
    },
    {
      "is_primary": false,
      "skill_name": "Data Lineage"
    },
    {
      "is_primary": false,
      "skill_name": "Standards"
    },
    {
      "is_primary": false,
      "skill_name": "Best Practices"
    }
  ],
  "jd_role": {
    "display_name": "Senior Data Engineer (Sales \u0026 CS)",
    "rationale": null,
    "role_aliases": [
      "Data Engineer",
      "Senior Data Engineer",
      "Data Pipeline Engineer"
    ],
    "role_archetype": "Data",
    "slug": ""
  },
  "nano_parsed": {
    "JD_type": "pass",
    "about_company": {
      "source_marker": {
        "first_5_words": "Slack is the collaboration hub",
        "last_5_words": "teams are better connected."
      },
      "text": "Slack is the collaboration hub of choice for companies of all sizes, all across the world. By using Slack, they ensure that the right people are always in the loop, that key information is always at their fingertips, and new team members can get up to speed easily. With Slack, teams are better connected.",
      "word_count": 64
    },
    "certifications": [],
    "company_name": "Salesforce",
    "ctc": null,
    "domain": {
      "primary": {
        "aliases": [
          "SaaS",
          "Cloud Computing"
        ],
        "domain": "Software \u0026 SaaS Products"
      },
      "secondary": null
    },
    "education": [
      {
        "level": "Bachelor\u0027s",
        "qualification": "BTECH/BE/BSC - Computer Science (or other engineering discipline)",
        "raw": "BS degree in Computer Science or other engineering discipline.",
        "requirement": "required"
      }
    ],
    "experience": {
      "max": null,
      "min": 5,
      "raw": "5+ years of experience working as a Data Engineer or a similar role."
    },
    "job_locations": [
      {
        "aliases": [
          "SF"
        ],
        "city": "San Francisco",
        "country": "United States",
        "state": "California",
        "work_mode": "null"
      },
      {
        "aliases": [],
        "city": "Vancouver",
        "country": "Canada",
        "state": "British Columbia",
        "work_mode": "null"
      },
      {
        "aliases": [],
        "city": "Dublin",
        "country": "Ireland",
        "state": "Dublin",
        "work_mode": "null"
      },
      {
        "aliases": [],
        "city": "Melbourne",
        "country": "Australia",
        "state": "Victoria",
        "work_mode": "null"
      },
      {
        "aliases": [],
        "city": "New York",
        "country": "United States",
        "state": "New York",
        "work_mode": "null"
      },
      {
        "aliases": [],
        "city": "London",
        "country": "United Kingdom",
        "state": "England",
        "work_mode": "null"
      },
      {
        "aliases": [],
        "city": "Tokyo",
        "country": "Japan",
        "state": "Tokyo",
        "work_mode": "null"
      },
      {
        "aliases": [],
        "city": "Toronto",
        "country": "Canada",
        "state": "Ontario",
        "work_mode": "null"
      },
      {
        "aliases": [],
        "city": "Denver",
        "country": "United States",
        "state": "Colorado",
        "work_mode": "null"
      },
      {
        "aliases": [],
        "city": "Pune",
        "country": "India",
        "state": "Maharashtra",
        "work_mode": "null"
      }
    ],
    "role": "Senior Data Engineer (Sales \u0026 CS)",
    "role_aliases": [
      "Data Engineer",
      "Senior Data Engineer",
      "Data Pipeline Engineer"
    ],
    "role_archetype": "Data",
    "roles_and_responsibilities": [
      {
        "bullet_count": 5,
        "heading": "Responsibilities",
        "heading_was_present": true,
        "source_marker": {
          "first_5_words": "Design and build data pipelines",
          "last_5_words": "standards and best practices."
        },
        "text": "Design and build data pipelines from various cloud data sources for the enterprise data warehouse.\nPartner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service.\nOwn and document data pipelines and data lineage.\nSupport and Maintain analytics tech ecosystem (data warehouse, ETL and BI tools).\nContinuously identify areas of improvement and ensure application of standards and best practices.",
        "word_count": 64
      }
    ],
    "urls": []
  },
  "rejected": false,
  "rejection_reason": null,
  "run_id": "fc829bda-6a1e-4086-befc-133024825a6d",
  "stage3_signals": {
    "alias_found": true,
    "alias_match_roles": [
      {
        "display_name": "Data Engineer",
        "kra_matches": null,
        "matched_count": null,
        "matched_skills": null,
        "role_id": 2,
        "score": 1.0,
        "slug": "data-engineer",
        "total_count": null
      }
    ],
    "kra_match_roles": [
      {
        "display_name": "Data Engineer",
        "kra_matches": [
          {
            "kra_text": "Works with data analysts, data scientists, and business stakeholders to define data models, ingestion schedules, and data delivery requirements.",
            "sentence": "Partner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service.",
            "similarity": 0.6885
          },
          {
            "kra_text": "Builds data ingestion pipelines to collect data from transactional databases, third-party APIs, event streams, and file sources into centralized data platforms.",
            "sentence": "Design and build data pipelines from various cloud data sources for the enterprise data warehouse.",
            "similarity": 0.6694
          },
          {
            "kra_text": "Maintains data catalog entries, column-level data lineage, and technical documentation to support data discoverability and governance across the organization.",
            "sentence": "Own and document data pipelines and data lineage.",
            "similarity": 0.6361
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 2,
        "score": 0.6646,
        "slug": "data-engineer",
        "total_count": null
      },
      {
        "display_name": "Svelte Frontend Developer",
        "kra_matches": [
          {
            "kra_text": "backend data integration",
            "sentence": "Design and build data pipelines from various cloud data sources for the enterprise data warehouse.",
            "similarity": 0.4725
          },
          {
            "kra_text": "backend data integration",
            "sentence": "Support and Maintain analytics tech ecosystem (data warehouse, ETL and BI tools).",
            "similarity": 0.4687
          },
          {
            "kra_text": "backend data integration",
            "sentence": "Partner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service.",
            "similarity": 0.4667
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 92,
        "score": 0.4693,
        "slug": "svelte-frontend-developer",
        "total_count": null
      },
      {
        "display_name": "ML Engineer",
        "kra_matches": [
          {
            "kra_text": "Prepares, cleans, and transforms training datasets, manages feature stores, and builds feature engineering pipelines for model training.",
            "sentence": "Partner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service.",
            "similarity": 0.4905
          },
          {
            "kra_text": "Designs end-to-end ML training pipelines and model inference workflows using TensorFlow, PyTorch, or scikit-learn on cloud ML platforms.",
            "sentence": "Design and build data pipelines from various cloud data sources for the enterprise data warehouse.",
            "similarity": 0.4672
          },
          {
            "kra_text": "Prepares, cleans, and transforms training datasets, manages feature stores, and builds feature engineering pipelines for model training.",
            "sentence": "Support and Maintain analytics tech ecosystem (data warehouse, ETL and BI tools).",
            "similarity": 0.4435
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 3,
        "score": 0.4671,
        "slug": "ml-engineer",
        "total_count": null
      },
      {
        "display_name": "MLOps Engineer",
        "kra_matches": [
          {
            "kra_text": "Maintains model versioning, experiment lineage, and artifact tracking using MLflow, DVC, or Weights \u0026 Biases for reproducibility and auditability.",
            "sentence": "Own and document data pipelines and data lineage.",
            "similarity": 0.5024
          },
          {
            "kra_text": "Supports ML platform incidents by diagnosing model serving failures, feature store pipeline breaks, and training environment configuration issues.",
            "sentence": "Support and Maintain analytics tech ecosystem (data warehouse, ETL and BI tools).",
            "similarity": 0.4656
          },
          {
            "kra_text": "Validates model performance benchmarks, data schema contracts, and system integration health before signing off on production release readiness.",
            "sentence": "Partner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service.",
            "similarity": 0.4216
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 16,
        "score": 0.4632,
        "slug": "ml-ops-engineer",
        "total_count": null
      },
      {
        "display_name": "DevOps Engineer",
        "kra_matches": [
          {
            "kra_text": "Collaborates with development teams to improve build processes, reduce deployment friction, containerize applications, and adopt DevOps best practices.",
            "sentence": "Continuously identify areas of improvement and ensure application of standards and best practices.",
            "similarity": 0.5096
          },
          {
            "kra_text": "Collaborates with development teams to improve build processes, reduce deployment friction, containerize applications, and adopt DevOps best practices.",
            "sentence": "Partner with Data Engineers, Data architects, domain experts, data analysts and other teams to build foundational data sets that are trusted, well understood, aligned with business strategy and enable self-service.",
            "similarity": 0.4324
          },
          {
            "kra_text": "Builds and maintains CI/CD pipelines using Jenkins, GitHub Actions, GitLab CI, or CircleCI to automate build, test, security scanning, and deployment workflows.",
            "sentence": "Own and document data pipelines and data lineage.",
            "similarity": 0.423
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 10,
        "score": 0.455,
        "slug": "devops-engineer",
        "total_count": null
      }
    ],
    "skill_match_roles": []
  },
  "stage4_decision": {
    "alias_collision_detected": false,
    "case": "A",
    "chosen_role": {
      "display_name": "Data Engineer",
      "kra_matches": null,
      "matched_count": null,
      "matched_skills": null,
      "role_id": 2,
      "score": 1.0,
      "slug": "data-engineer",
      "total_count": null
    },
    "confidence": 1.0,
    "is_new_role": false,
    "llm2_fired": false,
    "llm2_reasoning": null,
    "matched_dimensions": [],
    "matched_kras": [],
    "matched_skills": [],
    "new_role_display_name": null,
    "new_role_slug": null,
    "queued": false,
    "reasoning": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
    "sub_role": null
  },
  "stage5_updates": {
    "centroid_n_after": 361,
    "centroid_updated": true,
    "collision_log_id": null,
    "new_kra_attached": null,
    "new_skills_attached": [
      {
        "is_primary": true,
        "queue_id": 17103,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Data Pipelines",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 17104,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Cloud Data Sources",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 17105,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Data Warehouse",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 17106,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "ETL",
        "status": "pending"
      },
      {
        "is_primary": true,
        "queue_id": 17107,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "BI Tools",
        "status": "pending"
      },
      {
        "is_primary": false,
        "queue_id": 17108,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Data Lineage",
        "status": "pending"
      },
      {
        "is_primary": false,
        "queue_id": 17109,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Standards",
        "status": "pending"
      },
      {
        "is_primary": false,
        "queue_id": 17110,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Best Practices",
        "status": "pending"
      }
    ],
    "queue_entry_id": null,
    "v3_pipeline_triggered": false,
    "v3_role_slug": null,
    "v3_run_id": null
  }
}
API 2 — extract-details
{
  "alias_matches": [],
  "candidate_roles": [],
  "chosen_role": {
    "display_name": "Data Engineer",
    "id": 2,
    "rationale": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
    "role_archetype": null,
    "slug": "data-engineer",
    "source": "db"
  },
  "dimensions": [],
  "input_final_skills": [
    "Data Pipelines",
    "Cloud Data Sources",
    "Data Warehouse",
    "ETL",
    "BI Tools",
    "Data Lineage",
    "Standards",
    "Best Practices"
  ],
  "input_llm_skills": [
    "Data Pipelines",
    "Cloud Data Sources",
    "Data Warehouse",
    "ETL",
    "BI Tools",
    "Data Lineage",
    "Standards",
    "Best Practices"
  ],
  "new_aliases_persisted": 0,
  "run_id": "fc829bda-6a1e-4086-befc-133024825a6d",
  "skills_detail": [
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Data Pipelines",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Data Engineering Tools",
          "skill_nature": "CONCEPT",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "data-pipelines",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Cloud Data Sources",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Cloud Platforms",
          "skill_nature": "PLATFORM",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "cloud-data-sources",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Data Warehouse",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Databases",
          "skill_nature": "CONCEPT",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "data-warehouse",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "ETL",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Data Engineering Tools",
          "skill_nature": "PRACTICE",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "etl",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "BI Tools",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Data Engineering Tools",
          "skill_nature": "TOOL",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "bi-tools",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Data Lineage",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Data Engineering Tools",
          "skill_nature": "CONCEPT",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "data-lineage",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Standards",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Best Practices",
          "skill_nature": "CONCEPT",
          "sub_category": "general",
          "typical_lifespan": "EVERGREEN",
          "version_strategy": "UNVERSIONED",
          "volatility": "STABLE"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "standards",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    },
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Best Practices",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Best Practices",
          "skill_nature": "PRACTICE",
          "sub_category": "general",
          "typical_lifespan": "EVERGREEN",
          "version_strategy": "UNVERSIONED",
          "volatility": "STABLE"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "best-practices",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    }
  ],
  "unmatched_skills": [
    "Data Pipelines",
    "Cloud Data Sources",
    "Data Warehouse",
    "ETL",
    "BI Tools",
    "Data Lineage",
    "Standards",
    "Best Practices"
  ]
}
API 3 — final-role-output
{
  "chosen_role": {
    "display_name": "Data Engineer",
    "id": 2,
    "rationale": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
    "role_archetype": null,
    "slug": "data-engineer",
    "source": "db"
  },
  "chosen_role_resolution": "in_db",
  "final_input_skills": [
    {
      "skill": "Data Pipelines",
      "tag": "new"
    },
    {
      "skill": "Cloud Data Sources",
      "tag": "new"
    },
    {
      "skill": "Data Warehouse",
      "tag": "new"
    },
    {
      "skill": "ETL",
      "tag": "new"
    },
    {
      "skill": "BI Tools",
      "tag": "new"
    },
    {
      "skill": "Data Lineage",
      "tag": "new"
    },
    {
      "skill": "Standards",
      "tag": "new"
    },
    {
      "skill": "Best Practices",
      "tag": "new"
    }
  ],
  "llm_cost_api1_usd": null,
  "llm_cost_api2_usd": null,
  "llm_cost_api3_usd": null,
  "llm_cost_total_usd": null,
  "persistence": {
    "items": [],
    "new_skills_created": 0,
    "role_dimension_saved": 0,
    "skill_dimension_saved": 0,
    "skipped": 0
  },
  "planner_output": null,
  "run_id": "fc829bda-6a1e-4086-befc-133024825a6d"
}

LLM Calls

Every model call made for this run, in pipeline order. Click a card to see the model's response.

Loading…