Pipeline run

27ea4ff7-d84b-4cc3-bcf1-ed1a2c19dd1c

Pipeline LLM cost (USD)

API 1: $0.0027 · API 2: $0.0001 · API 3: $0.0000 · Total: $0.0028
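
The total reported for this run is the sum of the three per-API costs. A minimal check of that arithmetic, using the figures copied from the summary line; the variable names are hypothetical and `Decimal` is used only to avoid float rounding noise:

```python
from decimal import Decimal

# Per-API LLM costs in USD, copied from this run's cost summary.
costs = {
    "API 1": Decimal("0.0027"),
    "API 2": Decimal("0.0001"),
    "API 3": Decimal("0.0000"),
}

# Sum with a Decimal start value so the result stays a Decimal.
total = sum(costs.values(), Decimal("0"))
print(f"Total: ${total}")
```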

Client output enrichment

v2 Skill cluster · Nature of work · AI index · Tech stack maturity · Evidence · KRA description

SPARSE JD role baseline loaded sources · ai_index: role_baseline · nature_of_work: jd · tech_stack_maturity: role_baseline

Nature of work · Data pipeline development

Build and maintain data platform integrations and pipelines in Informatica IICS, while collaborating on platform design, troubleshooting issues, and improving data quality and workflow efficiency. Also provide guidance to junior team members and help shape cohesive system/data-model integration.

"Implement data integration processes using Informatica Intelligent Cloud Services."

Tech stack maturity

Modern Cloud Native

Data engineers typically build cloud-based batch and streaming pipelines and warehouse models, but AI is usually incidental rather than central to the role.

AI index (0 = no AI use, 5 = totally AI-dependent · v2.1)

1.20 / 5

· Title match

· Has AI skill

· AI skill (primary)

· AI skill (secondary)

· On AI team

· Builds AI products

vocab breakdown (legacy)

Assistants (×1): —

Frameworks (×2): —

Models / concepts (×3): —

Evidence — skills matched in JD (1)

Informatica Intelligent Cloud Services

Skill cluster (1 dimension group, role-scoped)

Cross-cutting / unaligned

Informatica Intelligent Cloud Services

KRA description

- Expected to perform independently and become an SME.
- Required active participation/contribution in team discussions.
- Contribute in providing solutions to work related problems.
- Assist with the data platform blueprint and design.
- Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.
- Develop and maintain data platform components.
- Implement data integration processes using Informatica Intelligent Cloud Services.
- Optimize data workflows and pipelines for efficient data processing.
- Troubleshoot and resolve data integration issues.
- Ensure data quality and integrity throughout the data platform.
- Stay updated with the latest trends and advancements in data engineering.
- Provide technical guidance and support to junior team members.

Signals

Skill —

—

Alias data-engineer

1.00

KRA data-engineer

0.64

Post-classification

Centroid updated · n=251

Alias collision log —

New-role queue —

New skills captured 1

New KRA captured —
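
The post-classification summary only says that the role centroid was updated and that it now covers n=251 samples. One common way to do such an update is an incremental mean, sketched below. The update rule, the function name, and the two-dimensional toy vectors are all assumptions for illustration; the source does not state how the centroid is actually maintained.

```python
from typing import List


def update_centroid(centroid: List[float], n_before: int, vector: List[float]) -> List[float]:
    """Fold one new embedding into a running mean (assumed update rule)."""
    n_after = n_before + 1
    # new_mean = old_mean + (x - old_mean) / n_after, applied per component
    return [c + (v - c) / n_after for c, v in zip(centroid, vector)]


# Toy example: a centroid built from one sample, updated with a second one.
centroid = update_centroid([0.0, 1.0], n_before=1, vector=[1.0, 0.0])
print(centroid)
```

Under this rule, a centroid reported at n=251 would have been at n=250 before the run, so the new job description would shift each component by 1/251 of its distance from the old mean.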

Captured for admin review

Informatica Intelligent Cloud Services primary ↔ Data Engineer pending

Status: completed · Created: 2026-05-27T15:03:48.921289Z · Updated: 2026-06-12T16:55:07.281298Z · API 3 duration: 2281 ms

Flow Current 3-step pipeline

1 POST /skills/extract-from-jd

2 POST /skills/extract-details

3 POST /skills/final-role-output
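
The three steps above run in order. A minimal sketch of that sequencing follows. Only the endpoint paths come from this page; the `run_pipeline` helper, the `jd_text` request field, and the idea that each response is passed on as the next request body are assumptions, and a stand-in transport is used instead of a real HTTP client.

```python
from typing import Any, Callable, Dict, List

# Endpoint paths copied from the flow listing; everything else is hypothetical.
PIPELINE_STEPS: List[str] = [
    "/skills/extract-from-jd",
    "/skills/extract-details",
    "/skills/final-role-output",
]

Post = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def run_pipeline(post: Post, jd_text: str) -> Dict[str, Any]:
    """Call the three steps in order, feeding each response into the next call."""
    payload: Dict[str, Any] = {"jd_text": jd_text}  # assumed request field name
    for path in PIPELINE_STEPS:
        payload = post(path, payload)
    return payload


# Stand-in transport that records call order instead of contacting a server.
calls: List[str] = []


def fake_post(path: str, body: Dict[str, Any]) -> Dict[str, Any]:
    calls.append(path)
    return {"run_id": "demo", "previous": path}


result = run_pipeline(fake_post, "Project Role : Data Platform Engineer")
print(calls)
```

Passing the transport in as a function keeps the sketch runnable offline; a real client would replace `fake_post` with a call to whatever host serves these endpoints.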

Role Chosen role & resolution

Data Engineer

CASE A

slug: data-engineer · id: 2 · source: db

Exact alias hit on data-engineer (1.0) — no other alias at this confidence; skill_top absent does not contradict

Resolution: in_db — role exists in library; skill↔dim and role↔dim links saved when applicable.
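
The Case A rationale above amounts to: accept the role when exactly one alias reaches confidence 1.0. The sketch below illustrates that single rule. The function name and the input shape are hypothetical, and the other cases the pipeline may have are not described on this page, so they are not modelled.

```python
from typing import List, Optional, Tuple


def resolve_by_alias(alias_hits: List[Tuple[str, float]], exact: float = 1.0) -> Optional[str]:
    """Return the role slug when exactly one alias hit reaches `exact` confidence."""
    top = [slug for slug, score in alias_hits if score >= exact]
    # Zero hits or several tied hits fall outside this rule.
    return top[0] if len(top) == 1 else None


# This run: a single alias hit on data-engineer with score 1.0.
print(resolve_by_alias([("data-engineer", 1.0)]))
```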

New skills

Skill↔dim saved

Role↔dim saved

Skipped

Job description

Project Role : Data Platform Engineer

Project Role Description : Assists with the data platform blueprint and design, encompassing the relevant data platform components. Collaborates with the Integration Architects and Data Architects to ensure cohesive integration between systems and data models.

Must have skills : Informatica Intelligent Cloud Services

Good to have skills : Google BigQuery

Minimum 3 Year(s) Of Experience Is Required

Educational Qualification : 15 years full time education

Summary: As a Data Platform Engineer, you will assist with the data platform blueprint and design, collaborating with Integration Architects and Data Architects to ensure cohesive integration between systems and data models. You will play a crucial role in the development and maintenance of the data platform components. Join our team in Mumbai and contribute to the success of our data engineering projects.

Roles & Responsibilities:
- Expected to perform independently and become an SME.
- Required active participation/contribution in team discussions.
- Contribute in providing solutions to work related problems.
- Assist with the data platform blueprint and design.
- Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.
- Develop and maintain data platform components.
- Implement data integration processes using Informatica Intelligent Cloud Services.
- Optimize data workflows and pipelines for efficient data processing.
- Troubleshoot and resolve data integration issues.
- Ensure data quality and integrity throughout the data platform.
- Stay updated with the latest trends and advancements in data engineering.
- Provide technical guidance and support to junior team members.

Professional & Technical Skills:
- Must To Have Skills: Proficiency in Informatica Intelligent Cloud Services.
- Good To Have Skills: Experience with Google BigQuery.
- Strong understanding of data integration concepts and techniques.
- Experience in designing and implementing data workflows and pipelines.
- Familiarity with cloud-based data platforms and services.
- Knowledge of SQL and database management systems.
- Experience with data quality and data governance practices.
- Ability to troubleshoot and resolve data integration issues.

Additional Information:
- The candidate should have a minimum of 3 years of experience in Informatica Intelligent Cloud Services.
- This position is based at our Mumbai office.
- A 15 years full time education is required.


Skills from this JD

Each row merges API 1 extraction, API 2 library match / v3 orchestration (dimensions + locked dims), and API 3 persistence tags.

Informatica Intelligent Cloud Services Primary New / orchestrated API 3: new canonical path (new) New / unmatched skill (orchestrated in API 2)
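
The merge described above can be sketched as a join on the skill name. The field names below (`skill_name`, `is_primary`, `input_skill`, `canonical`, `skill`, `tag`) are taken from the API 1, API 2, and API 3 payloads of this run; the `merge_rows` helper and the shape of the output row are assumptions, not the page's actual implementation.

```python
from typing import Any, Dict, List

# Trimmed copies of the three payloads for this run's single skill.
api1_final_skills = [{"skill_name": "Informatica Intelligent Cloud Services", "is_primary": True}]
api2_skills_detail = [{"input_skill": "Informatica Intelligent Cloud Services", "canonical": None, "source_tag": "llm"}]
api3_final_input_skills = [{"skill": "Informatica Intelligent Cloud Services", "tag": "new"}]


def merge_rows(api1: List[Dict[str, Any]], api2: List[Dict[str, Any]], api3: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Build one display row per extracted skill, keyed on the skill name."""
    details = {d["input_skill"]: d for d in api2}
    tags = {t["skill"]: t["tag"] for t in api3}
    rows = []
    for skill in api1:
        name = skill["skill_name"]
        detail = details.get(name, {})
        rows.append({
            "skill": name,
            "is_primary": skill["is_primary"],
            "canonical": detail.get("canonical"),  # None means no library match
            "tag": tags.get(name),                 # persistence tag from API 3
        })
    return rows


rows = merge_rows(api1_final_skills, api2_skills_detail, api3_final_input_skills)
print(rows[0])
```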

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields

Category: Cloud Platforms
Sub-category: general
Skill nature: PLATFORM
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

Library artifacts (this run)

Kind	Detail	DB id
canonical_skill_proposed	Informatica Intelligent Cloud Services | type=Cloud Platforms subtype=general nature=PLATFORM lifespan=MULTI_YEAR

nano JD Parser — gpt-4.1-nano

Role Data Platform Engineer

Experience Minimum 3 Year(s) Of Experience Is Required

Domain Other

Location Mumbai, India (onsite)

JD type pass

Raw JSON

{
  "JD_type": "pass",
  "about_company": null,
  "certifications": [],
  "company_name": null,
  "ctc": null,
  "domain": {
    "primary": {
      "aliases": [],
      "domain": "Other"
    },
    "secondary": null
  },
  "education": [
    {
      "level": "null",
      "qualification": "null - null",
      "raw": "15 years full time education",
      "requirement": "required"
    }
  ],
  "experience": {
    "max": null,
    "min": 3,
    "raw": "Minimum 3 Year(s) Of Experience Is Required"
  },
  "job_locations": [
    {
      "aliases": [
        "Bombay"
      ],
      "city": "Mumbai",
      "country": "India",
      "state": null,
      "work_mode": "onsite"
    }
  ],
  "role": "Data Platform Engineer",
  "role_aliases": [
    "Data Engineer",
    "Data Integration Engineer"
  ],
  "role_archetype": "Data",
  "roles_and_responsibilities": [
    {
      "bullet_count": 12,
      "heading": "Roles \u0026 Responsibilities",
      "heading_was_present": true,
      "source_marker": {
        "first_5_words": "- Expected to perform independently",
        "last_5_words": "to junior team members."
      },
      "text": "- Expected to perform independently and become an SME.\n- Required active participation/contribution in team discussions.\n- Contribute in providing solutions to work related problems.\n- Assist with the data platform blueprint and design.\n- Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.\n- Develop and maintain data platform components.\n- Implement data integration processes using Informatica Intelligent Cloud Services.\n- Optimize data workflows and pipelines for efficient data processing.\n- Troubleshoot and resolve data integration issues.\n- Ensure data quality and integrity throughout the data platform.\n- Stay updated with the latest trends and advancements in data engineering.\n- Provide technical guidance and support to junior team members.",
      "word_count": 114
    }
  ],
  "urls": []
}

API 1 — extract-from-jd

{
  "final_skills": [
    {
      "is_primary": true,
      "skill_name": "Informatica Intelligent Cloud Services"
    }
  ],
  "jd_role": {
    "display_name": "Data Platform Engineer",
    "rationale": null,
    "role_aliases": [
      "Data Engineer",
      "Data Integration Engineer"
    ],
    "role_archetype": "Data",
    "slug": ""
  },
  "nano_parsed": {
    "JD_type": "pass",
    "about_company": null,
    "certifications": [],
    "company_name": null,
    "ctc": null,
    "domain": {
      "primary": {
        "aliases": [],
        "domain": "Other"
      },
      "secondary": null
    },
    "education": [
      {
        "level": "null",
        "qualification": "null - null",
        "raw": "15 years full time education",
        "requirement": "required"
      }
    ],
    "experience": {
      "max": null,
      "min": 3,
      "raw": "Minimum 3 Year(s) Of Experience Is Required"
    },
    "job_locations": [
      {
        "aliases": [
          "Bombay"
        ],
        "city": "Mumbai",
        "country": "India",
        "state": null,
        "work_mode": "onsite"
      }
    ],
    "role": "Data Platform Engineer",
    "role_aliases": [
      "Data Engineer",
      "Data Integration Engineer"
    ],
    "role_archetype": "Data",
    "roles_and_responsibilities": [
      {
        "bullet_count": 12,
        "heading": "Roles \u0026 Responsibilities",
        "heading_was_present": true,
        "source_marker": {
          "first_5_words": "- Expected to perform independently",
          "last_5_words": "to junior team members."
        },
        "text": "- Expected to perform independently and become an SME.\n- Required active participation/contribution in team discussions.\n- Contribute in providing solutions to work related problems.\n- Assist with the data platform blueprint and design.\n- Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.\n- Develop and maintain data platform components.\n- Implement data integration processes using Informatica Intelligent Cloud Services.\n- Optimize data workflows and pipelines for efficient data processing.\n- Troubleshoot and resolve data integration issues.\n- Ensure data quality and integrity throughout the data platform.\n- Stay updated with the latest trends and advancements in data engineering.\n- Provide technical guidance and support to junior team members.",
        "word_count": 114
      }
    ],
    "urls": []
  },
  "rejected": false,
  "rejection_reason": null,
  "run_id": "27ea4ff7-d84b-4cc3-bcf1-ed1a2c19dd1c",
  "stage3_signals": {
    "alias_found": true,
    "alias_match_roles": [
      {
        "display_name": "Data Engineer",
        "kra_matches": null,
        "matched_count": null,
        "matched_skills": null,
        "role_id": 2,
        "score": 1.0,
        "slug": "data-engineer",
        "total_count": null
      }
    ],
    "kra_match_roles": [
      {
        "display_name": "Data Engineer",
        "kra_matches": [
          {
            "kra_text": "Implements data quality validation rules, reconciliation checks, and anomaly detection to ensure data completeness, accuracy, and consistency.",
            "sentence": "Ensure data quality and integrity throughout the data platform.",
            "similarity": 0.6593
          },
          {
            "kra_text": "Optimizes pipeline throughput, partitioning strategies, and query performance across cloud data warehouses like Snowflake, BigQuery, or Redshift.",
            "sentence": "Optimize data workflows and pipelines for efficient data processing.",
            "similarity": 0.6523
          },
          {
            "kra_text": "Designs dimensional models, star schemas, data vault structures, and curated data mart tables to support BI tools and self-service analytics consumption.",
            "sentence": "Assist with the data platform blueprint and design.",
            "similarity": 0.6138
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 2,
        "score": 0.6418,
        "slug": "data-engineer",
        "total_count": null
      },
      {
        "display_name": "Svelte Frontend Developer",
        "kra_matches": [
          {
            "kra_text": "backend data integration",
            "sentence": "Troubleshoot and resolve data integration issues.",
            "similarity": 0.5869
          },
          {
            "kra_text": "backend data integration",
            "sentence": "Develop and maintain data platform components.",
            "similarity": 0.5181
          },
          {
            "kra_text": "backend data integration",
            "sentence": "Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.",
            "similarity": 0.5145
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 92,
        "score": 0.5399,
        "slug": "svelte-frontend-developer",
        "total_count": null
      },
      {
        "display_name": "AI Engineer",
        "kra_matches": [
          {
            "kra_text": "Designs and implements prompt engineering workflows, few-shot examples, chain-of-thought patterns, and structured output parsing for AI feature pipelines.",
            "sentence": "Optimize data workflows and pipelines for efficient data processing.",
            "similarity": 0.5375
          },
          {
            "kra_text": "Integrates AI model API responses with application business logic, database writes, event publishing, and downstream service orchestration.",
            "sentence": "Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.",
            "similarity": 0.4868
          },
          {
            "kra_text": "Integrates AI model API responses with application business logic, database writes, event publishing, and downstream service orchestration.",
            "sentence": "Implement data integration processes using Informatica Intelligent Cloud Services.",
            "similarity": 0.473
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 13,
        "score": 0.4991,
        "slug": "ai-engineer",
        "total_count": null
      },
      {
        "display_name": "Python Backend Developer",
        "kra_matches": [
          {
            "kra_text": "Maintain data access and persistence",
            "sentence": "Develop and maintain data platform components.",
            "similarity": 0.5191
          },
          {
            "kra_text": "Maintain data access and persistence",
            "sentence": "Ensure data quality and integrity throughout the data platform.",
            "similarity": 0.4966
          },
          {
            "kra_text": "Troubleshoot server-side defects",
            "sentence": "Troubleshoot and resolve data integration issues.",
            "similarity": 0.4757
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 80,
        "score": 0.4971,
        "slug": "python-backend-developer",
        "total_count": null
      },
      {
        "display_name": "MLOps Engineer",
        "kra_matches": [
          {
            "kra_text": "Validates model performance benchmarks, data schema contracts, and system integration health before signing off on production release readiness.",
            "sentence": "Ensure data quality and integrity throughout the data platform.",
            "similarity": 0.5263
          },
          {
            "kra_text": "Coordinates model promotion workflows across development, staging, and production environments including integration testing and data contract validation.",
            "sentence": "Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.",
            "similarity": 0.4824
          },
          {
            "kra_text": "Automates ML platform operations including scheduled retraining triggers, pipeline orchestration, evaluation workflows, and alerting configuration.",
            "sentence": "Optimize data workflows and pipelines for efficient data processing.",
            "similarity": 0.482
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 16,
        "score": 0.4969,
        "slug": "ml-ops-engineer",
        "total_count": null
      }
    ],
    "skill_match_roles": []
  },
  "stage4_decision": {
    "alias_collision_detected": false,
    "case": "A",
    "chosen_role": {
      "display_name": "Data Engineer",
      "kra_matches": null,
      "matched_count": null,
      "matched_skills": null,
      "role_id": 2,
      "score": 1.0,
      "slug": "data-engineer",
      "total_count": null
    },
    "confidence": 1.0,
    "is_new_role": false,
    "llm2_fired": false,
    "llm2_reasoning": null,
    "matched_dimensions": [],
    "matched_kras": [],
    "matched_skills": [],
    "new_role_display_name": null,
    "new_role_slug": null,
    "queued": false,
    "reasoning": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
    "sub_role": null
  },
  "stage5_updates": {
    "centroid_n_after": 251,
    "centroid_updated": true,
    "collision_log_id": null,
    "new_kra_attached": null,
    "new_skills_attached": [
      {
        "is_primary": true,
        "queue_id": 12410,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Informatica Intelligent Cloud Services",
        "status": "pending"
      }
    ],
    "queue_entry_id": null,
    "v3_pipeline_triggered": false,
    "v3_role_slug": null,
    "v3_run_id": null
  }
}
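
In the `kra_match_roles` entries of the response above, each role's `score` sits very close to the plain average of its three listed `similarity` values. The sketch below checks that observation for two of the roles. The averaging rule is inferred from the numbers, not documented on this page, and the small gaps are consistent with the similarities being displayed rounded to four decimals.

```python
from statistics import mean

# Sentence-to-KRA similarities copied from kra_match_roles in this run.
kra_similarities = {
    "data-engineer": [0.6593, 0.6523, 0.6138],
    "svelte-frontend-developer": [0.5869, 0.5181, 0.5145],
}

# Role scores as reported in the same response.
reported = {"data-engineer": 0.6418, "svelte-frontend-developer": 0.5399}


def kra_score(similarities):
    """Average the listed similarities (an inferred rule, not a documented one)."""
    return mean(similarities)


gaps = {}
for slug, sims in kra_similarities.items():
    gaps[slug] = abs(kra_score(sims) - reported[slug])
    print(f"{slug}: mean={kra_score(sims):.5f} reported={reported[slug]:.4f} gap={gaps[slug]:.5f}")
```

Note that the KRA score (0.64) played no deciding part in this run: the role was chosen on the exact alias hit, as the `stage4_decision` reasoning states.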

API 2 — extract-details

{
  "alias_matches": [],
  "candidate_roles": [],
  "chosen_role": {
    "display_name": "Data Engineer",
    "id": 2,
    "rationale": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
    "role_archetype": null,
    "slug": "data-engineer",
    "source": "db"
  },
  "dimensions": [],
  "input_final_skills": [
    "Informatica Intelligent Cloud Services"
  ],
  "input_llm_skills": [
    "Informatica Intelligent Cloud Services"
  ],
  "new_aliases_persisted": 0,
  "run_id": "27ea4ff7-d84b-4cc3-bcf1-ed1a2c19dd1c",
  "skills_detail": [
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Informatica Intelligent Cloud Services",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Cloud Platforms",
          "skill_nature": "PLATFORM",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "informatica-intelligent-cloud-services",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    }
  ],
  "unmatched_skills": [
    "Informatica Intelligent Cloud Services"
  ]
}
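
The `skill_id` in `new_skill_meta` above looks like a lowercased, hyphen-joined form of the input skill name. A minimal slug function that reproduces this one value is sketched below. The rule is a guess from a single example; how the pipeline actually handles punctuation, accents, or collisions is not shown on this page.

```python
import re


def slugify(name: str) -> str:
    """Lowercase the name and join alphanumeric runs with hyphens (assumed rule)."""
    return re.sub(r"[^a-z0-9]+", "-", name.lower()).strip("-")


skill_id = slugify("Informatica Intelligent Cloud Services")
print(skill_id)
```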

API 3 — final-role-output

{
  "chosen_role": {
    "display_name": "Data Engineer",
    "id": 2,
    "rationale": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
    "role_archetype": null,
    "slug": "data-engineer",
    "source": "db"
  },
  "chosen_role_resolution": "in_db",
  "final_input_skills": [
    {
      "skill": "Informatica Intelligent Cloud Services",
      "tag": "new"
    }
  ],
  "llm_cost_api1_usd": null,
  "llm_cost_api2_usd": null,
  "llm_cost_api3_usd": null,
  "llm_cost_total_usd": null,
  "persistence": {
    "items": [],
    "new_skills_created": 0,
    "role_dimension_saved": 0,
    "skill_dimension_saved": 0,
    "skipped": 0
  },
  "planner_output": null,
  "run_id": "27ea4ff7-d84b-4cc3-bcf1-ed1a2c19dd1c"
}

LLM Calls

Every model call made for this run, in pipeline order.
