
Pipeline run

27ea4ff7-d84b-4cc3-bcf1-ed1a2c19dd1c

Pipeline LLM cost (USD)
API 1: $0.0027 · API 2: $0.0001 · API 3: $0.0000 · Total: $0.0028

Client output enrichment

v2 Skill cluster · Nature of work · AI index · Tech stack maturity · Evidence · KRA description
SPARSE JD · role baseline loaded · sources · ai_index: role_baseline · nature_of_work: jd · tech_stack_maturity: role_baseline
Nature of work · Data pipeline development
Build and maintain data platform integrations and pipelines in Informatica Intelligent Cloud Services (IICS), collaborate on platform design, troubleshoot issues, and improve data quality and workflow efficiency. Also provide guidance to junior team members and help ensure cohesive integration between systems and data models.
"Implement data integration processes using Informatica Intelligent Cloud Services."
Tech stack maturity
Modern Cloud Native
Data engineers typically build cloud-based batch and streaming pipelines and warehouse models, but AI is usually incidental rather than central to the role.
AI index (0 = no AI use, 5 = totally AI-dependent · v2.1)
1.20 / 5
· Title match
· Has AI skill
· AI skill (primary)
· AI skill (secondary)
· On AI team
· Builds AI products
vocab breakdown (legacy)
Assistants (×1):
Frameworks (×2):
Models / concepts (×3):
Evidence — skills matched in JD (1)
Informatica Intelligent Cloud Services
Skill cluster (1 dimension groups, role-scoped)
Cross-cutting / unaligned
Informatica Intelligent Cloud Services
KRA description
- Expected to perform independently and become an SME.
- Required active participation/contribution in team discussions.
- Contribute in providing solutions to work related problems.
- Assist with the data platform blueprint and design.
- Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.
- Develop and maintain data platform components.
- Implement data integration processes using Informatica Intelligent Cloud Services.
- Optimize data workflows and pipelines for efficient data processing.
- Troubleshoot and resolve data integration issues.
- Ensure data quality and integrity throughout the data platform.
- Stay updated with the latest trends and advancements in data engineering.
- Provide technical guidance and support to junior team members.

Signals

Skill
Alias · data-engineer · 1.00
KRA · data-engineer · 0.64

Post-classification

Centroid updated · n=251
Alias collision log
New-role queue
New skills captured: 1
New KRA captured

Captured for admin review

Informatica Intelligent Cloud Services · primary · Data Engineer · pending
Status: completed · Created: 2026-05-27T15:03:48.921289Z · Updated: 2026-06-12T16:55:07.281298Z · API 3 duration: 2281 ms
Flow Current 3-step pipeline

1 POST /skills/extract-from-jd

2 POST /skills/extract-details

3 POST /skills/final-role-output

Role Chosen role & resolution

Data Engineer

CASE A

slug: data-engineer · id: 2 · source: db

Exact alias hit on data-engineer (1.0) — no other alias at this confidence; skill_top absent does not contradict

Resolution: in_db — role exists in library; skill↔dim and role↔dim links saved when applicable.
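
The Case A rationale above can be read as a simple rule: accept the role when exactly one alias match reaches the top confidence and no skill-based signal points at a different role. The following is a minimal sketch under that reading. The function name `resolve_case_a`, the `threshold` parameter, and the returned dictionary shape are illustrative assumptions, not the pipeline's actual code.

```python
from typing import Optional


def resolve_case_a(alias_matches: list[dict], skill_top: Optional[str],
                   threshold: float = 1.0) -> Optional[dict]:
    """Hypothetical Case A rule: one unambiguous alias hit, no contradicting skill signal."""
    top = [m for m in alias_matches if m["score"] >= threshold]
    if len(top) != 1:
        return None  # no exact hit, or several aliases tie at this confidence
    candidate = top[0]
    if skill_top is not None and skill_top != candidate["slug"]:
        return None  # the skill-based signal names a different role
    return {"case": "A", "slug": candidate["slug"], "confidence": candidate["score"]}


# Values taken from this run: a single alias match and no skill-based role match.
alias_matches = [{"slug": "data-engineer", "role_id": 2, "score": 1.0}]
decision = resolve_case_a(alias_matches, skill_top=None)
print(decision)
```

With `skill_top=None` the skill signal is simply absent, which matches the "skill_top absent does not contradict" wording in the rationale.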

New skills: 0
Skill↔dim saved: 0
Role↔dim saved: 0
Skipped: 0

Job description

Project Role : Data Platform Engineer

Project Role Description : Assists with the data platform blueprint and design, encompassing the relevant data platform components. Collaborates with the Integration Architects and Data Architects to ensure cohesive integration between systems and data models.

Must have skills : Informatica Intelligent Cloud Services

Good to have skills : Google BigQuery

Minimum 3 Year(s) Of Experience Is Required

Educational Qualification : 15 years full time education

Summary: As a Data Platform Engineer, you will assist with the data platform blueprint and design, collaborating with Integration Architects and Data Architects to ensure cohesive integration between systems and data models. You will play a crucial role in the development and maintenance of the data platform components. Join our team in Mumbai and contribute to the success of our data engineering projects.

Roles & Responsibilities:
- Expected to perform independently and become an SME.
- Required active participation/contribution in team discussions.
- Contribute in providing solutions to work related problems.
- Assist with the data platform blueprint and design.
- Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.
- Develop and maintain data platform components.
- Implement data integration processes using Informatica Intelligent Cloud Services.
- Optimize data workflows and pipelines for efficient data processing.
- Troubleshoot and resolve data integration issues.
- Ensure data quality and integrity throughout the data platform.
- Stay updated with the latest trends and advancements in data engineering.
- Provide technical guidance and support to junior team members.

Professional & Technical Skills:
- Must To Have Skills: Proficiency in Informatica Intelligent Cloud Services.
- Good To Have Skills: Experience with Google BigQuery.
- Strong understanding of data integration concepts and techniques.
- Experience in designing and implementing data workflows and pipelines.
- Familiarity with cloud-based data platforms and services.
- Knowledge of SQL and database management systems.
- Experience with data quality and data governance practices.
- Ability to troubleshoot and resolve data integration issues.

Additional Information:
- The candidate should have a minimum of 3 years of experience in Informatica Intelligent Cloud Services.
- This position is based at our Mumbai office.
- A 15 years full time education is required.


Skills from this JD

Each row merges API 1 extraction, API 2 library match / v3 orchestration (dimensions + locked dims), and API 3 persistence tags.

Informatica Intelligent Cloud Services · Primary · New / orchestrated · API 3: new canonical path (new) · New / unmatched skill (orchestrated in API 2)
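
As a rough illustration of how one such row could be assembled, the sketch below joins the three payloads on the skill name. The input field names (`final_skills`, `skills_detail`, `final_input_skills`, and their keys) come from the JSON shown on this page; the function name `merge_skill_rows` and the output keys are hypothetical, and the real page may merge differently.

```python
def merge_skill_rows(api1: dict, api2: dict, api3: dict) -> list[dict]:
    """Join API 1 skills with API 2 match details and API 3 tags by skill name."""
    details = {d["input_skill"]: d for d in api2.get("skills_detail", [])}
    tags = {t["skill"]: t["tag"] for t in api3.get("final_input_skills", [])}
    rows = []
    for skill in api1.get("final_skills", []):
        name = skill["skill_name"]
        detail = details.get(name, {})
        rows.append({
            "skill": name,
            "is_primary": skill["is_primary"],
            # A null `canonical` means API 2 found no library match.
            "matched": detail.get("canonical") is not None,
            "dimensions": detail.get("dimensions", []),
            "api3_tag": tags.get(name),
        })
    return rows


# Trimmed copies of this run's payloads.
api1 = {"final_skills": [{"is_primary": True,
                          "skill_name": "Informatica Intelligent Cloud Services"}]}
api2 = {"skills_detail": [{"input_skill": "Informatica Intelligent Cloud Services",
                           "canonical": None, "dimensions": []}]}
api3 = {"final_input_skills": [{"skill": "Informatica Intelligent Cloud Services",
                                "tag": "new"}]}
rows = merge_skill_rows(api1, api2, api3)
print(rows)
```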

Skill enrichment (orchestrator / LLM)

No Stage 7 enrichment blob on this skill (orchestrator skipped enrichment).

Derived legacy fields
Category: Cloud Platforms
Sub-category: general
Skill nature: PLATFORM
Volatility: MEDIUM
Typical lifespan: MULTI_YEAR
Version strategy: UNVERSIONED

Library artifacts (this run)

Kind Detail DB id
canonical_skill_proposed Informatica Intelligent Cloud Services | type=Cloud Platforms subtype=general nature=PLATFORM lifespan=MULTI_YEAR
nano JD Parser — gpt-4.1-nano
Role: Data Platform Engineer
Experience: Minimum 3 Year(s) Of Experience Is Required
Domain: Other
Location: Mumbai, India (onsite)
JD type: pass
Raw JSON
{
  "JD_type": "pass",
  "about_company": null,
  "certifications": [],
  "company_name": null,
  "ctc": null,
  "domain": {
    "primary": {
      "aliases": [],
      "domain": "Other"
    },
    "secondary": null
  },
  "education": [
    {
      "level": "null",
      "qualification": "null - null",
      "raw": "15 years full time education",
      "requirement": "required"
    }
  ],
  "experience": {
    "max": null,
    "min": 3,
    "raw": "Minimum 3 Year(s) Of Experience Is Required"
  },
  "job_locations": [
    {
      "aliases": [
        "Bombay"
      ],
      "city": "Mumbai",
      "country": "India",
      "state": null,
      "work_mode": "onsite"
    }
  ],
  "role": "Data Platform Engineer",
  "role_aliases": [
    "Data Engineer",
    "Data Integration Engineer"
  ],
  "role_archetype": "Data",
  "roles_and_responsibilities": [
    {
      "bullet_count": 12,
      "heading": "Roles \u0026 Responsibilities",
      "heading_was_present": true,
      "source_marker": {
        "first_5_words": "- Expected to perform independently",
        "last_5_words": "to junior team members."
      },
      "text": "- Expected to perform independently and become an SME.\n- Required active participation/contribution in team discussions.\n- Contribute in providing solutions to work related problems.\n- Assist with the data platform blueprint and design.\n- Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.\n- Develop and maintain data platform components.\n- Implement data integration processes using Informatica Intelligent Cloud Services.\n- Optimize data workflows and pipelines for efficient data processing.\n- Troubleshoot and resolve data integration issues.\n- Ensure data quality and integrity throughout the data platform.\n- Stay updated with the latest trends and advancements in data engineering.\n- Provide technical guidance and support to junior team members.",
      "word_count": 114
    }
  ],
  "urls": []
}
API 1 — extract-from-jd
{
  "final_skills": [
    {
      "is_primary": true,
      "skill_name": "Informatica Intelligent Cloud Services"
    }
  ],
  "jd_role": {
    "display_name": "Data Platform Engineer",
    "rationale": null,
    "role_aliases": [
      "Data Engineer",
      "Data Integration Engineer"
    ],
    "role_archetype": "Data",
    "slug": ""
  },
  "nano_parsed": {
    "JD_type": "pass",
    "about_company": null,
    "certifications": [],
    "company_name": null,
    "ctc": null,
    "domain": {
      "primary": {
        "aliases": [],
        "domain": "Other"
      },
      "secondary": null
    },
    "education": [
      {
        "level": "null",
        "qualification": "null - null",
        "raw": "15 years full time education",
        "requirement": "required"
      }
    ],
    "experience": {
      "max": null,
      "min": 3,
      "raw": "Minimum 3 Year(s) Of Experience Is Required"
    },
    "job_locations": [
      {
        "aliases": [
          "Bombay"
        ],
        "city": "Mumbai",
        "country": "India",
        "state": null,
        "work_mode": "onsite"
      }
    ],
    "role": "Data Platform Engineer",
    "role_aliases": [
      "Data Engineer",
      "Data Integration Engineer"
    ],
    "role_archetype": "Data",
    "roles_and_responsibilities": [
      {
        "bullet_count": 12,
        "heading": "Roles \u0026 Responsibilities",
        "heading_was_present": true,
        "source_marker": {
          "first_5_words": "- Expected to perform independently",
          "last_5_words": "to junior team members."
        },
        "text": "- Expected to perform independently and become an SME.\n- Required active participation/contribution in team discussions.\n- Contribute in providing solutions to work related problems.\n- Assist with the data platform blueprint and design.\n- Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.\n- Develop and maintain data platform components.\n- Implement data integration processes using Informatica Intelligent Cloud Services.\n- Optimize data workflows and pipelines for efficient data processing.\n- Troubleshoot and resolve data integration issues.\n- Ensure data quality and integrity throughout the data platform.\n- Stay updated with the latest trends and advancements in data engineering.\n- Provide technical guidance and support to junior team members.",
        "word_count": 114
      }
    ],
    "urls": []
  },
  "rejected": false,
  "rejection_reason": null,
  "run_id": "27ea4ff7-d84b-4cc3-bcf1-ed1a2c19dd1c",
  "stage3_signals": {
    "alias_found": true,
    "alias_match_roles": [
      {
        "display_name": "Data Engineer",
        "kra_matches": null,
        "matched_count": null,
        "matched_skills": null,
        "role_id": 2,
        "score": 1.0,
        "slug": "data-engineer",
        "total_count": null
      }
    ],
    "kra_match_roles": [
      {
        "display_name": "Data Engineer",
        "kra_matches": [
          {
            "kra_text": "Implements data quality validation rules, reconciliation checks, and anomaly detection to ensure data completeness, accuracy, and consistency.",
            "sentence": "Ensure data quality and integrity throughout the data platform.",
            "similarity": 0.6593
          },
          {
            "kra_text": "Optimizes pipeline throughput, partitioning strategies, and query performance across cloud data warehouses like Snowflake, BigQuery, or Redshift.",
            "sentence": "Optimize data workflows and pipelines for efficient data processing.",
            "similarity": 0.6523
          },
          {
            "kra_text": "Designs dimensional models, star schemas, data vault structures, and curated data mart tables to support BI tools and self-service analytics consumption.",
            "sentence": "Assist with the data platform blueprint and design.",
            "similarity": 0.6138
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 2,
        "score": 0.6418,
        "slug": "data-engineer",
        "total_count": null
      },
      {
        "display_name": "Svelte Frontend Developer",
        "kra_matches": [
          {
            "kra_text": "backend data integration",
            "sentence": "Troubleshoot and resolve data integration issues.",
            "similarity": 0.5869
          },
          {
            "kra_text": "backend data integration",
            "sentence": "Develop and maintain data platform components.",
            "similarity": 0.5181
          },
          {
            "kra_text": "backend data integration",
            "sentence": "Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.",
            "similarity": 0.5145
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 92,
        "score": 0.5399,
        "slug": "svelte-frontend-developer",
        "total_count": null
      },
      {
        "display_name": "AI Engineer",
        "kra_matches": [
          {
            "kra_text": "Designs and implements prompt engineering workflows, few-shot examples, chain-of-thought patterns, and structured output parsing for AI feature pipelines.",
            "sentence": "Optimize data workflows and pipelines for efficient data processing.",
            "similarity": 0.5375
          },
          {
            "kra_text": "Integrates AI model API responses with application business logic, database writes, event publishing, and downstream service orchestration.",
            "sentence": "Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.",
            "similarity": 0.4868
          },
          {
            "kra_text": "Integrates AI model API responses with application business logic, database writes, event publishing, and downstream service orchestration.",
            "sentence": "Implement data integration processes using Informatica Intelligent Cloud Services.",
            "similarity": 0.473
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 13,
        "score": 0.4991,
        "slug": "ai-engineer",
        "total_count": null
      },
      {
        "display_name": "Python Backend Developer",
        "kra_matches": [
          {
            "kra_text": "Maintain data access and persistence",
            "sentence": "Develop and maintain data platform components.",
            "similarity": 0.5191
          },
          {
            "kra_text": "Maintain data access and persistence",
            "sentence": "Ensure data quality and integrity throughout the data platform.",
            "similarity": 0.4966
          },
          {
            "kra_text": "Troubleshoot server-side defects",
            "sentence": "Troubleshoot and resolve data integration issues.",
            "similarity": 0.4757
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 80,
        "score": 0.4971,
        "slug": "python-backend-developer",
        "total_count": null
      },
      {
        "display_name": "MLOps Engineer",
        "kra_matches": [
          {
            "kra_text": "Validates model performance benchmarks, data schema contracts, and system integration health before signing off on production release readiness.",
            "sentence": "Ensure data quality and integrity throughout the data platform.",
            "similarity": 0.5263
          },
          {
            "kra_text": "Coordinates model promotion workflows across development, staging, and production environments including integration testing and data contract validation.",
            "sentence": "Collaborate with Integration Architects and Data Architects to ensure cohesive integration between systems and data models.",
            "similarity": 0.4824
          },
          {
            "kra_text": "Automates ML platform operations including scheduled retraining triggers, pipeline orchestration, evaluation workflows, and alerting configuration.",
            "sentence": "Optimize data workflows and pipelines for efficient data processing.",
            "similarity": 0.482
          }
        ],
        "matched_count": null,
        "matched_skills": null,
        "role_id": 16,
        "score": 0.4969,
        "slug": "ml-ops-engineer",
        "total_count": null
      }
    ],
    "skill_match_roles": []
  },
  "stage4_decision": {
    "alias_collision_detected": false,
    "case": "A",
    "chosen_role": {
      "display_name": "Data Engineer",
      "kra_matches": null,
      "matched_count": null,
      "matched_skills": null,
      "role_id": 2,
      "score": 1.0,
      "slug": "data-engineer",
      "total_count": null
    },
    "confidence": 1.0,
    "is_new_role": false,
    "llm2_fired": false,
    "llm2_reasoning": null,
    "matched_dimensions": [],
    "matched_kras": [],
    "matched_skills": [],
    "new_role_display_name": null,
    "new_role_slug": null,
    "queued": false,
    "reasoning": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
    "sub_role": null
  },
  "stage5_updates": {
    "centroid_n_after": 251,
    "centroid_updated": true,
    "collision_log_id": null,
    "new_kra_attached": null,
    "new_skills_attached": [
      {
        "is_primary": true,
        "queue_id": 12410,
        "role_display_name": "Data Engineer",
        "role_slug": "data-engineer",
        "skill_name": "Informatica Intelligent Cloud Services",
        "status": "pending"
      }
    ],
    "queue_entry_id": null,
    "v3_pipeline_triggered": false,
    "v3_role_slug": null,
    "v3_run_id": null
  }
}
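
In the `kra_match_roles` entries above, each role-level `score` looks consistent, to within rounding, with the arithmetic mean of the three listed `similarity` values. This is an observation from this run's numbers only, not a confirmed description of how the pipeline scores roles. The sketch below checks it; the `TOLERANCE` value and variable names are assumptions.

```python
from statistics import mean

# (reported role score, listed sentence-to-KRA similarities) copied from the payload above.
kra_match_roles = {
    "data-engineer": (0.6418, [0.6593, 0.6523, 0.6138]),
    "svelte-frontend-developer": (0.5399, [0.5869, 0.5181, 0.5145]),
    "ai-engineer": (0.4991, [0.5375, 0.4868, 0.473]),
    "python-backend-developer": (0.4971, [0.5191, 0.4966, 0.4757]),
    "ml-ops-engineer": (0.4969, [0.5263, 0.4824, 0.482]),
}

TOLERANCE = 1e-3  # assumed slack; the displayed similarities are already rounded

checks = []
for slug, (reported, similarities) in kra_match_roles.items():
    computed = mean(similarities)
    checks.append((slug, reported, computed))
    status = "consistent" if abs(computed - reported) < TOLERANCE else "differs"
    print(f"{slug}: reported={reported:.4f} mean={computed:.4f} ({status})")
```

If the scores were instead a weighted or top-k aggregate over more matches than the three displayed, a check like this would be expected to fail for at least some roles.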
API 2 — extract-details
{
  "alias_matches": [],
  "candidate_roles": [],
  "chosen_role": {
    "display_name": "Data Engineer",
    "id": 2,
    "rationale": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
    "role_archetype": null,
    "slug": "data-engineer",
    "source": "db"
  },
  "dimensions": [],
  "input_final_skills": [
    "Informatica Intelligent Cloud Services"
  ],
  "input_llm_skills": [
    "Informatica Intelligent Cloud Services"
  ],
  "new_aliases_persisted": 0,
  "run_id": "27ea4ff7-d84b-4cc3-bcf1-ed1a2c19dd1c",
  "skills_detail": [
    {
      "aliases_in_db": [],
      "canonical": null,
      "dimensions": [],
      "input_skill": "Informatica Intelligent Cloud Services",
      "matched_via": null,
      "new_alias_persisted": false,
      "new_alias_text": null,
      "new_skill_meta": {
        "derived": {
          "category": "Cloud Platforms",
          "skill_nature": "PLATFORM",
          "sub_category": "general",
          "typical_lifespan": "MULTI_YEAR",
          "version_strategy": "UNVERSIONED",
          "volatility": "MEDIUM"
        },
        "enrichment": null,
        "keep_log": [],
        "locked_dimensions": [],
        "merge_log": [],
        "placed": null,
        "relationships": null,
        "skill_id": "informatica-intelligent-cloud-services",
        "split_log": [],
        "typed": null,
        "warnings": []
      },
      "source_tag": "llm",
      "was_in_llm_skills": true
    }
  ],
  "unmatched_skills": [
    "Informatica Intelligent Cloud Services"
  ]
}
API 3 — final-role-output
{
  "chosen_role": {
    "display_name": "Data Engineer",
    "id": 2,
    "rationale": "Exact alias hit on data-engineer (1.0) \u2014 no other alias at this confidence; skill_top absent does not contradict",
    "role_archetype": null,
    "slug": "data-engineer",
    "source": "db"
  },
  "chosen_role_resolution": "in_db",
  "final_input_skills": [
    {
      "skill": "Informatica Intelligent Cloud Services",
      "tag": "new"
    }
  ],
  "llm_cost_api1_usd": null,
  "llm_cost_api2_usd": null,
  "llm_cost_api3_usd": null,
  "llm_cost_total_usd": null,
  "persistence": {
    "items": [],
    "new_skills_created": 0,
    "role_dimension_saved": 0,
    "skill_dimension_saved": 0,
    "skipped": 0
  },
  "planner_output": null,
  "run_id": "27ea4ff7-d84b-4cc3-bcf1-ed1a2c19dd1c"
}
