← Back to history

Pipeline run

9d19bc35-b551-41a5-bc0f-bbb5aec8a408

Client output enrichment

v2 Skill cluster · Nature of work · AI index · Tech stack maturity · Evidence · KRA description

Nature of work

—

no_db_connection

Tech stack maturity

Mainstream Modern

AI index (0 = no AI use, 5 = totally AI-dependent · v2.1)

0.50 / 5

· Title match

✓ Has AI skill

· AI skill (primary)

✓ AI skill (secondary)

· On AI team

· Builds AI products

vocab breakdown (legacy)

Assistants (×1): —

Frameworks (×2): pgvector

Models / concepts (×3): AI

Evidence — skills matched in JD (15)

PostgreSQL SQL Elasticsearch Redis Python Alembic golang-migrate TimescaleDB ClickHouse Kafka Airflow dbt Metabase Superset pgvector

Skill cluster (0 dimension groups, role-scoped)

No dimension groups computed for this JD.

Status: extract_from_jd_done Created: 2026-05-10T13:31:35.451432Z Updated: 2026-05-10T13:31:35.451432Z

Flow Current 3-step pipeline

1 POST /skills/extract-from-jd

2 POST /skills/extract-details

3 POST /skills/final-role-output

Role Chosen role & resolution

No chosen role stored for this run.

Job description

Job Summary

You will design and optimize the database architecture supporting marketplace modules. You'll manage PostgreSQL (including TimescaleDB for time-series data), Elasticsearch for search, Redis for caching, and build ETL pipelines for analytics and reporting dashboards.

Key Responsibilities / Duties

Database Design & Management

Design PostgreSQL schemas for 40+ tables across 6 domains

Implement partitioning strategies for large tables (transactions, listings)

Configure TimescaleDB hypertables for time-series data

Write and optimize complex SQL queries

Manage database migrations (Alembic/golang-migrate)

Search & Analytics

Design Elasticsearch indices for multi-module search

Implement faceted search with aggregations

Build real-time analytics pipelines using Kafka + ClickHouse

Create reporting tables for dashboards (Entity Admin, Finance Manager)

Performance

Implement query optimization and indexing strategies

Configure read replicas for analytics workloads

Set up connection pooling (PgBouncer)

Monitor database performance (slow query logs, pg_stat_statements)

Data Governance

Implement data retention policies

Manage backup and recovery procedures

Ensure GDPR/CCPA compliance for data deletion

Implement data anonymization for M7 (Enterprise Data)

Required Skills & Qualifications

Must-Have (4+ years overall)

SkillProficiencyNotes

PostgreSQL4+ yearsPerformance tuning, partitioning, replication

SQL4+ yearsComplex joins, window functions, CTEs

Elasticsearch2+ yearsIndex design, aggregations, query DSL

Redis2+ yearsCaching, rate limiting

Python2+ yearsETL scripts

Database Migrations2+ yearsAlembic or golang-migrate

Preferred / Good-to-Have Skills

SkillWhy It Matters

TimescaleDBTime-series (price history, utilization metrics)

ClickHouseReal-time analytics

KafkaEvent streaming

Airflow / dbtETL orchestration

Metabase / SupersetDashboarding

pgvectorVector search for AI features

Key Tables / Data Models

Transactions: tables – transactions, escrow_holds | Row growth estimate – 10M/year

Listings: tables – listings, listing_versions | Row growth estimate – 5M/year

Price History: tables – price_history (TimescaleDB) | Row growth estimate – 100M/year

AI Conversations: tables – ai_conversations | Row growth estimate – 50M/year

Skills from this JD

Each row merges API 1 extraction, API 2 library match / v3 orchestration (dimensions + locked dims), and API 3 persistence tags.

PostgreSQL Primary No API 2 row (run stopped after API 1 or history missing)

SQL Primary No API 2 row (run stopped after API 1 or history missing)

Elasticsearch Primary No API 2 row (run stopped after API 1 or history missing)

Redis Primary No API 2 row (run stopped after API 1 or history missing)

Python Primary No API 2 row (run stopped after API 1 or history missing)

Alembic Primary No API 2 row (run stopped after API 1 or history missing)

golang-migrate Primary No API 2 row (run stopped after API 1 or history missing)

TimescaleDB Secondary No API 2 row (run stopped after API 1 or history missing)

ClickHouse Secondary No API 2 row (run stopped after API 1 or history missing)

Kafka Secondary No API 2 row (run stopped after API 1 or history missing)

Airflow Secondary No API 2 row (run stopped after API 1 or history missing)

dbt Secondary No API 2 row (run stopped after API 1 or history missing)

Metabase Secondary No API 2 row (run stopped after API 1 or history missing)

Superset Secondary No API 2 row (run stopped after API 1 or history missing)

pgvector Secondary No API 2 row (run stopped after API 1 or history missing)

Library artifacts (this run)

No artifact rows for this run.

API 1 — extract-from-jd click to toggle

{
  "final_skills": [
    {
      "is_primary": true,
      "skill_name": "PostgreSQL"
    },
    {
      "is_primary": true,
      "skill_name": "SQL"
    },
    {
      "is_primary": true,
      "skill_name": "Elasticsearch"
    },
    {
      "is_primary": true,
      "skill_name": "Redis"
    },
    {
      "is_primary": true,
      "skill_name": "Python"
    },
    {
      "is_primary": true,
      "skill_name": "Alembic"
    },
    {
      "is_primary": true,
      "skill_name": "golang-migrate"
    },
    {
      "is_primary": false,
      "skill_name": "TimescaleDB"
    },
    {
      "is_primary": false,
      "skill_name": "ClickHouse"
    },
    {
      "is_primary": false,
      "skill_name": "Kafka"
    },
    {
      "is_primary": false,
      "skill_name": "Airflow"
    },
    {
      "is_primary": false,
      "skill_name": "dbt"
    },
    {
      "is_primary": false,
      "skill_name": "Metabase"
    },
    {
      "is_primary": false,
      "skill_name": "Superset"
    },
    {
      "is_primary": false,
      "skill_name": "pgvector"
    }
  ],
  "run_id": null
}

API 2 — extract-details

{}

API 3 — final-role-output

{}

LLM Calls

Every model call made for this run, in pipeline order. Click a card to see the model's response.

Loading…