SOURAV
ROY
◆ DATA-AI ARCHITECT ◆
PRESS ANY KEY
OR TAP / CLICK TO START
↑ ↓ navigate
? help menu
S jump to sloth
collect coins
COINS0
SCORE0
ACH.0/8
press ?
HOW TO PLAY
◆ CONTROLS & ACHIEVEMENTS ◆

CONTROLS

jump to prev / next section
S
jump to slothDB
?
toggle this help menu
ESC
close overlays
click coins to collect

ACHIEVEMENTS

◆ ALL ACHIEVEMENTS UNLOCKED ◆
YOU WIN
You've explored the whole portfolio. Thanks for playing — now let's actually connect and build something.
COINS0
SCORE0
ACH.8/8
$
PLAYER 1 READY

SOURAV
ROY

data-ai architect shipping pipelines

Qlik Certified Solution Architect building the full data-AI stack — enterprise ETL/ELT on Talend, Python, Snowflake, BigQuery, AWS & GCP, and LLM-powered pipelines.

CHARACTER FILE
Sourav Roy avatar
TAP ME
ETL/ELT
MAX
SNOWFLAKE
92
PYTHON
90
AWS + GCP
85
LLMs
72
$

who / what

$
sourav@data-ai ~ /about
sourav@data-ai $ whoami
Data engineer building the pipes that fintech quietly runs on. By day, I ship production ETL/ELT systems where dependability matters most - pipelines you notice only when they break, and mine rarely do. By night, I fall down Python rabbit holes and wonder what LLMs make possible next.
sourav@data-ai $ cat philosophy.md
I care about systems that just work. Warehouses that answer questions instead of raising them. Audit trails that hold up under scrutiny. Code that saves someone's afternoon.
sourav@data-ai $ ls -la certifications/
# Qlik Certified Solution Architect
sourav@data-ai $
$

the stack

◈ CORE
Talend Snowflake Python SQL PL/SQL
☁ CLOUD
AWS GCP Azure
⊕ DATABASES
PostgreSQL Cassandra MongoDB Oracle DuckDB
✦ AI / LLM
LLM Pipelines Generative AI Vector DBs
⚙ TOOLING
Java Maven Docker Git Linux Shell
◎ EXPLORING
Video automation Blender scripting DuckDB internals
$

what I'm building

EXPLORING

LLM-Powered Pipelines

Injecting language models into data pipelines — classification, extraction, enrichment. Early-stage experimentation on where LLMs genuinely earn their compute in ETL.

PRODUCTION

Enterprise ETL Systems

Talend-based ingestion and transformation pipelines moving transaction data at fintech scale — with audit trails, rollback paths, and reconciliation baked in.

TINKERING

Data Video Automation

Python + FFmpeg + generative AI tooling for automated video post-production. Because side projects deserve their own side projects.

$

github stats

GitHub stats
Top languages
GitHub streak
Activity graph
$

let's talk