I build production platforms for LLM systems, data engineering, and cloud infrastructure.
I work at the intersection of large-scale data platforms and applied AI systems: data lakes, lakehouses, orchestration, cloud infrastructure, LLM gateways, agents, RAG, model serving, observability, and cost-aware production operations.
- AI systems: LLM gateways, agents, tool calling, RAG, model serving, grounded generation, cost metering, request telemetry.
- Data engineering: Spark, Snowflake, BigQuery, Databricks Iceberg, Delta Lake, CDC ingestion, orchestration, schema-as-code, lineage, governance.
- Platform engineering: AWS, Azure, GCP, Terraform / CDK / CloudFormation, containers, serverless, CI/CD, observability, reliability.
LLM gateways · agents & tool calling · RAG · model serving · prompt engineering · grounded generation · LLM observability · token-level cost metering
Lakehouse architecture · medallion pipelines · CDC ingestion · schema-as-code · catalog & lineage · PII masking · multi-cloud migrations
Containers · serverless · event-driven architectures · OIDC CI/CD · infrastructure as code · production operations
CI/CD quality gates · alarms & dashboards · DLQ/quarantine flows · request tracing · cost attribution · production runbooks
| Project | Focus | Stack |
|---|---|---|
| rag-evals | CI-friendly RAG evaluation: retrieval metrics, LLM-as-judge faithfulness, and a regression gate that fails the build when quality drops. | Python · BM25 · AWS Bedrock · GitHub Actions |
| aws-snowflake-etl | AWS to Snowflake data pipeline for analytics workloads. | Python · AWS · Snowflake · ETL |
| azure-etl | Azure ETL pipeline ingesting external API data for analytical processing. | Azure · Python · ETL · APIs |
| aws-etl | AWS ETL pipeline using open datasets and cloud storage patterns. | AWS · Python · S3 · ETL |
| gcp-etl | GCP ETL pipeline built around public data ingestion and processing. | GCP · Python · ETL · BigQuery |
| vini-dataengineer | Data engineering study projects across core pipeline patterns. | Jupyter · Spark · SQL · Data Engineering |
| vini-project-covid-data-BR | Brazilian COVID data pipeline and analytics project. | Spark · Kafka · Jupyter · Analytics |



