Skip to content
View camposvinicius's full-sized avatar
💥
💥

Block or report camposvinicius

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
camposvinicius/README.md

Vinícius Campos

AI & Data Platform Engineer

I build production platforms for LLM systems, data engineering, and cloud infrastructure.

LinkedIn Email


Focus

I work at the intersection of large-scale data platforms and applied AI systems: data lakes, lakehouses, orchestration, cloud infrastructure, LLM gateways, agents, RAG, model serving, observability, and cost-aware production operations.

  • AI systems: LLM gateways, agents, tool calling, RAG, model serving, grounded generation, cost metering, request telemetry.
  • Data engineering: Spark, Snowflake, BigQuery, Databricks Iceberg, Delta Lake, CDC ingestion, orchestration, schema-as-code, lineage, governance.
  • Platform engineering: AWS, Azure, GCP, Terraform / CDK / CloudFormation, containers, serverless, CI/CD, observability, reliability.

AI & LLM

Claude OpenAI Gemini Amazon Bedrock LangChain LangGraph Hugging Face vLLM llama.cpp FAISS OpenSearch PyTorch Ray MLflow Weights and Biases FastAPI Streamlit Stripe

LLM gateways · agents & tool calling · RAG · model serving · prompt engineering · grounded generation · LLM observability · token-level cost metering


Data Platforms

Apache Spark PySpark Snowflake Apache Iceberg Delta Lake Databricks Apache Airflow DataHub Apache Kafka Liquibase AWS Glue Athena EMR Serverless Redshift DynamoDB BigQuery Dataproc Synapse Azure Data Factory HDInsight

Lakehouse architecture · medallion pipelines · CDC ingestion · schema-as-code · catalog & lineage · PII masking · multi-cloud migrations


Cloud & Platform

AWS Azure GCP Terraform AWS CDK CloudFormation Docker Kubernetes Amazon ECS Amazon EKS AWS Lambda Step Functions SQS and SNS API Gateway CloudFront AWS WAF AWS DMS LocalStack

Containers · serverless · event-driven architectures · OIDC CI/CD · infrastructure as code · production operations


DevOps, Databases & Observability

GitHub Actions GitLab CI Bitbucket Pipelines ArgoCD Jenkins UrbanCode SonarCloud Snyk PostgreSQL MySQL SQL Server MongoDB Redis Amazon Aurora Firebase Prometheus Grafana CloudWatch AWS X-Ray

CI/CD quality gates · alarms & dashboards · DLQ/quarantine flows · request tracing · cost attribution · production runbooks


Programming

Python SQL TypeScript JavaScript HCL Bash Jupyter Apache Zeppelin


GitHub Stats

GitHub stats Top languages

Featured Projects

Project Focus Stack
rag-evals CI-friendly RAG evaluation: retrieval metrics, LLM-as-judge faithfulness, and a regression gate that fails the build when quality drops. Python · BM25 · AWS Bedrock · GitHub Actions
aws-snowflake-etl AWS to Snowflake data pipeline for analytics workloads. Python · AWS · Snowflake · ETL
azure-etl Azure ETL pipeline ingesting external API data for analytical processing. Azure · Python · ETL · APIs
aws-etl AWS ETL pipeline using open datasets and cloud storage patterns. AWS · Python · S3 · ETL
gcp-etl GCP ETL pipeline built around public data ingestion and processing. GCP · Python · ETL · BigQuery
vini-dataengineer Data engineering study projects across core pipeline patterns. Jupyter · Spark · SQL · Data Engineering
vini-project-covid-data-BR Brazilian COVID data pipeline and analytics project. Spark · Kafka · Jupyter · Analytics

Pinned Loading

  1. gcp-etl gcp-etl Public

    This is a pipeline of an ETL application in GCP with open airport code data, which you can find here: https://datahub.io/core/airport-codes/r/airport-codes_zip.zip, it's about a zipped .json, which…

    Smarty 15 3

  2. vini-project-covid-data-BR vini-project-covid-data-BR Public

    In this project I performed the ETL process of Brazilian Covid Data made available by the government, using Spark as the main technology for sending data to Kafka and ElasticSearch!

    Jupyter Notebook 1

  3. vini-dataengineer vini-dataengineer Public

    Some data engineering projects using key technologies.

    Jupyter Notebook 3

  4. aws-etl aws-etl Public

    This is an ETL application on AWS with general open sales and customer data that you can find here: https://github.com/camposvinicius/data/blob/main/AdventureWorks.zip, it's a zipped file with some…

    Smarty 18 3