A builder who happens to do data at scale.
Multilingual senior data engineer with 19+ years building high-impact, scalable data infrastructure across the tech, financial, and insurance industries. I design cloud-native data lakes, real-time and batch pipelines, and AI-powered metadata platforms on the AWS ecosystem.
At Amazon I led the architecture of a 70+ PB centralized data lake built with AWS Glue (Spark), Lake Formation and Athena, supporting 6,000+ internal users. Certified for Red Security (highly critical data), it replaced a legacy ETL system and introduced cost-effective scalability, observability, and cross-team data transparency.
My work ranges from advanced data modeling and warehousing on Redshift and Lake Formation to real-time pipelines with Kinesis, Glue Streaming and Lambda, integrating RDS, DynamoDB and OpenSearch, with Airflow for orchestration. I led a Data Mesh rollout on AWS DataZone, enhanced with LLMs and machine learning to curate datasets, standardize metrics, and enable self-service discovery across Amazon Operations.
On the MLOps side, I have delivered infrastructure for ML applications on SageMaker, Bedrock and Kendra, supporting scalable training, secure deployment, and intelligent search. I am also a dedicated mentor and hiring contributor, supporting the talent pipeline through engineering mentorship and technical interviews. Off the clock, I tinker in a homelab and answer to Freddie.
Led the architecture and delivery of a 70+ PB centralized data lake on AWS Glue, Redshift, Lake Formation and Athena (Presto), supporting over 6,000 internal users. Replaced legacy Redshift ETL pipelines, earned Red Security certification for highly critical data, and cut storage and compute costs by 40%. Spearheaded a Data Mesh rollout on AWS DataZone for a governed, self-service business catalog, and mentored engineers across teams on architecture, code quality and operational excellence.
Built technologies to streamline data consumption across Property and Casualty insurance and risk, using AWS CDK, Glue, Lambda, ECS Fargate, ECR, Transfer Family, Lake Formation, Redshift and Athena. Set up and managed data lakes, optimized pipelines, and delivered data ingestion applications in Python, JavaScript and TypeScript.
Improved ingestion and table performance for all Salesforce datasets by automating object ingestion into Redshift with Python (Boto3), S3, Data Pipeline and EC2. Designed the Career Choice program data warehouse on Redshift and Glue with self-service tools like QuickSight, and raised code quality with standards and automated deployments on CodeCommit and CodePipeline.
Led enterprise transformation in Property and Casualty insurance across 7+ countries in Latin America, Europe and the US, focused on risk, claims and underwriting operations. Designed a future-state data warehouse for the AIG Shared Services Center, unifying many sources and automating 1,400+ reports. Built productivity tools that cut policy issuance cycle times, and trained regional leaders on Lean Six Sigma.
Developed and presented analyses and recommendations to executive and Board-level audiences, identifying, validating and refining profitable strategic actions. Led analysts through complex business analysis using SAS, Oracle, SQL Server and advanced Excel (VBA), and delivered two global KPI dashboards spanning HR, Finance, Operations, Customers and Claims for the LAC and US regions.
Integrated systems and technologies by coordinating internal and external partners worldwide, streamlining decision-making. Designed and supported complex ETL and ODS processes from many database sources using SAS, Oracle and Microsoft tools, and validated documentation through the Change Approval Board to reduce implementation errors.
Built the first auto insurance data warehouse in a client environment, following the SDLC methodology: coordinated with business users, designed the data model, and programmed ETL and ODS processes with SAS BI, SAS DI and SAS Guide to extract data from the eBao policy management system.
Implemented a collections employee scorecard by consolidating company performance data into a single database, and cut the runtime of a business-as-usual (BAU) process by 90% by rewriting SAS code and removing bottlenecks. Built the first collections web dashboard from scratch with MySQL, PHP, HTML5, jQuery and CSS for managers to track results.
Worked on Brazil's Credit, Fraud and Collections enterprise data warehouse, defining and validating KPIs with internal and external vendors. Automated 60% of a back-office process while leading a small team, and cut a collections dashboard refresh from 60 to 30 minutes by redesigning the process and database.
Cut a collectors' scorecard process from 8 hours to 20 minutes by automating database extractions and the Excel dashboard with SAS Base and VBA. Built collections data warehouse tables and views in PL/SQL, and automated the boleto creation process to the CNAB bank standard, reducing customer complaints.