Global Stability & Risk Forecasting (GDELT)
Analyzed subsets of a 2PB global events dataset to identify risk trends and support stability forecasting, including anomaly detection and a validated Random Forest model.
I enjoy making things. Here is a selection of projects that I have worked on over the years.
Built end-to-end ETL pipelines with PySpark/Spark SQL to ingest 23GB of logs into a data lake organized with the Medallion Architecture, then delivered KPI dashboards in Tableau/Power BI.
Built a Top-N recommender using implicit-feedback ALS on the Steam-200k dataset, learning player preference profiles from gameplay behavior and recommending similar games.
Led a team of 5 in a security audit and penetration test for a live SaaS client, producing technical and executive reports and mitigating high-priority risks.
Built a fitness coach chatbot powered by the Gemini API, using prompt engineering and LangChain.