Headshot


Ryan Gilbert


Computer Science and Statistics Student at Boston University. Interested in Machine Learning, AI, Data Science, and Quantitative Finance.

LinkedIn GitHub

Education

Boston University

Expected Graduation: May 2026

Major: Computer Science

Minor: Statistics

GPA: 3.88


Relevant Coursework

CS 440: Intro to AI

Introduction to computer systems that exhibit intelligent behavior, in particular, perceptual and robotic systems. Topics include human-computer interfaces, computer vision, robotics, gameplay, pattern recognition, knowledge representation, planning.


CS 210: Computer Systems

Fundamental concepts of computer systems and systems programming. Hardware fundamentals including digital logic, memory systems, processor design, buses, I/O subsystems, data representations, computer arithmetic, and instruction- set architecture. Software concepts including assembly language programming, operating systems, assemblers, linkers, and systems programming in C.


CS 506: Data Science Tools and Applications

Covers practical skills in working with data and introduces a wide range of techniques that are commonly used in the analysis of data, such as clustering, classification, regression, and network analysis.

CS 541: Probability in Computing

Covers practical skills in machine learning including techniques for clustering, classification, regression, feature selection, and model compression. Emphasizes hands-on application of methods via programming on real- world datasets.


DS 594: Spark! Data Visualization X-Lab Practicum

The Data Visualization X-Lab Practicum offers students an opportunity to learn data visualization skills through course and project-based work. Projects will be completed on a schedule that aligns with topics being covered in class and assignments. This course provides an accurate experience of solving real-world problems with data visualization, and the various tradeoffs that need to be considered. Whether it's how to efficiently use color and space, effectively understand the profile of a dataset or cautiously avoid bias, this course will provide students with a solid understanding of applicable data visualization practices.


CS 330: Introduction to Analysis of Algorithms

Examines the basic principles of algorithm design and analysis; graph algorithms; greedy algorithms; dynamic programming; network flows; polynomial- time reductions; NP-hard and NP-complete problems; approximation algorithms; randomized algorithms.

MA 684: Modern Regression Analysis in R

Application of multivariate data analytic techniques. Multiple regression and correlation, confounding and interaction, variable selection, categorical predictors and outcomes, logistic regression, factor analysis, MANOVA, discriminant analysis, regression with longitudinal data, repeated measures, ANOVA.


CS 320: Concepts of Programming Languages

Concepts involved in the design of programming languages. Bindings, argument transmission, and control structures. Environments: compile-time, load-time, and run-time. Interpreters.


MA 416: Analysis of Variance

Fundamental concepts and analytical skills in analysis of variance, including crossed and nested designs, as well as fixed- and random- effect models. Trend analysis for repeated measures, expected mean squares, and non-parametric techniques. SAS is used throughout the course.

Experiences


Boston University - Research Computing Services


Technical Intern

Jan 2025 – Present

  • Built an agentic Retrieval-Augmented Generation (RAG) chatbot using ChromaDB and the OpenAI API to support users of Boston University's HPC cluster, integrating scraped documentation and internal files; implemented a FastAPI-based client-server architecture with SSE streaming, JWT-based session management, and logging for real-time interaction and system monitoring.
  • Developed data processing and analytics tools to detect and reduce inefficient GPU job usage, leveraging Python, Pandas, and visualization techniques to improve resource allocation on a high-performance computing cluster.

Boston University - Image and Video Computing Lab


Research Assistant (Paid)

Dec 2024 – Present

  • Developed an interactive web application for visualizing diffusion model embeddings and interpreting black-box machine learning models using K-SAE clustering, enabling intuitive analysis of high-dimensional datasets.
  • Implemented a Gradio web interface for comparing standard and modified Stable Diffusion model outputs, allowing users to experiment with concept steering.


BU Finance and Investment Club


VP, Lead of BUAlpha

Sep 2022 – Present

  • Led a team of 50+ members, conducting workshops on topics such as Monte Carlo methods, Fama-French models, and Statistical Arbitrage.
  • Cooperated with other members to research and develop quantitative finance based projects, including a trading strategy achieving a 22.5% return in a six week simulated test.


MassMutual Data Days for Good


Data Scientist Mentee

June 2023

  • Spearheaded a team of student mentees, data scientists, and others at Mass Mutual to analyze voter history and demographics datasets (about 10 million rows) to uncover trends among voters in Boston.
  • Delivered an interactive website featuring maps and charts, as well as a concise presentation summarizing conclusions to support our client.

Certificates

JPMorgan Chase & Co. Quantitative Research Virtual Experience Program on Forage - February 2024
Certificate
  • Forage provides company sponsored job simulations for students to learn skills
  • Completed a simulation focused on quantitative research methods
  • Analyzed a book of loans to estimate a customer's probability of default
  • Used dynamic programming to convert FICO scores into categorical data to predict defaults

Check out the simulation here: https://www.theforage.com/simulations/jpmorgan/quantitative-research-11oc

Lyft Back-End Engineering Job Simulation - October 2023
Certificate
  • Completed the Back-End Engineering job simulation, taking over development of an unfinished project for the Lyft Rentals team.
  • Drafted a UML class diagram representing a new reorganized architecture.
  • Refactored a messy codebase inherited from another team to accurately reflect my new design.
  • Implemented unit tests and added new functionality using test-driven development.

Check out the simulation here: https://www.theforage.com/simulations/lyft/back-end-engineering-he82

PwC Switzerland Power BI Job Simulation on Forage - May 2024
Certificate
  • Completed a job simulation where I strengthened my PowerBI skills to better understand clients and their data visualisation needs.
  • Demonstrated expertise in data visualization through the creation of Power BI dashboards that effectively conveyed KPIs, showcasing the ability to respond to client requests with well-designed solutions.
  • Strong communication skills reflected in the concise and informative email communication with engagement partners, delivering valuable insights and actionable suggestions based on data analysis.
  • Leveraged analytical problem-solving skills to examine HR data, particularly focusing on gender-related KPIs, and identified root causes for gender balance issues at the executive management level, highlighting a commitment to data-driven decision-making.

Check out the simulation here: https://www.theforage.com/simulations/pwc-ch/power-bi-cqxg

BCG Data Science Job Simulation on Forage - May 2024
Certificate
  • Completed a customer churn analysis simulation for XYZ Analytics, demonstrating advanced data analytics skills, identifying essential client data and outlining a strategic investigation approach.
  • Conducted efficient data analysis using Python, including Pandas and NumPy. Employed data visualization techniques for insightful trend interpretation.
  • Completed the engineering and optimization of a random forest model, achieving an 85% accuracy rate in predicting customer churn.
  • Completed a concise executive summary for the Associate Director, delivering actionable insights for informed decision-making based on the analysis.



Check out the simulation here: https://www.theforage.com/simulations/bcg/data-science-ccdz

Projects

Agentic RAG Chatbot for HPC Support

Associated with BU Research Computing Services RAG Chatbot

Engineered a Retrieval-Augmented Generation (RAG) chatbot using ChromaDB and the OpenAI API to support users of Boston University’s high-performance computing cluster. The system integrates scraped documentation and internal files with tool-use capabilities to access real-time system data. Built with FastAPI, the client-server model features SSE streaming, JWT-based session handling, and full interaction logging.

Technologies: Python, FastAPI, OpenAI API, ChromaDB, SSE, JWT

GPU Job Efficiency Analytics

Associated with BU Research Computing Services GPU Analytics

Created a suite of data processing and visualization tools to analyze GPU resource utilization on BU’s HPC cluster. Detected inefficient job usage patterns by analyzing job logs using Pandas and visualizations, helping RCS optimize job scheduling and resource allocation.






Technologies: Python, Pandas, Matplotlib

Diffusion Embedding Visualizer

Associated with BU Image and Video Computing Lab Embedding Visualizer

Developed an interactive web application to visualize and interpret latent embeddings of diffusion models using K-SAE clustering. The tool helps researchers explore and understand high-dimensional model representations through an intuitive interface.






Technologies: Python, K-SAE, Flask

Gradio Interface for Concept Steering in Diffusion Models

Associated with BU Image and Video Computing Lab Gradio Diffusion Tool

View Gradio Tool

Implemented a Gradio-based web interface for side-by-side comparison of outputs from baseline and modified Stable Diffusion models. Allows researchers to experiment with parameters and visualize concept steering effects in real-time.





Technologies: Python, Gradio, Stable Diffusion, Hugging Face

CNN Based Reinforcement Learning for Algo Trading

Associated with BUFC Project

View Presentation

Researching a CNN based reinforcement learning agent trained on candlestick data to trade the market. The underlying models use TensorFlow's sequential and functional APIs. Image generation was optimized by utilizing the numba library in Python. This project is associated with BUFC, and was presented during the end of semester presentations. I am still working on this project, and plan to write a paper to provide detailed information about my research.

Technologies: Python, TensorFlow, Numba

Mean Reversion Inspired Algorithmic Trading

Associated with BUFC Project

View Presentation

Developed and live tested an algorithmic trading strategy for the 2024 Spring semester. Succesfully created a model which significantly beat the market during our live test. Heavily leveraged Jupyter notebooks to develop a strategy that relies on HistGradientBoostingRegressor, a Scikit-Learn model. Presented work to the rest of the club and Alumni Advisory Board.




Technologies: Python, Scikit-Learn, Jupyter

Live Machine Learning Based Algorithmic Trading

Associated with Personal Algo Trading
Project


Developed multiple algorithmic trading strategies that leverage machine learning to identify profitable opportunities in the forein exchange markets. Implemented a program hosted on an AWS EC2 instance to trade a variety of assets. The program serves lives predictions on fresh pricing data updated every fifteen minutes. I am still actively working on this project.



Technologies: Python, AWS EC2, SSH, Scikit-Learn, Pandas, NumPy

Voting Patterns in Boston

Associated with Mass Mutual x BU Spark! Data Days for Good Program Project

View Source Code, Website, and More

Collaborated with a team of student mentees, data scientists, and others at Mass Mutual to analyze voter history and demographics datasets (~10 million rows) to uncover trends among voters in Boston. Our deliverable included an interactive website featuring maps and charts, as well as a concise presentation summarizing the problem and our conclusions to support our client.


Technologies: Python, Pandas, Flask, HTML, JavaScript, CSS

Tetris Reinforcement Learning Agent

Associated with CS440

Project

View Course Description

Learned about various artificial intelligence topics in CS440. The final assignment was to create a reinforcement learning agent to play a modified version of Tetris. Designed features, neural network architecture, reward function, and exploration function. Agent succesfully learned how to play Tetris.





Technologies: Java

Grid Trading Strategy (Implemented Live)

Associated with Personal Algo Trading

Project


Created a "grid strategy" that profits off volatility in the foreign exchange market. Due to the nature of the strategy, a large strategy parameter optimization was done. The model was deployed live on an AWS EC2 instance, where it automatically managed orders through the OANDA API. The strategy's performance reached a maximum return of 52%; however, after a large drawdown, I stopped the live trading, ending with a return of 11%.

Technologies: Python, AWS EC2, SSH, Pandas, NumPy

LLM Powered Notes App

Afternoon project Project

View Github

Made a Tkinter-based note-taking application enhanced with LLM (Large Language Model) capabilities, allowing users to receive autocompletion suggestions based on their inputs. The app allows users to write notes, save files, open existing files, and undo changes.







Technologies: Ollama, LLM, Python

Algo Trading VIX With Random Forest

Associated with BUFC Project

View Presentation

Explored various strategies utilizing a RandomForest model to predict the VIX index's next day direction. Backtest revealed promising results for the strategy. Presented work to the rest of the club.








Technologies: Python, Scikit-Learn, Jupyter

Competitions

Optiver - Trading at the Close

337/4436 (prelim), 608/4436 (final) Project

View Presentation

Forecasted stock price movement for the closing period using orderbook data. Employed various techniques such as K-fold cross validation, ensembling, and more.


Technologies: Python, LightGBM, Scikit-Learn

NESS Statathon Competition 2023

Won HSB Theme, Best LB Score for Travelers

Project

View Github 1

View Github 2



Placed first in the HSB theme analyzing sensor time series data. Had the best score for the Travelers insurance fraud detection theme using a custom RandomForest-based model. Presented further findings and business implications to a panel of judges.

Technologies: Python, Scikit-Learn, Pandas, NumPy, TensorFlow

Tech For Change Hackathon 2024

Won Best Overall Hack Project

View Devpost Submission

Implemented a disease prediction website with cosine similarity and GPT models. Users input symptoms and potential predictions are returned with further information. Led my team by effectively exploiting differing skill sets.

Technologies: Python, Flask, OpenAI ChatGPT API, HTML, JavaScript, CSS

Georgia Tech Hacklytics 2024

Participant

View Devfolio (Demo & Source Code)

Created an interactive site where vistors can learn about and create trading strategies. Users can set parameters and see backtested performance of a pairs trading or machine learning strategy. Demoed live to various judges.

Technologies: Python, Flask, HTML, JavaScript, CSS

IMC Trading Prosperity 2023

Top 2.5% Project

Placed in top 2.5% of in IMC Prosperity, a quant trading competition/hackathon. Used various algo trading techniques such as market making, seasonality identificaiton, correlation analysis, and more.



Technologies: Python

BU Spark! Art + Computing Hack

Won Tech Innovation Theme Project

Created a web app where users can play with AI generated image puzzles.






Technologies: Python, Flask, HuggingFace API, HTML, JavaScript, CSS

HDSI Agri Datathon 2024

Second Place Project

View link to paper

Built a parallelized pipeline to isolate agricultural land, extract NDVI values, and analyze crop phenology using SARIMA and LSTM models. Addressed missing data with stochastic imputation. The project helps predict agricultural trends and assess food security.

Technologies: Python, statsmodels, multithreading, TensorFlow, GeoTIFF

Welcome Back Mini Hack 2024

Participant Project

View Github

Built a Flask-based web app that provides therapy solutions for users struggling with different mental health issues, using principles of Cognitive Behavioral Therapy (CBT). The app walks users through a series of questions and provides appropriate therapy solutions based on their responses.

Technologies: Ollama, llama3.2, Python, Flask

NESS Statathon Competition 2024

Third Place Project

View Presentation

Created an accurate model capable of predicting the probability of an insurance policy converting. Presented findings and business implications.



Technologies: Python, Scikit-Learn, Pandas, NumPy, LightGBM

BostonHacks Mini Hacks

Second Place Project

Developed a Flask based webapp designed to centralize and gamify BU's surveys by providing students with a unique way of earning rewards.



Technologies: Python, Flask, HTML, JavaScript, CSS

Jane Street’s Mystery Planet

Participant Project

Invited to Jane Street’s escape room/puzzle hunt event. Competed with a team to solve puzzles and engage in a market game. Learned about the problems Jane Street works on.


Technologies: Puzzle solving, teamwork, communication

CS506 Class Competition

4/189 Project

View Github

Used NLP techniques to analyze a dataset of Amazon movie reviews to predict the star rating of a review. Achieved a 4th place finish out of 189 students in the class competition.

Technologies: NLTK, LightGBM, Scikit-Learn