Career Profile

Former Scientific Researcher in Machine Learning looking to enter the private sector and finance world. Broad, interdisciplinary experience in research ranging from biomedical to cosmological. Interested primarily in the intersection of medicine, finance, and technology.

Experiences

Physical Cosmology Researcher

2017 - 2018, 2019
Division of Observational Cosmology and Particle Physics, NYU
  • Detecting stars during the day turning tens of thousands of short-exposure digital photographs into one long exposure photograph, adjusting other parameters in post-processing. Writing software to find star in every exposure, shift all exposures to place star at the same location, and use median-combine (or other method) on the shifted exposures to make an extremely sensitive exposure
  • Using Kepler as a thermometer: dark currents produce 9.5% of the noise on lightcurves on the Kepler spacecraft, which is far more precise in flux detection and SNR than the actual temperature readings of instruments on the spacecraft. The goal of the project is to be able to deduce temperature fluctuations of the CCD photoreceptors from analysis of noise on the lightcurves of tens of thousands of stars.

  • Constructing Bayesian hierarchical models of high-energy transient phenomena (black holes, neutron stars) through X-ray data analysis as well as stellar spectroscopy and exoplanet measurement and discovery using a Pythonic pipeline for time series analysis such as Fourier analysis and (quasi-)periodicity detection.
  • Primarily investigated Boyajian’s Star (Tabetha’s Star), which is known for aperiodic dips in luminosity, by modeling these dips as eclipses or planet transits through my hypothesis centered around it being a brown dwarf

Machine Learning Researcher

2018 - 2019
Division of Behavioral Medicine, Columbia University Medical Center
  • Using machine learning within the context of neural circuits by creating generative adversarial networks as well as pre-trained convolutional neural networks to illuminate the neural circuit governing reinforcement learning under state uncertainty.
  • Rebuilt the lab’s website in full-stack: 99/100 Google PageSpeed Score, Syntax highlighting powered by Rouge, LaTeX math blocks, powered by KaTeX, built-in icons for social media, simple and semantic HTML, structured Data for core entities, Google Analytics and Google Fonts support, Disqus comments
  • Built inexpensive supercomputer for extensive deep learning and data processing utilizing price trends

Projects

Image Recognition - Built a convolutional neural network using a small dataset of time-series dopamine spikes among population ensembles of neurons by employing a pre-trained CNN through Tensorflow. Transformed dataset into heatmaps and wavelets, and wrote an algorithm with Python and OpenCV using distortion correction, image rectification, color transforms, and gradient thresholding. Enhanced this pipeline with Keras, using latest deep learning architecture (VGG16) that is extremely accurate and lean ; essential for our small dataset.
Web Development, and Computer Building - Re-built a laboratory’s website in full-stack and built an inexpensive deep learning computer for extensive deep learning and data processing.
Self-Learned subjects through reading and online lectures - Relevant coursework includes: Probability, Statistics, Design of Algorithms, Linux, R, Bioinformatics, Windows PowerShell, Tensorflow and Keras, Cancer Genomics, Financial Analysis, Deep Learning: GANs and Variational Autoencoders, R Programming, Data visualization, Candlestick trading, Blockchain, Crypto

Skills & Proficiency

NLP - LangChain, Vector DB, LLMs, AWS Sagemaker

Document Extraction - Apache Nutch, JSoup

Information Retrieval - MiniLM, Faiss, PineCone, Vespa

Pipeline Management - Flyte, Airflow, Metaflow, PySpark, Kafka

Deep Learning - ARIMA, Prophet, SARIMAX for time-series forecasting

Deep Learning - LGB, XGB, Catboost, Autoencoders, GAN, FC, LSTM, GRU, RNN for complex data analysis and prediction