Hi, I'm Devdatt Golwala. Currently pursuing my M.S. in Data Science at Columbia University, I'm passionate about Data Science, AI and Software Engineering.
New York, NY
An AI engineer designing personalized, scalable systems that bridge data and product.
Specialized in leveraging Machine Learning, AI and software engineering to solve complex challenges. Experienced across Finance, Healthcare and Enterprise AI with impactful roles at Intuit, Adobe and HPE
Experience
Adobe
AI Researcher
Building a personalized multimodal LLM pipeline for image editing using LLaVA, GPT-4o-Vision, and RAP (Retrieval-Augmented Personalization), leveraging Reddit PSR data, synthetic generation, and an agentic prompt-enhancement workflow.
Modeling user personas in latent space from edit deltas and linguistic patterns to enable persona-aware, instruction-faithful, and style-consistent edits.
Intuit
AI Science Intern
Engineered a personalized tax topic recommendation system for TurboTax using Quantile Regression to predict time spent, reducing abandonment by 32%, lifting CTR by 60%, and increasing engagement by 45%.
Distilled GPT-4.1 as teacher into a LLaMA-3.1-8B student via reflection-based training for tax-field extraction, improving alignment on complex tax fields from 56% to 83%, increasing recall by 52%, and reducing inference cost by 89%.
Columbia University & University of North Carolina, Chapel Hill
Graduate Research Assistant – Robert N. Butler Columbia Aging Center
Built a RAG-based biomedical Q&A system using GPT-4o, LangChain, and vector-based retrieval over research publications, achieving 81.2% top-3 document recall.
Developed an ASR pipeline using the Whisper model to transcribe Add Health study audio, achieving 87.4% precision and 82.1% recall on cognitive assessment tasks.
Hewlett-Packard Enterprise
Software Engineer I
Developed a predictive maintenance microservice for HPE Proliant servers, decreasing downtime by 20%.
Implemented Autoencoders and LSTMs for server temperature data modeling, improving anomaly detection accuracy.
Built a backend framework automating log-to-JIRA defect mapping using spaCy, FastAPI, and ReactJS, cutting turnaround time from 3 days to 1 day.
Streamlined log aggregation via Kafka and Grafana, reducing redundancy fourfold and issue resolution time by 35%.
Reliance Jio
Data Science Intern
Performed exception reporting of network performance KPIs for 12M+ international roaming subscribers, flagging deviations above 10%.
Optimized event creation cycles from 45 to 15 minutes via Spark jobs, cutting reporting time by 67%.
Projects
Independent projects pursued to learn and apply advanced AI and ML concepts 🧩
Retail Demand Forecasting
Developed a two-stage forecasting pipeline on 1M+ retail transactions, integrating NLP-based product clustering (TF-IDF, UMAP, MiniBatchKMeans) with advanced time series models (Prophet, Bi-LSTM, ARIMA), achieving a MAPE as low as 7.65% for daily and weekly sales predictions.