David TianAddress Matching through Cosine SimilarityAddress matching, at it’s surface, may seem like a very intuitive and simple process. However, at my previous job where I knew nothing…3 min read·Sep 12, 2021----
David TianCommon tools used to combat fraudPiggybacking off of my prior blog-post talking about anomaly detection, this blog will focus on the common tools used to combat fraud. In…3 min read·Sep 9, 2021----
David TianAnomaly Detection: Algorithms, Explanations, Applications — An Isolation Forest SummaryThis will be a summary of a one and a half hour talk on Anomaly Detection by Thomas Dietterich, specific toIsolation Forest.3 min read·Sep 8, 2021----
David TianMy First Kaggle Competition — my experience and how I improved my RMSE by 30% using Hugging Face…I recently attempted my first Kaggle Competition, completely solo. $60,000 in prize money was on the line, and although I knew that there…5 min read·Jul 21, 2021----
David TianAdvantages and Disadvantages of Various Machine Learning AlgorithmsThe inspiration for this blog post came from a question one of my classmates in my data science boot camp asked me during one of my…3 min read·May 20, 2021----
David TianData Science and League of Legends — analyzing my personal gameplay data6 min read·May 5, 2021--1--1
David Tian1000x faster data manipulation : vectorizing with Pandas and Numpy— A SummaryIn this blog post, I will be summarizing Nathan Cheever’s presentation at PyGotham 2019.5 min read·Apr 21, 2021----
David TianWhy did I decide to learn data science?https://enterprisersproject.com/sites/default/files/styles/620x350/public/images/cio_rpa_robot_automation.png?itok=RZiEiLwx4 min read·Apr 6, 2021----