Projects
Photography Data Analysis
As a group of six UCLA undergraduates in Digital Humanities 101, we analyzed a large dataset from multiple angles and presented our research findings through an exploratory website.
Data sources: The Carnegie Museum of Art (CMOA) https://github.com/cmoa/collection
IMDB Review Sentiment Prediction
As a team of four UCLA Statistics and Data Science undergraduates, we analyzed the IMDB dataset of 50,000 movie reviews and studied the relationship between the textual content of the reviews and the sentiments they express.
Raw dataset: https://drive.google.com/drive/folders/15rhdfkfIeTy9yK2jL00cExFm8qX0_BUI?usp=sharing
We summarized our findings in this Sentiment Paper.
Airbnb Data Analysis
The Airbnb dataset for Los Angeles contains about 45,000 hosting observations. We used exploratory data analysis (EDA), geospatial analysis, and sentiment analysis to investigate how each predictor relates to price.
Raw dataset: http://insideairbnb.com/get-the-data/
More graphs for analysis: extra graphs
Simulated Experiment Design and Analysis
Using a simulated island, we designed an experiment to investigate the effects of different activities on its “inhabitants.”
Here is our Island Paper.
Publications
I assisted Dr. Qi in her research on the Coupling and Coordination of Systems of Citizenization, Regional Economy, and Public Service in China from the Perspective of Sustainable Development. The research report was published by MDPI as an open access article: https://www.mdpi.com/1875680