This project analyzes the representation and themes associated with anime in The New York Times articles from 1981 to 2023 using Semi-Supervised Topic Modeling. It is based on coursework during my exchange semester at the WASEDA University in Tokyo, Japan.
For the full report, look at the Presentation File.
This analysis explores the evolution of anime coverage in The New York Times over four decades. Using advanced natural language processing techniques, specifically Semi-Supervised Topic Modeling, I uncover the main themes and trends in anime-related articles, providing insights into the changing perception and cultural impact of anime in the United States.
- Track the evolution of anime coverage in The New York Times from 1981 to 2023
- Identify and analyze the main topics associated with anime in these articles
- Investigate the "Anime spillover effect" hypothesis, which suggests that increased interest in anime leads to broader engagement with Japanese culture
- R (version 4.3.1)
- Key packages:
The furrr package is particularly noteworthy in this project. It extends purrr's mapping functions to work in parallel, significantly reducing computation time for our topic modeling tasks. By leveraging multiprocessing capabilities, I was able to efficiently train multiple topic models and perform other computationally intensive operations, making the analysis of large text datasets more feasible and time-efficient working within R.
- Data Collection and Preprocessing
- 1,219 articles from The New York Times mentioning "anime" (1981-2023)
- Collected via ProQuest archive query
- Cleaned and preprocessed to remove irrelevant content and standardize text
- Evidence supporting the "Anime spillover effect", with anime-related articles increasingly touching on broader aspects of Japanese culture and society
- Data shows clear diversification of topics alongside the increase in number of articles per year
- Suggest that the idea behind Cool Japan has merit
- I find evidence that alongside the increased coverage of anime, the number of topics it relates to increases.
- People with interest in Anime are possibly more likely to find out about these topics
- Entertainment Cluster: Articles discussing Hollywood Cinema often detail Film Directing techniques, while those on Anime & Manga frequently reference Ghibli Films.
- Cultural Dynamics: Pieces on Culinary Trends tend to explore Cultural Heritage, and those focusing on Fashion Trends commonly link to Asian Beauty concepts.
- Digital Media Correlation: Discourse around Social Media intersects with Digital Media impacts, just as Gaming & Fantasy connects with Comic Culture in digital entertainment.
- Socio-Economic Insights: Education & Campus articles typically involve Urban Housing issues, and Governance & Funding is often discussed alongside Political Dynamics.
anime_nyt_analysis.Rmd
: R Markdown file containing the entire analysisdata/
: Raw data file (anime_texts.txt)output/
: Generated plots and results
- Clone this repository
- You need to gather the texts yourself. I am not allowed to publish content of the New York Times.
- You can access the data by using news archives like ProQuest. Most universities should have access for students. Ask your chair or check your Uni website.
- Not a student? ProQuest access can be quite expensive but you can still use the code as a template to apply it to other topics.
- Open the
anime_nyt_analysis.Rmd
file in RStudio - Install required R packages (listed at the beginning of the R Markdown file)
- Run all chunks in the R Markdown file or knit the document to HTML
- View results in the
output/
directory and in the generated HTML report
This project is licensed under the MIT License.