Analytics of The Oscars: Best Picture Winner Word Clouds

160119125640-oscars-tease-full-169.jpg

Oscar nominations for 2017 will be revealed on January 23. As a tribute to the award-giving body, let’s do some basic text analytics on Oscar-worthy films.

In this two-part post, we look at word clouds made out of plot summaries from Best Picture nominees and winners in the 1928 Oscars ceremony up to 2016.

Plot summaries were scraped from Wikipedia with Beautiful Soup and subsequently processed with Pandas, Textblob and NLTK. Only nouns, adjectives and verbs were retained, and proper nouns and stop words were filtered out. Check out the Jupyter notebook here.

Here are the word clouds for Best Picture winners, partitioned by decade.

 

winner_1920s

winner_1930swinner_1940swinner_1950swinner_1960swinner_1970swinner_1980swinner_1990swinner_2000swinner_2010s

See something interesting? What is your 2018 Best Picture winner? Leave a comment below.

Leave a Reply