History through the President’s Words
I got this great data gathering idea from an awesome article on the Washington Post, which analyzed the State of the Union Addresses of the various US presidents to determine hot-button issues in the United States. I decided to apply the same methodology to the Philippines and extract word counts out of the various State of the Nation Addresses, from Cory Aquino in 1987 to Noynoy Aquino in 2013.
Length of Speech and the Riffraff Index
First, let’s take a look at a very basic statistic - the length of the speech in words - as well as something I constructed called the Riffraff Index, which is the percentage of words in the SONA that do not specifically relate to any national issue. What these words are, and how they were selected are at the next section.
The shortest speech was delivered by Gloria Macapagal-Arroyo in 2005 at only 1,600 words, while the longest was actually the most recent one by Noynoy Aquino, clocking in at a whopping 13,300 words.
But a long speech doesn’t mean a meaningful one - you could fill your speech with buzzwords and riffraff to make it longer, so I constructed the Riffraff Index - the percentage of words that do not appear to relate to a specific national issue (the higher the index, the more riffraff). The highest amount of ‘riffraff’ was during Cory Aquino’s 1991 farewell speech - a very heartfelt and emotional speech, while the lowest was during Ramos’ 1993 speech - one where Fidel Ramos identified key issues and discussed them in immense detail.
You can also see that the amount of riffraff increases during election periods (1991-1992, 1997-1998, 2004) except for the most recent one.
That’s not all we could do with this data. We can also pinpoint the specific national issues each president spent the most time talking about in his/her SONA, so we identified keywords from those that were mentioned in at least 14 out of the 27 speeches, and also those that were mentioned at least 5 times in any one speech. We then categorize them into various issues, as follows:
Economy, strife, law and order, and poverty are quite often discussed across the administrations, and varying degrees exist for other issues. Environment is least talked about in presidential speeches, and understandably so for a developing country with a small environmental footprint.
You can also observe that issues tend to be most talked about during the middle of a President’s term, and the values shrink as they become mum during election periods - probably to stay on the safe side.
But this streamgraph doesn’t make it easy to compare the priorities of each president relative to each other, so I constructed a heatmap that compares the favorite issues of each chief executive relative to his predecessors and/or successors:
Cory Aquino stayed pretty much balanced across the various issues. Fidel Ramos talked mostly about curbing poverty, protecting the environment, and improving the economy. Estrada talked about little else than strife, law, & order - his last SONA talked almost exclusively about the MILF and Mindanao. GMA focused on workers, jobs, wages, and education, which she has consistently stressed throughout her administration. Finally and not surprisingly, our current President is big on employment as well as graft, corruption and justice.
SONA Interactive Word Counter
All this information is too rich to summarize easily, so I’ve decided to construct a simple tool (like Google’s n-gram viewer) to let you explore the SONA data for yourself. You can enter up to 5 words in the white boxes, and the line graph on the right will show how many mentions that particular word gets in each SONA from 1987’s Aquino to 2013’s Aquino.
You might have to wait a few seconds to update the chart upon entering information. Sorry for the clunky Excel implementation, as I have very little coding ability. (Use a space to clear out a cell)
Hope you enjoy discovering new trends with the tool! If you find an interesting pattern, please don’t hesitate to share it in the comments!
hanks for reading! If you enjoyed reading, I’d appreciate it if you shared this with your friends or comment below. Data and computation inquiries can be made through the contact form or the comments.
On Moving People, not Cars
With the rainy season in full swing, monstrous traffic jams yet again plague the Philippine capital. There is a lot of blame to go around - some say we're too reliant on private transportation, and others say undisciplined, unregistered, and reckless buses are the real culprit. Let's take a closer look at the data and see if it can provide some clarity.
On Getting to Know the Filipino Informal Settler
How well do you know the Filipino informal settler? Is he poor and underprivileged? What happens if you force him to pay rent? Well, short of actually starting a conversation with one, we can get to know the Filipino informal settler through data. Read on to find out more.
On Trains and Tribulations
: The LRT South Extension and Fare Hikes
The Light Rail Transit Line 1 South Extension Project is now out for bidding, with the contract to be awarded in the second quarter of this year. The rail system is under intense public scrutiny amidst talks of fare hikes, but as in most cases, a look at the data can lend some meaningful context to these debates: fare hikes, capacity constraints, and South Extension feasibility.