<< Chapter < Page | Chapter >> Page > |
The 5 W’s are questions we ask in order to understand our data. Knowing the answers to these questions helps to inform the decision making process, guiding the researcher in the most appropriate ways to graph and analyze their data.
The questions we are answering when examining the 5W’s of a dataset are; who, what, why, where, when, and how. Yes, I know that how does not start with a w. The ‘who’ question tell us the cases or who the data is coming from. When we collect data from a survey we call these cases respondents. If the data is collected via an experiment we would call the cases subjects or participants. If we are collecting data from an inanimate object then the cases are experimental units. The important thing to understand is that the ‘who’ are the respondents, the people or things that are answering your questions.
The ‘what’ question of the 5 W’s refers to the variables or the questions you have for the respondents. These are the things you want to know from them. These variables are either numerical, resulting in a measured number value, or categorical, resulting in a word or category answer.
The ‘why’ question is the purpose of the study, the reason you are asking questions of the cases. This is your guiding research question. To proceed with the analysis of data you always need to know the who, what, and why. It is important that you understand who your data is coming from (the source), what data you have (the questions asked), and why you have this data (what research question you are trying to answer.)
The where, when, and how are nice things to know and when you collect your own data you will know the answers to these questions, but they are not always known if you are working with someone else’s data. The ‘where’ question is the geographic location of the data collection. If you want to know about students’ usage of the library on campus where you collect your data will be important. You may get different results if you survey students in the library versus the student union.
‘When’ is the date and time of the data collection process. If we want to know how satisfied students are with their course schedule we may get different results if we survey students in classes that meet at 8:00 a.m. versus ones that meet at 11:30 a.m. The last question is ‘how’, the data collection process. This question encompasses the where and when, but goes further to examine the data collection process. If you are surveying individuals, which sampling methodology will you use? Will you interview respondents or hand out paper copies of your survey? How will bias be minimized in the data collection process? All of these considerations fall under how.
Pew Charitable Trusts conducted a longitudinal study that has followed families from 1968 to the present to gain a better understanding of the American Dream and economic mobility. They surveyed 2227 American families asking, “What is your “Family income” including all taxable income (such as earnings, interest, and dividends) and cash transfers (such as Social Security and welfare) of all family members?” Identify the 5W’s of this example.
Who: 2227 American families What: Family incomeWhy: To understand the relationship between the American Dream and economic mobility. Where: United States of AmericaWhen: 1968 to present How: Longitudinal study
Notification Switch
Would you like to follow the 'Collaborative statistics using spreadsheets' conversation and receive update notifications?