How to collect and analyze nominal data
Nominal data is labelled into mutually exclusive categories within a variable. These categories cannot be ordered in a meaningful way.
For example, preferred mode of transportation is a nominal variable, because the data is sorted into categories: car, bus, train, tram, bicycle, etc.
Levels of measurement
The level of measurement indicates how precisely data is recorded. There are 4 hierarchical levels: nominal, ordinal, interval, and ratio. The higher the level, the more complex the measurement.
Nominal data is the least precise and complex level. The word nominal means “in name,” so this kind of data can only be labelled. It does not have a rank order, equal spacing between values, or a true zero value.
Examples of nominal data
At a nominal level, each response or observation fits only into one category.
Nominal data can be expressed in words or in numbers. But even if there are numerical labels for your data, you can’t order the labels in a meaningful way or perform arithmetic operations with them.
In social scientific research, nominal variables often include gender, ethnicity, political preferences or student identity number.
Variable  Categories 

Zip code 

Political preferences 

Employment status 

Literary genre 

Variables that can be coded in only 2 ways (e.g. yes/no or employed/unemployed) are called binary or dichotomous. Since the order of the labels within those variables doesn’t matter, they are types of nominal variable.
How to collect nominal data
Nominal data can be collected through open or closedended survey questions.
If the variable you are interested in has only a few possible labels that capture all of the data, use closedended questions.
What is your gender?  Male Female Other Prefer not to answer 

Do you own a smartphone?  Yes No 
What is your favorite movie genre?  Romance Action Mystery Animation Musical Comedy Thriller 
If your variable of interest has many possible labels, or labels that you cannot generate a complete list for, use openended questions.
How to analyze nominal data
To analyze nominal data, you can organize and visualize your data in tables and charts.
Then, you can gather some descriptive statistics about your data set. These help you assess the frequency distribution and find the central tendency of your data. But not all measures of central tendency or variability are applicable to nominal data.
Republican Democrat Independent Independent Republican Republican Republican Democrat Independent 
Independent Republican Democrat Democrat Democrat Democrat Republican Democrat Democrat 
Democrat Republican Democrat Democrat Independent Republican Republican Democrat Democrat 
Distribution
To organize this data set, you can create a frequency distribution table to show you the number of responses for each category of political preference.
Political preference  Frequency 

Democrat  13 
Republican  9 
Independent  5 
Political preference  Percent 

Democrat  48.1% 
Republican  33.3% 
Independent  18.5% 
Using these tables, you can also visualize the distribution of your data set in graphs and charts.
Central tendency
The central tendency of your data set tells you where most of your values lie.
The mode, mean, and median are three most commonly used measures of central tendency. However, only the mode can be used with nominal data.
To get the median of a data set, you have to be able to order values from low to high. For the mean, you need to be able to perform arithmetic operations like addition and division on the values in the data set. While nominal data can be grouped by category, it cannot be ordered nor summed up.
Therefore, the central tendency of nominal data can only be expressed by the mode – the most frequently recurring value.
Statistical tests for nominal data
Inferential statistics help you test scientific hypotheses about your data. Nonparametric statistical tests are used with nominal data.
While parametric tests assume certain characteristics about a data set, like a normal distribution of scores, these do not apply to nominal data because the data cannot be ordered in any meaningful way.
Chisquare tests are nonparametric statistical tests for categorical variables. The goodness of fit chisquare test can be used on a data set with one variable, while the chisquare test of independence is used on a data set with two variables.
The chisquare goodness of fit test is used when you have gathered data from a single population through random sampling. To measure how representative your sample is, you can use this test to assess whether the frequency distribution of your sample matches what you would expect from the broader population.
With the chisquare test of independence, you can find out whether a relationship between two categorical variables is significant.
1 comment
Pritha Bhandari (Scribbrteam)
August 7, 2020 at 11:52 AMThanks for reading! Hope you found this article helpful. If anything is still unclear, or if you didn’t find what you were looking for here, leave a comment and we’ll see if we can help.