 Logical Indexing
Logical Indexing
Good! Accessing columns by their names is convenient. Can we filter the rows we want to output?
Indeed, we can. First, we can use indices (like it was for vectors or matrices). But usually, we do not know the positions of the rows but know some conditions we want to satisfy. For example, we may want to extract data for only Males or only people older than 30. You can do it by specifying necessary conditions within square brackets. You need to use the double sign == for equality.
Assume we have data frame data and want to filter to rows having the value 30 in column age. This can be done using the following syntax: data[data$age == 30,]. Note that you put condition as the first index within the square bracket. For example, for the same training data as before, let's extract the data of people older than 30 and males only.
1234567891011# Data name <- c("Alex", "Julia", "Finn") age <- c(24, 43, 32) gender <- c("M", "F", "M") # Creating a data frame test <- data.frame(name, age, gender) # People older than 30 test[test$age > 30, ] # Males only test[test$gender == 'M', ]
As you can see, that's correct.
Swipe to start coding
Using the mtcars dataset, extract the following data:
- The cars pass a quarter-mile in less than 16 seconds (qseccolumn).
- Cars with 6 cylinders (cylcolumn).
Solution
Thanks for your feedback!
single
Ask AI
Ask AI
Ask anything or try one of the suggested questions to begin our chat
Can you explain how to filter for multiple conditions at once?
How do I filter rows based on a range of values?
Can I use other comparison operators besides `==` and `>`?
Awesome!
Completion rate improved to 5.56 Logical Indexing
Logical Indexing
Swipe to show menu
Good! Accessing columns by their names is convenient. Can we filter the rows we want to output?
Indeed, we can. First, we can use indices (like it was for vectors or matrices). But usually, we do not know the positions of the rows but know some conditions we want to satisfy. For example, we may want to extract data for only Males or only people older than 30. You can do it by specifying necessary conditions within square brackets. You need to use the double sign == for equality.
Assume we have data frame data and want to filter to rows having the value 30 in column age. This can be done using the following syntax: data[data$age == 30,]. Note that you put condition as the first index within the square bracket. For example, for the same training data as before, let's extract the data of people older than 30 and males only.
1234567891011# Data name <- c("Alex", "Julia", "Finn") age <- c(24, 43, 32) gender <- c("M", "F", "M") # Creating a data frame test <- data.frame(name, age, gender) # People older than 30 test[test$age > 30, ] # Males only test[test$gender == 'M', ]
As you can see, that's correct.
Swipe to start coding
Using the mtcars dataset, extract the following data:
- The cars pass a quarter-mile in less than 16 seconds (qseccolumn).
- Cars with 6 cylinders (cylcolumn).
Solution
Thanks for your feedback!
single