Data Frames
The data.frame
is, like we saw in 8 Data Structures, one of the core R data structures used in Statistics, Machine Learning, Data Science. Data frames store tabular data, where rows usually represent cases and columns usually represent variables. Columns can be of different classes/types, e.g. numeric (float), integer, character, factor.
When working with tabular data structures, core data science techniques to learn include:
- Reading in data (see 11 Base Data I/O)
- Reshaping (22 Reshaping)
- Joins (23 Joins)
See also
data.table
, a modern, highly optimized and powerful alternative todata.frame
.