Student-level demographic information for approximately 98,000 degree-seeking undergraduate students, keyed by student ID. Data at the "student-level" refers to information collected by undergraduate institutions about individual students, for example, age, sex, and race/ethnicity at matriculation.
Usage
data(student)Format
A data.frame and data.table with 13 variables and 97,555
observations of unique students occupying 18 MB of memory:
mcidCharacter, anonymized student identifier, e.g.,
MCID3111142225.institutionCharacter, de-identified institution name, e.g., Institution A, Institution B, etc.
transferCharacter, transfer status, possible values are
First-Time in College,First-Time Transfer.hours_transferNumeric, number of credit hours transferred (or
NA).raceCharacter, race/ethnicity as self-reported by the student, e.g., Asian, Black, Hispanic, etc.
sexCharacter, sex as self-reported by the student, possible values are Female, Male, and Unknown.
age_descCharacter, age group, possible values are
25 and Older,Under 25.us_citizenCharacter, US citizenship, possible values are
No,Yes.home_zipCharacter, home ZIP code (or
NA), e.g.,02056,20170,51301,80129, etc.high_schoolCharacter, code for the last high school attended before admission (or
NA), e.g.,060075,210512,431800,502195, etc.sat_mathNumeric, SAT mathematics test score (or
NA).sat_verbalNumeric, SAT reading test score (or
NA).act_compNumeric, ACT composite test score (or
NA).
Source
2022 MIDFIELD database
Details
Student data are structured in row-record form, that is, information associated with a particular ID occupies a single row—one record per student.
The data in midfielddata are a proportionate stratified sample of the
MIDFIELD database, but are not suitable for drawing inferences about program
attributes or student experiences—midfielddata provides practice data,
not research data.