Which variables are continuous (can take on any value in some reasonable range, such as weight) vs which are categorical (take on a set number of values where each represents something). If the data is repeated measures in some form (multiple rows per person, or data is students across several classrooms), what variable(s) identify the levels and groups?Īre there any variables which are strings (non-numeric) that you plan on using in some statistical analysis (and will need to be converted to numeric)? Is there a variable which uniquely identifies each unit of analysis? What is the unit of analysis? (What does each row represent - a person? a couple? a classroom?) How many observations does your data have? You should be able to answer the following questions (even if some of them are approximations): Whenever you first begin working with a data set, you’ll want to spend some time getting familiar with it. Feel free to make frequent use of preserve and restore. You can reload it as necessary (if you modify it and want the original) by re-running this with the clear option. Throughout this chapter, we’ll be using the “auto” data set which is distributed with Stata. 5.8.2 Converting strings into labeled numbers.5.8.1 Converting between string and numeric.5.8 Working with strings and categorical variables.3.6.1 Temporarily preserving and restoring data.3.5.3 Importing from a file not supported directly by Stata.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |