300 likes | 427 Vues
This lecture covers the fundamental concepts of data measurement in geographic analysis. It includes the processes for obtaining scores for raw and derived variables, emphasizing operational definitions and measurement scales such as nominal, ordinal, interval, and ratio. Key aspects like the temporal and spatial components of measurement are discussed alongside the methodologies for analyzing and visualizing data distributions using tools like MS Excel. By exploring the terminologies and structures used in geographic data, participants will gain insights into effective data handling and interpretation.
E N D
GTECH 201 Lecture 03Data measurementsData errors
Measurement • The process of obtaining scores for each element for each variable • This can take on a variety of forms – • Producing scores (and distributions) of a widely varying nature • The measurement of raw variables • Derived variables and measurement • Measurement of the time component • Measurement of the space component
Measurement of Raw Variables • Primary and secondary data; control over the measurement process • Operational definitions of the measurement process • Highly varied terminology for the different levels of measurement
Derived Variables and Measurement • Why not use raw variables? • Derived variables can be considered as a ratio (or percent or index) of two numbers – the numerator and the denominator (in the ratio a/b, a is the numerator, b is the denominator)
Measurement of the Time and Space Components • Time is a continuous variable • Date, time and datetime • Space is a continuous variable • (x,y) and (x,y,z) location measurement • Distance and bearing alternative
Measurement Scales • Nominal • Ordinal • Interval • Ratio Most statistical texts follow the original 4-fold measurement structure attributed to a chap called Stevens, written up in 1946:
Geographical Data • Dayta or darta • Data sets used by “geographers” • Geographic vs. spatial data • 3 Quick Questions: • What would a ‘piece of data’ look like? • Must data consist of numbers? • What would a data set look like?
3 Approaches to Data Definitions • Statistical analysis • Spatial analysis • Data base structures (IT) The terminology used in handling data sets (in geography) stems from three different sources. It is now somewhat of a mix of the three.
Some Basic Definitions • Element (= observation = entity) – the smallest (measurable) unit • Population (of elements) (approximately equal to universe) – the totality of all elements. May be finite or infinite, with size known or unknown • Sample – a subset of the population • Variable (= attribute) – a characteristic of the element in which we are interested • Data score (or simply datum/data) – an individual value for an element for a variable • Data distribution – a set of scores for a variable • Data set – an assemblage of related data distributions for a set of elements
Things We Can Do With Data • Data display – a display of a data distribution in the form of a table, graph or map • Data analysis – the analysis of data distributions (including statistical analysis) • Data storage and retrieval – just as the name implies (anything from sheets of paper in a folder to a computer file)
Data Structure • Measurement is the process of obtaining scores for an element • Each element will have an infinite number of characteristics, the relevant ones of which will be measured (the variables or attributes) • The element will also occupy a locational position in time and space; sometimes this position is important, sometimes it is not
Geographic Data Structure • A generalized data structure for each element is: • The data structure that results can be presented (stored) in a data matrix, where the columns are the variables (with or without the temporal and spatial location), and the rows are the elements { (var1, var2, ….., vark), (temporal location), (spatial location) }
Introduction to MS Excel • Opening screen • Typical look of an Excel spreadsheet
User Interface Elements • Title bar • Menu bar • Toolbars • Standard toolbar • Formatting toolbar • Formula bar • Status bar
Entering and Editing Data • Entering Data • Editing Data • F2 • Formula • Double-click
Formatting Text • Just as in MS Word • Alignment • Bold, italics, underline • Font color
Column Width • Choose Format > Column > Width • Drag cursor on separation line • Double click on separation line • Holding left mouse pressed
Mathematical Calculations • Cell arithmetic • AutoSum • Function button
Functions • Function categories • Function wizard
Formatting Numbers • Choose Format > Cells • Before • Toolbar • After
Creating Borders • Choose Format > Cells • Border icon on toolbar
Merge and Center • Icon on Formatting toolbar • Autofill
Printing • Page setup or Print preview
Creating Charts • Step 1 - select the data that you want charted • Step 2 – pull-up chart the wizard
Modifying Charts • Change size and position • Modify other characteristics