Introduction to Using JMP®

1 / 103

# Introduction to Using JMP®

## Introduction to Using JMP®

- - - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - - -
##### Presentation Transcript

1. WandiHuang Laboratory for Interdisciplinary Statistical Analysis Department of Statistics, Virginia Tech http://www.lisa.stat.vt.edu/ October, 2011 Introduction to Using JMP®

2. Outline • Introduction • Getting Started • Managing Data • Visualizing Data • Creating Summary Statistics • Performing Basic Statistical Analysis • Saving and Exporting Results • Resources

3. About JMP® • JMP was developed by SAS Institute Inc., Cary, NC • Using JMP statistical software, you can • Interact with your graphs and data to discover patterns and relationships in your data • See how the data and the model work together to produce the statistics • Perform statistical summary and analysis

4. JMP Download and Installation • JMP license information • All Virginia Tech students may download JMP free of chargeby going to the Software Distribution Office's Network Software page and logging on using your PID and password • http://www2.ita.vt.edu/software/student/products/sas/jmp/index.html • JMP 9 is available now for both Windows and Mac • Unzip the JMP 9 file, click on the ‘setup’ icon, and follow the instructions for installation

5. Prerequisites • Before you begin using JMP, note the following information: • You can use many JMP features, such as data manipulation, graphs, and scripting features, without any statistical knowledge • A basic understanding of basic statistical concepts, such as mean and variation, is recommended • Analytical features require statistical knowledge appropriate for the feature

6. JMP Terminology • JMP platforms use these windows: • Launch windowswhere you set up and run your analysis • Report windows showing the output of your analysis • Report windows normally contain the following items: • A graph of some type (such as a scatterplot or a histogram) • Specific reports that you can show or hide using the disclosure button • Platform options that are located within red triangle menus

7. Outline • Introduction • Getting Started • Managing Data • Visualizing Data • Creating Summary Statistics • Performing Basic Statistical Analysis • Saving and Exporting Results • Resources

8. JMP Home Window (Windows Only) • Tab + Alt to switch among different windows • Ctrl + Q to quit

9. JMP Data Table • You can enter, view, edit, and manage data using data tables • In a data table, each variable is a column, and each observation is a row • To create a new data table: • Select File > New > Data Table • Ctrl + N • Click on the first icon below the File menu

10. JMP Data Table • This shows an empty data table with no rows and one numeric column, labeled Column 1

11. Entering Data • Manually: • Move the cursor onto a cell, click in the cell and enter a value • Construct a formula to calculate column values • Open the formula editor by right-clicking the column name to which you want to apply the formula and selecting Formula… • Or Double-click the column name to which you want to apply the formula, Column Properties > Formula > Edit Formula • Select an empty formula element in the formula editing area by clicking it

12. Importing Data • You can import many file formats into JMP by default. For example: • Comma-separated (.csv) • .dat files that consist of text • Microsoft Excel 1997–2003 (.xls) • Plain text (.txt) • SAS versions 6–9 on Windows (.sd2, .sd5, .sd7, .sas7bdat) • SPSS files (.sav) • Other files, such as Microsoft Excel 2007 files, require specific Open Database Connectivity (ODBC)

13. Import from Excel Files • File > Open or Ctrl + O or • Or, select all data in the excel spreadsheet, copy, switch to JMP, create a new data table, Edit > Paste with Column Names • Exercise: • Open the SAT.xls excel file in JMP • In the Open Data File window, change ‘All JMP Files’ to ‘All Files’ • Copy and paste data in SAT.xls to a JMP data table

14. Data Table Panels • There are three data table panels • Table panel • Columns panel • Rows panel • The data table panels are arranged to the left of the data grid • These panels contain information about the table and its contents

15. JMP Modeling Types • The modeling type of a variable can be one of the following types, shown with its corresponding icon: • Continuous • Ordinal • Nominal • When you import data into JMP, it predicts which modeling types to use • Character data is considered nominal • Numeric data is considered continuous • To change the modeling type, click on the modeling type icon next to the variable and make your selection

16. Access Sample Data Tables • All of the examples in the JMP documentation suite use sample data. To access JMP’s sample data tables, • Select Help > Sample Data.From here, you can: • Open the sample data directory • Open an alphabetized list of all sample data tables • Search for a sample data table within a category • Alternatively, the sample data tables are installed in the following directory: • On Windows: C:\Program Files\SAS\JMP\9\Support Files <language>\Sample Data • On Macintosh: \Library\Application Support\JMP\9\<language>\Sample Data

17. Saving JMP Sessions • A saved session can help get you back to a previous state without having to manually re-open files and re-run analyses • Select File > Save • By default, JMP asks whether you would like to save the state of your session each time you exit the program • Saving session upon exiting:

18. Outline • Introduction • Getting Started • Managing Data • Visualizing Data • Creating Summary Statistics • Performing Basic Statistical Analysis • Saving and Exporting Results • Resources

19. Adding Rows • To add one or multiple new empty rows, you can take one of the following actions: • Select Rows > Add Rows • Double-click an empty row number area below the last row to add that many empty rows • Double-click the gray lower triangular area in the upper left corner of the data grid. In the Add Rows… window, • Enter the number of rows to add • Specify where you would like to add them • Right-click in an empty row below the last row, and select Add Rows… • Enter the number of rows to add

20. Deleting Rows • To delete rows from the data grid, you can do one of the following: • Highlight the rows that you want to delete, then select Rows > Delete Rows • Right-click on the row numbers and select Delete Rows

21. Adding Columns • To add one or multiple new empty columns, you can take one of the following actions: • Select Cols > New Column • Double-click the empty space to the right of the last data table column • Select Cols > Add Multiple Cols… (or double-click the gray upper triangular area in the upper left corner of the data grid). In the Add Multiple Cols… window, • Enter the number of columns to add • Specify if they are to be grouped • Select a data type • Enter their location • Select the initial data values

22. Deleting Columns • To delete columns from the data grid, you can do one of the following: • Highlight the columns that you want to delete, then select Cols > Delete Columns • Right-click on the column numbers and select Delete Columns

23. Selecting/Deselecting Rows • Select or deselect rows: • Select Rows > Row Selection > Go to Row… to select a certain row number • Select Rows > Row Selection > Select All Rows Select Rows > Clear Row States • Hold down Shift and click the gray lower triangular area in the upper left corner of the data grid to select all rows. Click again to deselect • To clear all highlights in the data table, press the ESC key on your keyboard

24. Selecting/Deselecting Columns • Select or deselect columns: • Select Cols> Go … to select a certain column number or name • Hold down Shift and click the gray upper triangular area in the upper left corner of the data grid to select all columns. Click again to deselect • To clear all highlights in the data table, press the ESC key on your keyboard

25. Selecting Cells with Specific Values • Selecting cells that match the currently highlighted cell • Highlight the cells that contain the value(s) that you want to locate • Select Rows > Row Selection > Select Matching Cells • Selecting cells that contain specific values • Select Rows > Row Selection > Select Where

26. Show/Hide Data • You suppress (hide) rows and columns so they are included in analyses but do not appear in plots and graphs. To do so, you • Select Hide/Unhide from the Rows menu or Cols menu • A mask icon appears beside the hidden row number or the column name, indicating that the row or column is hidden • To unhide rows or columns, you select Hide/Unhide again

27. Include/Exclude Data • You can exclude data from calculations in analyses. For most platforms, excluded data are not hidden in plots. To do so, you • Select Exclude/Unexcludefrom the Rows menu or Cols menu • A circle with a strikethrough appears beside either the row number or the column name, indicating that the row or column is excluded and not analyzed • To un exclude rows or columns, you select Exclude/Unexcludeagain

28. Data Filter • The Data Filter gives you a variety of ways to identify subsets of data • Using Data Filter commands and options, you interactively select complex subsets of data, hide these subsets in plots, or exclude them from analyses • Select Rows > Data Filter

29. Data Filter • Exercise: Select data for Virginia • Open SAT data in JMP • Select Rows > Data Filter • Select State and click Add • Let’s check Select for Virginia • Can also check Show or Include • De-select? Click Clear • Choose another variable? Click Start Over

30. Data Filter • To select/show/include continuous variables such as time or weight, • Use sliders to control selection • Drag the end sliders to select the range you want • Need specific end points? Click on those values

31. Outline • Introduction • Getting Started • Managing Data • Visualizing Data • Creating Summary Statistics • Performing Basic Statistical Analysis • Saving and Exporting Results • Resources

32. Histograms • Histograms visually display the distribution of your data • For categorical (nominal or ordinal) variables, the histogram shows a bar for each level of the ordinal or nominal variable • For continuous variables, the histogram shows a bar for grouped values of the continuous variable • Select Analyze > Distribution

33. Histograms • Exercise: Create a histogram for SAT Math • Open SAT data in JMP • Select Analyze > Distribution • In the Select Columns box, select SAT Math > Y, Columns, then click on OK

34. Histograms • Interacting with the histogram • Change the orientation: • Click on the ▼ red triangle menu> Histogram Options > Vertical • Display the count of within each bar: • Click on the ▼ red triangle menu> Histogram Options > Show Counts • Rescaling the axis (continuous variables only): • Click and drag on an axis to rescale it • Hover over the axis until you see a hand, double-click on the axis and set the parameters in the XAxis Specification window • Resizing histogram bars (continuous variables only): • Click on the ▼ red triangle menu> Histogram Options > Set Bin Width • Hover over the axis until you see a hand, double-click on the axis and set the increment in the XAxis Specification window

35. Histograms • Interacting with the histogram • Clicking on a histogram bar highlights the bar and selects the corresponding rows in the data table • The appropriate portions of all other graphical displays also highlight the selection

36. Scatterplots • Select Analyze > Fit Y by X • Exercise: Plot SAT Verbal vs. SAT Math • Select Analyze >Fit Y by X • Click SAT Verbal in Select Columns box > Y, Response • Click SAT Math in Select Columns box > X, Factor button • Click OK

37. Scatterplots • Interacting with the scatterplots • Suppose we are interested in the points with both SAT Math and SAT Verbal greater than 600 • Point at this point and click on it • The point gets highlighted • The corresponding row (row 274) is also highlighted in the data table

38. Scatterplots • Interacting with the scatterplots • Suppose we are interested in all the points withboth SAT Math and SAT Math > 580 • Shift-click on all the points that satisfied this condition • Or, drag a box over all these points • To deselect, Ctrl-click

39. Scatterplots • Interacting with the scatterplots • Color the selected points red and change the symbol to an empty circle • Right click on the scatterplot • Row Colors • Row Markers • etc.

40. Scatterplots • Interacting with the scatterplots • Suppose those highlighted points are considered as ‘outliers’ and need to be removed from the plot (or the analysis) • Right click on the scatterplot • Row Hide • Row Exclude • ▼ Red triangle menu> Script > Redo Analysis to update the plot

41. Scatterplot Matrix • Using the Scatterplot Matrix platform, you can assess the relationships between multiple variables simultaneously • A scatterplot matrix is an ordered collection of bivariate graphs • Select Graph > Scatterplot Matrix • Select Analyze > Multivariate Methods > Multivariate (continuous data only) • Exercise: • Help > Sample data > Iris • Select Sepal length, Sepal width, Petal length, and Petal width and click Y, Columns • Select Speciesand click Group • Click OK

42. Scatterplot Matrix • To make the groupings stand out, you can: • From the ▼red triangle menu, select Density Ellipses • From the ▼ red triangle menu, select Shaded Ellipses

43. Scatterplot 3D • The Scatterplot 3D platform shows the values of numeric columns in the associated data table in a rotatable, 3D view • Select Graph > Scatterplot 3D • Exercise: • Help > Sample data > Iris • Select Graph > Scatterplot 3D • Select Sepal length, Sepal width, Petal length, and Petal width and click Y, Columns • Click OK

44. Scatterplot 3D • Information Displayed on the Scatterplot 3D Report

45. Scatterplot 3D • Normal Contour Ellipsoids • Exercise: Grouped normal contour ellipsoids • The ellipsoids cover 75% of the data points and are 50% transparent • The contours are color-coded based on species • Help > Sample data > Iris • Select Graph > Scatterplot 3D • Select Sepal length, Sepal width, Petal length, and Petal width and click Y, Columns • Click OK • ▼ Red triangle menu > Normal Contour Ellipsoids • Select Grouped by Column • Select Species • Type 0.75 next to Coverage • Type 0.5 next to Transparency • Click OK

46. Scatterplot 3D • Example of Grouped Normal Contour Ellipsoids

47. Scatterplot 3D • If we select Nonpar Density Contour instead of Normal Contour Ellipsoids, we can create nonparametric density contours

48. Variability Charts • The variability charts are used when we have multiple categorical x variables and one y variable • Select Graph > Variability/Gauge Chart • Exercise: • Help > Sample data > Car Physical Data • Select Graph > Variability/Gauge Chart • Select Weight as Y, Response,Countryand Type as X, Grouping • Click OK

49. Variability Charts • From the ▼ red triangle menu, you can • Connect Cell Means (blue lines are added) • Uncheck Show Range Bars (easier to see points) • Show Group Means (purple lines are added)