html5-img
1 / 75

Assessment Workshop Creating and Evaluating High Quality Assessments

Assessment Workshop Creating and Evaluating High Quality Assessments. Dr. Deborah Brady. Agenda. Introductions: Overview Break at about 10, lunch at about 12, session ends at about 3:00 Morning presentation (with frequent processing breaks) and afternoon time for beginning to plan

giza
Télécharger la présentation

Assessment Workshop Creating and Evaluating High Quality Assessments

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Assessment Workshop Creating and Evaluating High Quality Assessments Dr. Deborah Brady

  2. Agenda • Introductions: Overview • Break at about 10, lunch at about 12, session ends at about 3:00 • Morning presentation (with frequent processing breaks) and afternoon time for beginning to plan • High quality Assessments (DESE criteria) • Tools to evaluate assessments • Tools to track all educators’ DDMs • Quality Tracking Tool • Educator Alignment Tool • Measuring Student Growth • Direct measures • Local alternatives to determine growth • Pre-/Post, Holistic Rubrics, Measures over time, Post-test only • “Standardization” an alternative, but not required • Indirect measures • Piloting, preparing for full implementation in SY 2015 • EXIT SLIPS—questions, priorities, planning for full implementation (to shape the second workshop) • My email dbrady3702@msn.com;

  3. The Steps Necessary to Get Ready for June Report and After

  4. Potential as Transformative ProcessWhen C, I or A is changed….Elmore, Instructional Rounds, and the “task predicts performance”

  5. Goals for Today • Answer your questions and provide tools/materials to support you • ( Use note cards/exit slips) • By the end of this session • You will understand what needs to be done and explain it to your colleagues • You will have tools to begin to do that work in your district • You may still have unique questions— Please ask them! • dbrady3702@msn.com

  6. The DESE Requirements Purpose, timeline, requirements, direct and indirect assessments

  7. District Determined Measures DEFINITION TYPES OF MEASURES Portfolio assessments Approved commercial assessments District developed pre and post unit and course assessments Capstone projects DDMs are defined as: “Measures of student learning, growth, and achievement related to the Curriculum Frameworks, that are comparable across grade or subject level district-wide”

  8. The Role of DDMs (DESE) To provide educators with an opportunity to: Understand student knowledge and learning patterns more clearly Broaden the range of what knowledge and skills are assessed and how learning is assessed Improve educator practice and student learning Provide educators with feedback about their performance with respect to professional practice and student achievement Provide evidence of an educators impact on student learning Bottom Line: This process can be useful

  9. District Determined Measures Regulations Every educator will need data from at least 2 different measures Trends must be measured over a course of at least 2 years One measure must be taken from State-wide testing data such as MCAS if available (grades 4-8 ELA and Math SGP for classroom educators) One measure must be taken from at least one District Determined Measure which can include Galileo, normed assessments (DRA, MAP, SAT)

  10. Timeline for Piloting and Full Implementation 2013-2014 District-wide training, development of assessments and piloting June 2014: Report: All educators in the district have 2 DDMs to be implemented fully in SY2015. 2014-2015 All DDMs are implemented; scores are divided into H-M-and Low and stored locally 2015-2016 Second year data is collected and all educators receive an impact rating that is sent to DESE based on 2 years of data for two DDMs

  11. Impact Rating Performance & Impact Ratings Performance Rating Ratings are obtained through data collected from observations, walk-throughs and artifacts Exemplary Proficient Needs Improvement Unsatisfactory Ratings are based on trends and patterns in student learning, growth and achievement over a period of at least 2 years Data gathered from DDM’s and State-wide testing High Moderate Low

  12. Student Impact Rating Determines Plan Duration for PST (not future employment) Impact Rating on Student Performance Massachusetts Department of Elementary and Secondary Education

  13. What kinds of assessments will work for administrators, guidance, nurses, school psychologists? • Use School-wide Growth Measures • Use MCAS growth measures and extend them to all educators in a school • Use “indirect measures” such as dropout rates, attendance, etc., as measures • Use Student Learning Objectives (SLOs) • Or create measures. • A pre- and post- test are generally required to measure growth except with normed assessments

  14. Indirect Measures • Indirect measures of student learning, growth, or achievement provide information about students from means other than student work. • These measures may include student record information (e.g., grades, attendance or tardiness records, or other data related to student growth or achievement such as high school graduation or college enrollment rates). • To be considered for use as DDMs, a link (relationship) between indirect measures and student growth or achievement must be established. • For some educators such as district administrators and guidance counselors, it may be appropriate to use one indirect measure of student learning along with other direct measures; • ESE recommends that at least one of the measures used to determine each educator’s student impact rating be a direct measure.

  15. Indirect Measure Examples • Consider SST Process for a team: • High school SST team example • Child Study Team example • RTI team example • High school guidance example • Subgroups of students can be studied (School Psychologist group example) • Social-emotional growth is appropriate (Autistic/Behavioral Program example) • Number of times each student says hello to a non-classroom adult on his or her way to gym or class • Number of days (or classes) a student with school anxiety participates • Assess level of participation in a class • Increase the “in-depth” studies of at risk students • Make sure students go through the referral process to decrease the number of students who are unnecessarily assessed • Improve applications to college • IEP goals can be used as long as they are measuring growth (academic or social-emotional)

  16. Table Talk (5 minutes) Using the 6-phase overview, what are your priorities?

  17. Using the 6-phase overview, what are your priorities?

  18. Assessment Quality Requirementsand Definitions from DESE Alignment, Rigor, Comparability, “Substantial,” Modifications

  19. What are the requirements? • 1. Is the measure aligned to content? • Does it assess what is most important for students to learn and be able to do? • Does it assess what the educators intend to teach? • Bottom Line: “substantial” content of course • At least 2 standards • ELA: reading/writing • Math: Unit exam • Not necessarily a “final” exam (unless it’s a high quality exam)

  20. 2. Is the measure informative? • Do the results of the measure inform educators about curriculum, instruction, and practice? • Does it provide valuable information to educators about their students? • Does it provide valuable information to schools and districts about their educators? Bottom Line: Time to analyze is essential

  21. Five Considerations (DESE) • Measure growth • Employ a common administration procedure  • Use a common scoring process • Translate these assessments to an Impact Rating • Assure comparability of assessments (rigor, validity).

  22. Comparability • Comparable within a grade, subject, or course across schools within a district • Identical measures are recommended across a grade, department, course • Comparable across grade or subject level district-wide • Impact Ratings should have a consistent meaning across educators; therefore, DDMs should not have significantly different levels of rigor

  23. Two Considerations for Local DDMs, 1. Comparable across schools • Example: Teachers with the same job (e.g., all 5th grade teachers) • Where possible, measures are identical • Easier to compare identical measures • Do identical measures provide meaningful information about all students? • Exceptions: When might assessments not be identical? • Different content (different sections of Algebra I) • Differences in untested skills (reading and writing on math test for ELL students) • Other accommodations (fewer questions to students who need more time) • NOTE: Roster Verification and Group Size will be considerations by DESE

  24. 2. Comparable across the District • Aligned to your curriculum (comparable content) K-12 in all disciplines • Appropriate for your students • Aligned to your district’s content • Informative, useful to teachers and administrators • “Substantial” Assessments (comparable rigor): • “Substantial” units with multiple standards and/or concepts assessed. (DESE began talking about finals/midterms as preferable recently) See Core Curriculum Objectives (CCOs) on DESE website if you are concerned http://www.doe.mass.edu/edeval/ddm/example/ • Quarterly, benchmarks, mid-terms, and common end of year exams • NOTE: All of this data stays in your district. Only HML goes to DESE with a MEPID for each educator.

  25. RigorAlignment Rigorous Aligned to District curriculum Shifted to new expectations Shifted from MCAS expectations Consider PARCC This is a district decision Gradual increments? Giant steps? • 2011 Massachusetts Frameworks • Common Core Shifts • Complex texts • Complex tasks • Writing to text • Shift in Persuasive Essay (Formal Argument) • Shift in Narrative (More substantial and linked to content) • Shift in Informational Text (organization substantiation) • Math, Science , History/SS frameworks

  26. Writing to Text and PARCC The Next Step? • The 2011 MA Frameworks Shifts to the Common Core • Complex Texts • Complex Tasks • Multiple Texts • Increased Writing A Giant Step? Increase in cognitive load • Mass Model Units—PBL with Performance-Based Assessments (CEPAs) • PARCC assessments require matching multiple texts

  27. Two Forms to Adapt to Your Standards • Handout—DDM Proposal form • Excel file (on wiki) Simple Excel List

  28. June Report Form (Not Yet Released)Educators Linked with DDMs

  29. Handout Sample

  30. Table Talk/Team Talk (10 min) How will you develop quality assessments?

  31. Calculating Growth Scores Defining growth, measuring growth, calculating growth for a classroom, for a district

  32. Sample Student GROWTH SCORES from the MCAS TEACHER GROWTH SCORES are developed from student growth scores 244/ 25 SGP 4503699 230/ 35 SGP 225/ 92 SGP

  33. Approaches to Measuring Student Growth • Pre-Test/Post Test • Repeated Measures • Holistic Evaluation • Post-Test Only

  34. Pre/Post Test • Description: • The same or similar assessments administered at the beginning and at the end of the course or year • Example: Grade 10 ELA writing assessment aligned to College and Career Readiness Standards at beginning and end of year with the passages changed • Measuring Growth: • Difference between pre- and post-test. • Considerations: • Do all students have an equal chance of demonstrating growth?

  35. Pre-Post AnalysisCut Scores for L-M-H Growth

  36. Determining Growth with Pre- and Post Assessments • Cut scores need to be locally determined for local assessments • Standardized assessments use “The Body of the Work” protocol which easily translates to local assessments • First determine the difference between pre- and post- scores for all students in a grade or course • Then determine what Low Moderate and High growth is. (Local cut scores) • Top and bottom 10% to begin as a test case • Body of the Work check • Then all scores are reapportioned to each teacher • The MEDIAN score for each teacher determines that teacher’s growth score

  37. Table/Team Talk Discuss the calculations, security, storage, fairness of determining local cut scores.

  38. Further measures beyond pre- and post- tests Repeated measures, Holistic Rubrics, Post-Test Only

  39. Repeated Measures • Description: • Multiple assessments given throughout the year. • Example: running records, attendance, mile run • Measuring Growth: • Graphically • Ranging from the sophisticated to simple • Considerations: • Less pressure on each administration. • Authentic Tasks

  40. Repeated Measures Example Running Record # of errors Date of Administration

  41. Holistic • Description: • Assess growth across student work collected throughout the year. • Example: Tennessee Arts Growth Measure System • Measuring Growth: • Growth Rubric (see example) • Considerations: • Option for multifaceted performance assessments • Rating can be challenging & time consuming

  42. Holistic Example Example taken from Austin, a first grader from Anser Charter School in Boise, Idaho.  Used with permission from Expeditionary Learning. Learn more about this and other examples at http://elschools.org/student-work/butterfly-drafts

  43. Post-Test Only • Description: • A single assessment or data that is paired with other information • Example: AP exam • Measuring Growth, where possible: • Use a baseline • Assume equal beginning • Considerations: • May be only option for some indirect measures • What is the quality of the baseline information?

  44. Post-Test OnlyA challenge to tabulate growth • Portfolios • Measuring achievement v. growth • Unit Assessments • Looking at growth across a series • Capstone Projects • May be a very strong measure of achievement

  45. “Tools” to Support the Process • For determining what is important (Core Curriculum Objectives) • For determining adequacy for use as DDM (Quality Tool) • For making sure each educator has 2 DDMs (Excel Sheet) • For assessing rigor (Cognitive Complexity Rubric, CEPA Rubric)

  46. Core Curriculum Objectives(CCOs—partial list for Writing to Text)

  47. ELA-Literacy — 9 English 9-12https://wested.app.box.com/s/pt3e203fcjfg9z8r02si Assessment Hudson High School Portfolio Assessment for English Language Arts and Social Studies Publisher Website/Sample Designed to be a measure of student growth over time in high school ELA and social science courses. Student selects work samples to include and uploads them to electronic site. Includes guiding questions for students and scoring criteria. Scoring rubric for portfolio that can be adapted for use in all high school ELA and social science courses. Generalized grading criteria for a portfolio. Could be aligned to a number of CCOs, depending on specification of assignments.

  48. Sample DDMs—Local Digital PortfolioHudson, MA • Buy, Borrow, Build • Each sample DDM is evaluated Hudson’s Evaluation: Designed to be a measure of student growth over time in high school ELA and social science courses. Student selects work samples to include and uploads them to electronic site. Includes guiding questions for students and scoring criteria. Scoring rubric for portfolio that can be adapted for use in all high school ELA and social science courses. Generalized grading criteria for a portfolio. Could be aligned to a number of CCOs, depending on specification of assignments. • Many are standardized assessments

More Related