
Assessment Workshop: Creating and Evaluating High Quality Assessments


Presentation Transcript


  1. Assessment Workshop: Creating and Evaluating High Quality Assessments (Dr. Deborah Brady)

  2. Agenda • Introductions: Overview • Break at about 10, lunch at about 12, session ends at about 3:00 • Morning presentation (with frequent processing breaks) and some afternoon time for beginning to plan • High-Quality Assessments (DESE criteria) • Tools to evaluate assessments • Tools to track all educators’ DDMs • Measuring Student Growth • Direct measures • Local alternatives to determine growth • Pre-/post, holistic rubrics, measures over time, post-test only • “Standardization” is an alternative, but not required • Indirect measures • Piloting, preparing for full implementation in SY 2015 • EXIT SLIPS—questions, priorities, planning for full implementation • My email: dbrady3702@msn.com

  3. The Steps Necessary to Get Ready for the June Report and After

  4. Carousel Walk and Living Likert Scale Carousel Walk • Take a marker for your team (or yourself if you are on your own) • Visit each of the 5 “stages” of DDM development • Put a check to the right of each item that you have completed • After viewing each stage, return to the stage that most represents you and/or your district Living Likert • Go to your “stage” • What are the barriers? What are the strengths that your district has? • Discuss the barriers and strengths with the others at your stage. • Be prepared to report out as a whole group.

  5. Potential as a Transformative Process: When curriculum, instruction, or assessment is changed… (Elmore, Instructional Rounds, and “the task predicts performance”)

  6. Goals for Today • Answer your questions and provide tools/materials to support you (use note cards/exit slips) • By the end of this session • You will understand what needs to be done and be able to explain it to your colleagues • You will have tools to begin to do that work in your district • You may still have unique questions—please ask them! • dbrady3702@msn.com • http://tinyurl.com/lvn5ome (materials from this presentation; this is one page from my Wikispaces site, Writing to Text)

  7. The DESE Requirements: Purpose, timeline, requirements, direct and indirect assessments

  8. District Determined Measures • Definition: DDMs are “measures of student learning, growth, and achievement related to the Curriculum Frameworks, that are comparable across grade or subject level district-wide.” • Types of measures: portfolio assessments; approved commercial assessments; district-developed pre- and post-unit and course assessments; capstone projects

  9. Timeline for Piloting and Full Implementation • 2013-2014: District-wide training, development of assessments, and piloting. June 2014 report: all educators in the district have 2 DDMs to be implemented fully in SY2015. • 2014-2015: All DDMs are implemented; scores are divided into High, Moderate, and Low and stored locally. • 2015-2016: Second-year data is collected, and all educators receive an impact rating, based on 2 years of data for two DDMs, that is sent to DESE.

  10. District Determined Measures Regulations • Every educator will need data from at least 2 different measures • Trends must be measured over the course of at least 2 years • One measure must be taken from state-wide testing data such as MCAS if available (grades 4-8 ELA and Math SGP for classroom educators) • One measure must be taken from at least one District Determined Measure, which can include Galileo or normed assessments (DRA, MAP, SAT)

  11. NEW! Identifying and Selecting DDMs • Establishes a DDM Working Group • Co-chaired by superintendent and president of local bargaining unit or their designees. • Surveys the district for available assessments • Recruits educators to identify assessments and make recommendations • Identifies at least two measures for each educator • Collects feedback on the quality of the DDMs (continuous improvement) • Makes recommendations to the superintendent

  12. NEW! Determining Educator Impact on Each DDM • Evaluator and educator meet. Evaluator determines whether students demonstrated high, moderate, or low growth on each DDM. • Evaluator shares the resulting designations of student growth with the educator. • Educators confirm rosters. • Students must be on the roster by 10/1 and remain on the roster through the last day of testing. • Students must be present for 90% of instructional time.

  13. NEW! Identifying and Selecting DDMs • Describes process for selecting DDMs • Working group makes recommendations to the superintendent. • If superintendent declines, expedited resolution process is triggered: • Parties petition the Commissioner • Commissioner forwards list of 3 hearing officers with curriculum and/or assessment expertise • Parties choose hearing officer • Hearing officer renders final decision • Educators are informed of their DDMs by fourth week of school. • Educators receive appropriate professional development.

  14. Performance & Impact Ratings • Performance Rating: obtained through data collected from observations, walk-throughs, and artifacts; levels are Exemplary, Proficient, Needs Improvement, Unsatisfactory. • Impact Rating: based on trends and patterns in student learning, growth, and achievement over a period of at least 2 years, with data gathered from DDMs and state-wide testing; levels are High, Moderate, Low.

  15. NEW! Determining a Student Impact Rating • Introduces the application of professional judgment to determine the Student Impact Rating • Evaluator assigns the rating using professional judgment. • Evaluator considers designations of high, moderate, or low student growth from at least two measures in each of at least two years. • If the rating is low, the evaluator meets with the educator to discuss it. • If the rating is moderate or high, the evaluator and educator decide whether a meeting is necessary.

  16. Student Impact Rating Determines Plan Duration for PST (not future employment). Chart: “Impact Rating on Student Performance,” Massachusetts Department of Elementary and Secondary Education.

  17. NEW! Intersection of Ratings • Reinforces independent nature of the two ratings. • Exemplary or Proficient matched with Moderate or High = 2-Year Self-Directed Growth Plan • Exemplary/Moderate and Exemplary/High = recognition and rewards, including leadership roles, promotions, additional compensation, public commendation, and other acknowledgements. • Proficient/Moderate and Proficient/High = eligible for additional roles, responsibilities, and compensation. • Exemplary or Proficient matched with Low = 1-Year Self-Directed Growth Plan • Evaluator’s supervisor confirms rating. • Educator and evaluator analyze the discrepancy. • May impact Educator Plan goals. • Student Impact Rating informs the self-assessment and goal setting processes.
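The combinations listed on this slide amount to a small decision table. As a minimal sketch only (it covers just the combinations named above; the function name and the wording of the returned strings are illustrative, not DESE language), the mapping could be written like this:

```python
# Illustrative decision table for the rating combinations named on this slide.
# Not an official DESE rule set; combinations not listed above are left undefined.
def plan_for(performance: str, impact: str) -> str:
    if performance in ("Exemplary", "Proficient"):
        if impact in ("Moderate", "High"):
            return "2-Year Self-Directed Growth Plan"
        if impact == "Low":
            return "1-Year Self-Directed Growth Plan (supervisor confirms rating)"
    return "Combination not covered on this slide"

print(plan_for("Proficient", "High"))  # -> 2-Year Self-Directed Growth Plan
print(plan_for("Exemplary", "Low"))    # -> 1-Year Self-Directed Growth Plan (supervisor confirms rating)
```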

  18. Indirect Measures • Indirect measures of student learning, growth, or achievement provide information about students from means other than student work. • These measures may include student record information (e.g., grades, attendance or tardiness records, or other data related to student growth or achievement such as high school graduation or college enrollment rates). • To be considered for use as DDMs, a link (relationship) between indirect measures and student growth or achievement must be established. • For some educators such as district administrators and guidance counselors, it may be appropriate to use one indirect measure of student learning along with other direct measures; • ESE recommends that at least one of the measures used to determine each educator’s student impact rating be a direct measure.

  19. Indirect Measure Examples • Consider SST Process for a team: • High school SST team example • Child Study Team example • RTI team example • High school guidance example • Subgroups of students can be studied (School Psychologist group example) • Social-emotional growth is appropriate (Autistic/Behavioral Program example) • Number of times each student says hello to a non-classroom adult on his or her way to gym or class (Direct) • Number of days (or classes) a student with school anxiety participates • Assess level of participation in a class (Direct) • Increase the “in-depth” studies of at risk students • Make sure students go through the referral process to decrease the number of students who are unnecessarily assessed • Improve applications to college • IEP goals can be used as long as they are measuring growth (academic or social-emotional)

  20. Table Talk (5 minutes) Using the 6-phase overview, what are your priorities?

  21. Using the 6-phase overview, what are your priorities?

  22. Assessment Quality Requirements and Definitions from DESE: Alignment, Rigor, Comparability, “Substantial,” Modifications

  23. What are the requirements? • 1. Is the measure aligned to content? • Does it assess what is most important for students to learn and be able to do? • Does it assess what the educators intend to teach? • Bottom line: “substantial” content of the course • At least 2 standards • ELA: reading/writing • Math: unit exam • Not necessarily a “final” exam (unless it’s a high-quality exam)

  24. 2. Is the measure informative? • Do the results of the measure inform educators about curriculum, instruction, and practice? • Does it provide valuable information to educators about their students? • Does it provide valuable information to schools and districts about their educators? Bottom Line: Time to analyze is essential

  25. Five Considerations (DESE) • Measure growth • Employ a common administration procedure  • Use a common scoring process • Translate these assessments to an Impact Rating • Assure comparability of assessments (rigor, validity).

  26. Comparability • Comparable within a grade, subject, or course across schools within a district • Identical measures are recommended across a grade, department, course • Comparable across grade or subject level district-wide • Impact Ratings should have a consistent meaning across educators; therefore, DDMs should not have significantly different levels of rigor

  27. Two Considerations for Local DDMs: 1. Comparable across Schools • Example: Teachers with the same job (e.g., all 5th grade teachers) • Where possible, measures are identical • Easier to compare identical measures • Do identical measures provide meaningful information about all students? • Exceptions: When might assessments not be identical? • Different content (different sections of Algebra I) • Differences in untested skills (reading and writing on a math test for ELL students) • Other accommodations (fewer questions to students who need more time) • NOTE: Roster verification and group size will be considerations for DESE

  28. 2. Comparable across the District • Aligned to your curriculum (comparable content) K-12 in all disciplines • Appropriate for your students • Aligned to your district’s content • Informative, useful to teachers and administrators • “Substantial” assessments (comparable rigor): “substantial” units with multiple standards and/or concepts assessed. (DESE recently began talking about finals/midterms as preferable.) See the Core Curriculum Objectives (CCOs) on the DESE website if you are concerned: http://www.doe.mass.edu/edeval/ddm/example/ • Quarterly assessments, benchmarks, mid-terms, and common end-of-year exams • NOTE: All of this data stays in your district. Only the High/Moderate/Low rating goes to DESE, with a MEPID for each educator.

  29. Rigor and Alignment • Rigorous, aligned to the district curriculum, shifted to new expectations, shifted from MCAS expectations • Consider PARCC • This is a district decision: gradual increments? Giant steps? • 2011 Massachusetts Frameworks / Common Core shifts: • Complex texts • Complex tasks • Writing to text • Shift in persuasive essay (formal argument) • Shift in narrative (more substantial and linked to content) • Shift in informational text (organization, substantiation) • Math, Science, History/SS frameworks

  30. Writing to Text and PARCC: The Next Step? • The 2011 MA Frameworks shifts to the Common Core • Complex texts • Complex tasks • Multiple texts • Increased writing • A giant step? Increase in cognitive load • Mass Model Units—PBL with Performance-Based Assessments (CEPAs) • PARCC assessments require matching multiple texts

  31. Two Forms to Adapt to Your Standards • Handout—DDM Proposal form • Excel file (on the wiki): simple Excel list

  32. June Report Form (Not Yet Released): Educators Linked with DDMs

  33. Handout Sample

  34. Table Talk/Team Talk (10 min) How will you develop quality assessments?

  35. Calculating Growth Scores: Defining growth, measuring growth, and calculating growth for a classroom and for a district

  36. Sample Student Growth Scores from the MCAS • Teacher growth scores are developed from student growth scores. • Sample report excerpt: scaled scores of 244, 230, and 225 with SGPs of 25, 35, and 92, respectively.

  37. Approaches to Measuring Student Growth • Pre-Test/Post Test • Repeated Measures • Holistic Evaluation • Post-Test Only

  38. Pre/Post Test • Description: • The same or similar assessments administered at the beginning and at the end of the course or year • Example: Grade 10 ELA writing assessment aligned to College and Career Readiness Standards at beginning and end of year with the passages changed • Measuring Growth: • Difference between pre- and post-test. • Considerations: • Do all students have an equal chance of demonstrating growth?

  39. Pre-Post Analysis: Cut Scores for L-M-H Growth

  40. Determining Growth with Pre- and Post-Assessments • Cut scores need to be locally determined for local assessments • Standardized assessments use the “Body of the Work” protocol, which translates easily to local assessments • First, determine the difference between pre- and post-scores for all students in a grade or course • Then determine what Low, Moderate, and High growth are (local cut scores) • Top and bottom 10% to begin as a test case • Body of the Work check • Then all scores are reapportioned to each teacher • The MEDIAN score for each teacher determines that teacher’s growth score
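The procedure above (pre/post differences, local cut scores, and a per-teacher median) is straightforward to prototype. The sketch below is illustrative only: the teacher names, scores, and cut values are invented, and a district would set its actual cut scores through its own Body of the Work review.

```python
from statistics import median

# Hypothetical (teacher, pre-test, post-test) records; names and scores are invented.
records = [
    ("Teacher A", 42, 61), ("Teacher A", 50, 68), ("Teacher A", 55, 60),
    ("Teacher B", 38, 44), ("Teacher B", 47, 71), ("Teacher B", 52, 80),
]

# 1. Difference between pre- and post-scores for every student in the grade or course.
growth = [(teacher, post - pre) for teacher, pre, post in records]

# 2. Locally determined cut scores separating Low / Moderate / High growth.
#    Placeholder values; a district would set these from a review of actual results.
LOW_CUT, HIGH_CUT = 10, 22

def growth_label(score: float) -> str:
    if score < LOW_CUT:
        return "Low"
    if score > HIGH_CUT:
        return "High"
    return "Moderate"

# 3. Reapportion scores to each teacher; the MEDIAN growth score determines
#    that teacher's growth designation.
for teacher in sorted({t for t, _ in growth}):
    scores = [g for t, g in growth if t == teacher]
    med = median(scores)
    print(f"{teacher}: median growth {med} -> {growth_label(med)}")
```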

  41. Further Measures beyond Pre- and Post-Tests: Repeated measures, holistic rubrics, post-test only

  42. Repeated Measures • Description: • Multiple assessments given throughout the year. • Example: running records, attendance, mile run • Measuring Growth: • Graphically • Ranging from the sophisticated to the simple (one simple way to quantify growth is sketched after the example below) • Considerations: • Less pressure on each administration. • Authentic tasks

  43. Repeated Measures Example • Running record chart plotting the number of errors against the date of administration.
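One simple way to turn a repeated-measures series like the running record above into a single growth number is a least-squares slope across administrations. The sketch below is illustrative only: it assumes equally spaced administrations and uses invented error counts.

```python
# Minimal sketch: summarize a repeated measure (running-record errors) with a
# least-squares slope. Assumes equally spaced administrations; data are invented.
errors = [14, 12, 11, 9, 7, 6]           # errors recorded at each administration
x = list(range(1, len(errors) + 1))       # administration index 1..n

n = len(errors)
mean_x = sum(x) / n
mean_y = sum(errors) / n
slope = (
    sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, errors))
    / sum((xi - mean_x) ** 2 for xi in x)
)

# A negative slope means errors are falling across the year, i.e., the student is growing.
print(f"Average change per administration: {slope:.2f} errors")
```

A falling error count yields a negative slope, which a district could then band into Low, Moderate, or High growth with locally set cut points, as in the pre/post example earlier.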

  44. Holistic • Description: • Assess growth across student work collected throughout the year. • Example: Tennessee Arts Growth Measure System • Measuring Growth: • Growth Rubric (see example) • Considerations: • Option for multifaceted performance assessments • Rating can be challenging & time consuming

  45. Holistic Example Example taken from Austin, a first grader from Anser Charter School in Boise, Idaho.  Used with permission from Expeditionary Learning. Learn more about this and other examples at http://elschools.org/student-work/butterfly-drafts

  46. Post-Test Only • Description: • A single assessment or data that is paired with other information • Example: AP exam • Measuring Growth, where possible: • Use a baseline • Assume equal beginning • Considerations: • May be only option for some indirect measures • What is the quality of the baseline information?

  47. Post-Test Only: A Challenge to Tabulate Growth • Portfolios • Measuring achievement vs. growth • Unit assessments • Looking at growth across a series • Capstone projects • May be a very strong measure of achievement

  48. Table/Team Talk Discuss the calculations, security, storage, and fairness of determining local cut scores.

  49. “Tools” to Support the Process • For determining what is important (Core Curriculum Objectives) • For determining adequacy for use as DDM (Quality Tool) • For making sure each educator has 2 DDMs (Excel Sheet) • For assessing rigor (Cognitive Complexity Rubric, CEPA Rubric)
