Criteria of Tests

  • Validity
  • Reliability
  • Power/Difficulty
  • Discrimination
  • Practicality
  • Backwash effects

Validity

The validity of a test is the extent to which it measures what it is supposed to measure and nothing else. In other words, validity asks whether a test examines what its designer intended to examine, and to what degree it does so.

Discuss the following items:

  • "Is photography an art or a science?" Discuss.
  • "The mind is its own place, and in itself can make a Heaven of Hell, a Hell of Heaven." (Milton) Discuss.
  • Use the following words in sentences: courageous, choosy, acceptable, complicated, etc.
      A. John is a very courageous boy.
      B. John, the captain of our team, is courageous.
      C. I have a courageous father.

Factors of validity

  • Face validity
  • Content validity
  • Construct validity
  • Empirical validity
      • Concurrent validity
      • Predictive validity

Face validity

If a test item looks right to other testers, teachers, moderators, and testees, it can be described as having at least face validity. Face validity refers to the surface credibility of a test, or its acceptability to the public. Zou Shen (鄒申): a test has face validity when it appears to test the intended skill or ability. (Testing pronunciation and intonation with a written paper, for example, has low face validity.)

Content validity

A test is said to have content validity if its content constitutes a representative sample of the language skills, structures, etc. with which it is meant to be concerned. Content validity asks whether the test covers what the syllabus specifies, or to what degree the test items represent the objectives they are meant to measure.

  (1) Is the content of the test related to its objective or purpose?
  (2) Are the test items representative?
  (3) Is the content appropriate for the testees?

Construct validity

If a test has construct validity, it is capable of measuring certain specific characteristics in accordance with a theory of language behavior and learning. Construct validity asks whether the test rests on a valid view of language (including views of language learning and of language use). "Construct" here does not mean the layout of the paper or the ordering of its items; it means the theoretical basis of the whole test.

Empirical validity

This validity is obtained by comparing the results of the test with the results of some criterion measure. Empirical (statistical) validity can be divided into concurrent validity and predictive validity.

Concurrent validity

If the results of the test are compared with the results of some criterion measure, such as

  • an existing test, known or believed to be valid, given at the same time; or
  • the teachers' ratings, or any other such form of independent assessment, given at the same time,

then the results obtained by either method are measures of the test's concurrent validity in respect of the particular criterion used. In other words, concurrent validity is established when the test and the criterion are administered at about the same time.

Concurrent validity is the coefficient obtained by comparing the results of one test with those of another test given at the same time or close to it, or with the teachers' assessment of the students. For example, compare the scores on a final examination with the scores on a Band 4 examination that has just been held: if the scores are similar, the final examination has high concurrent validity. (Premise: the Band 4 examination itself is highly valid.)

Predictive validity

If the results of the test are compared with the results of some criterion measure, such as

  • the subsequent performance of the testees on a certain task, measured by some valid test; or
  • the teachers' ratings, or any other such form of independent assessment, given later,

then the results obtained by either method are measures of the test's predictive validity in respect of the particular criterion used. In other words, predictive validity concerns the degree to which a test can predict the testees' future performance or success.

Predictive validity concerns the predictive power of a test: to what degree its results can forecast what is likely to happen later, or whether the test is able to predict students' future performance or results.

Reliability

A test is said to be reliable if it is consistent in its measurements. Reliability refers to the dependability and stability of test results. For example, if the same paper is administered to the same group of students two or more times and the results are highly consistent, the test has high reliability.

Methods of checking test reliability

  • Test/retest method
  • Split-half method
  • Parallel forms method

Test/retest method

This method re-administers the same test after a lapse of time. It is often impracticable, since certain students will benefit more than others from familiarity with the type and format of the test. Moreover, in addition to changes in performance resulting from the memory factor, personal factors such as motivation and differential maturation will also account for differences in the performances of certain students.

Split-half method

This method estimates a different kind of reliability from that estimated by the test/retest procedure. It is based on the principle that, if an accurate measuring instrument were broken into two equal parts, the measurements obtained with one part would correspond exactly to those obtained with the other.

Parallel forms method

This method administers parallel forms of the test to the same group. It assumes that two similar versions of a particular test can be constructed: such tests must be identical in the nature of their sampling, difficulty, length, rubrics, etc. Only after a full statistical analysis of the tests and all the items contained in them can the tests safely be regarded as parallel. If the correlation between the two tests is high, then the tests can be termed reliable.

Factors affecting test reliability

  • Number of items
  • Nature of the items
  • Item discrimination
  • Score distribution
  • Item difficulty
  • Objectivity of scoring
  • Testing time

Power/Difficulty

Difficulty refers to how hard or easy each item in a test is. To judge the quality of a paper, besides the two key indicators of reliability and validity, one must also examine the index of difficulty (facility value) of its items.

Formula for the difficulty value

The difficulty of an item is usually denoted P; P is in fact the proportion of candidates who answer the item correctly. Suppose there are 10 candidates and 8 of them answer a given item correctly; the difficulty value of that item is:

  P = 8 / 10 = 0.8

Formula for subjective items

Suppose a writing task carries full marks of 20 and the mean score of all candidates on it is 16; the difficulty value of that task is:

  P = 16 / 20 = 0.8

[Figure: normal distribution curve]

Discrimination

The discrimination of a test is its capability to discriminate among the different candidates and to reflect the differences in the performance of the individuals in the group. Item discrimination is the degree to which an item distinguishes between candidates of differing ability.

Methods of calculating item discrimination

  • Formula method
  • Point-biserial correlation coefficient method
  • Biserial correlation coefficient method

Practicality

A good test is practical. It is within the means of financial limitations, time constraints, ease of administration, and scoring and interpretation. Practicality refers to whether a test is convenient to use and feasible to administer.

Factors affecting practicality

  • the length of time available for the administration of the test
  • the answer sheet and the stationery used
  • the test situation
  • the necessary e
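The difficulty value and the point-biserial method named under Power/Difficulty and Discrimination can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, the total scores are invented, and the point-biserial coefficient is computed here as the ordinary Pearson correlation between a right/wrong (1/0) item score and each candidate's total score.

```python
from math import sqrt


def difficulty_objective(correct_count, candidate_count):
    """Difficulty value P of a right/wrong item: proportion answering correctly."""
    return correct_count / candidate_count


def difficulty_subjective(mean_score, full_marks):
    """Difficulty value P of a subjectively scored item: mean score / full marks."""
    return mean_score / full_marks


def pearson(xs, ys):
    """Pearson correlation of two equal-length score lists.

    Raises ZeroDivisionError if either list has no variation at all.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def point_biserial(item_scores, total_scores):
    """Discrimination of one 0/1 item: its correlation with the total score."""
    return pearson(item_scores, total_scores)


# The two worked cases from the text: 8 of 10 candidates correct,
# and a 20-mark writing task with a mean score of 16.
item = [1, 1, 1, 1, 1, 1, 1, 1, 0, 0]
p_objective = difficulty_objective(sum(item), len(item))
p_subjective = difficulty_subjective(16, 20)

# Invented total scores for the same 10 candidates (assumption, not source data).
totals = [95, 90, 88, 85, 80, 78, 75, 70, 55, 50]
r_pb = point_biserial(item, totals)

print(p_objective, p_subjective, round(r_pb, 2))
```

Both difficulty values come out at 0.8, matching the worked figures. In this invented data set the two candidates who miss the item also have the lowest totals, so the coefficient is positive and the item separates stronger candidates from weaker ones.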

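The split-half and parallel forms methods described under Reliability both come down to correlating two sets of scores from the same group. The sketch below shows one way to do that. The data are invented, the function names are hypothetical, and splitting the paper into odd-numbered and even-numbered items is an assumption: the text only requires "two equal parts".

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length score lists.

    Raises ZeroDivisionError if either list has no variation at all.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def split_half_correlation(responses):
    """Correlate each candidate's score on the two halves of one test.

    responses: one list of item scores per candidate.
    Odd/even splitting is an assumed way of forming two equal parts.
    """
    odd_half = [sum(row[0::2]) for row in responses]   # items 1, 3, 5, ...
    even_half = [sum(row[1::2]) for row in responses]  # items 2, 4, 6, ...
    return pearson(odd_half, even_half)


def parallel_forms_correlation(form_a_scores, form_b_scores):
    """Correlate the same group's scores on two parallel forms of a test."""
    return pearson(form_a_scores, form_b_scores)


# Invented data: 5 candidates answering a 6-item test (1 = right, 0 = wrong).
responses = [
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 0],
    [1, 1, 1, 0, 1, 0],
    [1, 0, 1, 0, 0, 0],
    [0, 0, 1, 0, 0, 0],
]

# Invented total scores of the same 5 candidates on two parallel forms.
form_a = [78, 85, 62, 90, 70]
form_b = [80, 83, 65, 92, 68]

r_half = split_half_correlation(responses)
r_parallel = parallel_forms_correlation(form_a, form_b)

print(round(r_half, 2), round(r_parallel, 2))
```

The same `pearson` helper would serve for the test/retest method (first sitting against second sitting) and for a concurrent validity coefficient (test scores against a criterion measure). The text gives no cut-off for how high a correlation must be, so none is assumed here.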