National Origin or National Location

Ogan et al. (2015) [https://link.springer.com/content/pdf/10.1007/s40593-014-0034-8.pdf pdf]

* Multi-national models predicting learning gains from students' help-seeking behavior
* Models built on U.S.-only or combined data sets performed extremely poorly for Costa Rica
* Models generally performed better when built on and applied to data from the same country, except for the Philippines, where the model built on that country's data was slightly outperformed by the model built on U.S. data (see the evaluation sketch below)
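
A minimal sketch of this kind of cross-country evaluation, assuming hypothetical per-country data frames with help-seeking features and a learning-gain label (the column names and the linear model are illustrative assumptions, not Ogan et al.'s actual pipeline):

<syntaxhighlight lang="python">
# Illustrative sketch: train a learning-gain model on one country's data,
# test it on every country, and compare against within-country baselines.
# Feature and column names are hypothetical, not Ogan et al.'s pipeline.
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error

FEATURES = ["hint_requests", "time_per_step", "errors_before_hint"]  # hypothetical
TARGET = "learning_gain"

def transfer_error(train_df: pd.DataFrame, test_df: pd.DataFrame) -> float:
    """Fit on one country's data, report MAE on another country's data."""
    model = LinearRegression().fit(train_df[FEATURES], train_df[TARGET])
    preds = model.predict(test_df[FEATURES])
    return mean_absolute_error(test_df[TARGET], preds)

def cross_country_grid(data_by_country: dict[str, pd.DataFrame]) -> pd.DataFrame:
    """Rows = training country, columns = test country; cells = MAE."""
    countries = list(data_by_country)
    grid = pd.DataFrame(index=countries, columns=countries, dtype=float)
    for train_c in countries:
        for test_c in countries:
            grid.loc[train_c, test_c] = transfer_error(
                data_by_country[train_c], data_by_country[test_c]
            )
    # Large off-diagonal cells relative to the diagonal signal poor transfer
    # across countries (e.g., a U.S.-trained model applied to Costa Rica).
    return grid
</syntaxhighlight>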


Li et al. (2021) pdf

* Model predicting student achievement on the PISA standardized examination
* The U.S.-trained model was less accurate for students from countries with lower national development scores (e.g. Indonesia, Vietnam, Moldova); a sketch of this comparison follows below
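
A minimal sketch of comparing a U.S.-trained model's error across levels of national development, assuming a hypothetical data frame with a country label, a development index such as HDI, the actual PISA score, and the model's prediction (the column names and the HDI cutoff are assumptions, not Li et al.'s method):

<syntaxhighlight lang="python">
# Illustrative sketch: mean absolute error of a U.S.-trained model, grouped
# by a national development index. Column names and the 0.8 HDI cutoff are
# assumptions for illustration, not Li et al.'s exact analysis.
import pandas as pd

def error_by_development(df: pd.DataFrame, hdi_cutoff: float = 0.8) -> pd.DataFrame:
    """df needs columns: country, hdi, pisa_score, predicted_score."""
    df = df.assign(
        abs_error=(df["pisa_score"] - df["predicted_score"]).abs(),
        development=df["hdi"].apply(lambda h: "higher" if h >= hdi_cutoff else "lower"),
    )
    # Per-country MAE first, then summarized within each development group:
    # a higher mean for the "lower" group mirrors the reported pattern.
    per_country = df.groupby(["development", "country"])["abs_error"].mean()
    return per_country.groupby("development").agg(["mean", "std"])
</syntaxhighlight>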


Wang et al. (2018) pdf

* Automated scoring model (SpeechRater) for evaluating English spoken responses
* SpeechRater gave significantly lower scores than human raters to German test-takers
* SpeechRater gave higher scores than human raters to Chinese test-takers, with H1 (human) rater scores above the mean (see the sketch below)
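
A minimal sketch of the machine-versus-human comparison behind findings like these, assuming a hypothetical data frame with a first-language group, a SpeechRater score, and an H1 (human) rating per response; the column names are assumptions, not ETS's evaluation code:

<syntaxhighlight lang="python">
# Illustrative sketch: how far automated speech scores depart from human
# ratings for each first-language group, as a standardized mean difference.
# Column names are hypothetical; this is not SpeechRater's actual evaluation.
import pandas as pd

def machine_human_gap(df: pd.DataFrame) -> pd.DataFrame:
    """df needs columns: l1_group, machine_score, human_score (H1 rating)."""
    diff = df["machine_score"] - df["human_score"]
    pooled_sd = diff.std(ddof=1)
    out = (
        df.assign(score_diff=diff)
        .groupby("l1_group")["score_diff"]
        .agg(mean_diff="mean", n="size")
    )
    # Positive values: machine scores run higher than human scores (as reported
    # for Chinese test-takers); negative values: lower (as for German test-takers).
    out["standardized_diff"] = out["mean_diff"] / pooled_sd
    return out
</syntaxhighlight>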


Bridgeman et al. (2009) [https://www.researchgate.net/publication/242203403_Considering_Fairness_and_Validity_in_Evaluating_Automated_Scoring page]

* Automated scoring model (e-rater) for evaluating English essays
* E-rater gave significantly higher scores than human raters for TOEFL essays (independent task) written by Chinese and Korean speakers
* E-rater correlated poorly with human raters and gave higher scores than human raters for GRE essays (both issue and argument prompts) written by Chinese speakers (see the agreement sketch below)
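
A minimal sketch of per-country agreement between automated and human essay scores, assuming a hypothetical data frame with a country label and integer human and e-rater scores (the column names and metrics are illustrative, not Bridgeman et al.'s analysis):

<syntaxhighlight lang="python">
# Illustrative sketch: agreement and score gap between machine and human
# essay ratings, computed per country. Column names are hypothetical.
import pandas as pd
from sklearn.metrics import cohen_kappa_score

def agreement_by_country(df: pd.DataFrame) -> pd.DataFrame:
    """df needs columns: country, human_score, machine_score (integer scales)."""
    rows = []
    for country, grp in df.groupby("country"):
        rows.append({
            "country": country,
            # Low correlation/kappa flags poor machine-human agreement;
            # a positive mean gap flags machine scores running high.
            "pearson_r": grp["human_score"].corr(grp["machine_score"]),
            "qwk": cohen_kappa_score(
                grp["human_score"], grp["machine_score"], weights="quadratic"
            ),
            "mean_gap": (grp["machine_score"] - grp["human_score"]).mean(),
        })
    return pd.DataFrame(rows).set_index("country")
</syntaxhighlight>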


Bridgeman et al. (2012) [https://www.tandfonline.com/doi/pdf/10.1080/08957347.2012.635502?needAccess=true pdf]

* A later version of the e-rater automated essay scoring model
* E-rater gave higher scores than human raters to Chinese speakers (Mainland China, Taiwan, Hong Kong) and Korean speakers on the TOEFL independent-prompt essay
* E-rater gave lower scores than human raters to Arabic, Hindi, and Spanish speakers on the TOEFL independent-prompt essay