Stage 9-1 Level View

Primary 4 Benchmark Scores

Per-level benchmark view grouped by generation model and subject. Scores are derived from the Stage 9-0 evaluator reports without changing the underlying scoring algorithm.

89 reports, 5 subjects, 2 LLMs. Updated 2026-06-02 12:01:27 UTC. Refreshes every 5 min.
Evaluator: Gemma 4 26B A4B (google/gemma-4-26b-a4b-it)

Showing all Primary 4 subjects.

LLM Summary

Average scores grouped by the model that generated the Primary 4 content.

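| Generation Model | Artifacts | Overall | Missing Images | Language | Syllabus | Answers | Notation | Timing |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Claude Sonnet 4 | 5 | 8.3 | 1 | 8.2 | 8.2 | 0.0 | 9.5 | - |
| Legacy generator | 84 | 8.7 | 39 | 9.5 | 8.7 | 8.9 | 9.4 | 9.2 |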
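
The grouped averages can be reproduced from the per-artifact rows. The following is a minimal sketch, not the dashboard's actual code: the `summarize` helper and the row dictionaries are hypothetical, and it assumes that "-" cells are skipped rather than counted as zero. That assumption is at least consistent with the Claude Sonnet 4 row, where the Notation average of 9.5 matches the two cheatsheets that carry a Notation score. The values are copied from the five Claude Sonnet 4 cheatsheet rows in the detailed table.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical row layout; values copied from the five Claude Sonnet 4
# cheatsheet rows in the detailed table. None stands in for a "-" cell.
rows = [
    {"model": "Claude Sonnet 4", "overall": 5.0, "language": 3.0, "notation": None},
    {"model": "Claude Sonnet 4", "overall": 9.5, "language": 9.5, "notation": None},
    {"model": "Claude Sonnet 4", "overall": 8.9, "language": 8.5, "notation": 10.0},
    {"model": "Claude Sonnet 4", "overall": 8.3, "language": 10.0, "notation": 9.0},
    {"model": "Claude Sonnet 4", "overall": 9.8, "language": 10.0, "notation": None},
]


def summarize(rows, group_key, metrics):
    """Average each metric per group, skipping missing (None) values."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[group_key]].append(row)

    summary = {}
    for name, members in groups.items():
        entry = {"artifacts": len(members)}
        for metric in metrics:
            values = [m[metric] for m in members if m[metric] is not None]
            # A metric with no scored artifacts stays missing (shown as "-").
            entry[metric] = round(mean(values), 1) if values else None
        summary[name] = entry
    return summary


summary = summarize(rows, "model", ["overall", "language", "notation"])
print(summary["Claude Sonnet 4"])
```

Under these assumptions the sketch yields 5 artifacts with Overall 8.3, Language 8.2, and Notation 9.5, which agrees with the Claude Sonnet 4 row above.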

Subject Summary

Average scores grouped by subject inside Primary 4.

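| Subject | Artifacts | Overall | Missing Images | Language | Syllabus | Answers | Notation | Timing |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Chinese | 33 | 8.7 | 20 | 9.1 | 8.5 | 8.9 | 10.0 | 9.0 |
| English | 13 | 9.1 | 3 | 9.7 | 9.5 | 8.6 | 10.0 | 9.5 |
| Higher Chinese | 8 | 8.5 | 2 | 8.9 | 8.6 | 9.1 | 10.0 | 9.6 |
| Mathematics | 20 | 8.8 | 6 | 9.9 | 9.6 | 8.8 | 8.1 | 9.6 |
| Science | 15 | 8.2 | 9 | 9.4 | 7.4 | 8.3 | 10.0 | 8.9 |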

LLM by Content Type

Model scores split by quizzes, papers, cheatsheets, and parent guides.

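| Generation Model | Type | Artifacts | Overall | Missing Images | Language | Syllabus | Answers | Notation | Timing |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Claude Sonnet 4 | Cheatsheet | 5 | 8.3 | 1 | 8.2 | 8.2 | 0.0 | 9.5 | - |
| Legacy generator | Paper | 25 | 9.1 | 18 | 9.4 | 8.9 | 8.9 | 10.0 | 8.8 |
| Legacy generator | Parents Guide | 5 | 9.2 | 0 | 9.7 | 8.7 | - | - | 8.5 |
| Legacy generator | Quiz | 54 | 8.5 | 21 | 9.5 | 8.6 | 8.9 | 9.1 | 9.5 |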

Needs Review: Scores Below 8.0

Artifacts with overall benchmark scores below 8.0 for the current level view.

| Overall | Model | Subject | Type | Stage | Topic / Paper | Language | Syllabus | Template | Answers | Notation | Timing | Comments |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 5.0 | Claude Sonnet 4 | Chinese | Cheatsheet | 2-7 | cheatsheet | 3.0 | 2.0 | - | - | - | - | The content is highly inappropriate for Primary 4. It covers advanced topics like 'Classical Chinese (文言文)', 'Literary Appreciation (文学鉴赏)', and 'Argumentative Writing (议论文)', which are secondary school level concepts. The vocabulary and cognitive demands (e.g., critical thinking, logical rebuttal) far exceed the P4 MOE syllabus. While the three-point summary structure is good, the subject matter is wrong for the age group. |
| 6.2 | Legacy generator | Chinese | Quiz | 5-1 | reading | 6.0 | 5.0 | 4.0 | 9.0 | - | 10.0 | The content is far too simple for Primary 4; it reads like a Primary 1 or 2 level text. The vocabulary and sentence structures lack the complexity required by the P4 syllabus. There is a major discrepancy between the header score (15) and the actual total score (25). The question format does not reflect the standard Singapore P4 Chinese comprehension paper style. |
| 6.4 | Legacy generator | Science | Quiz | 5-1 | systems | 9.0 | 3.0 | 5.0 | 8.0 | 10.0 | 7.0 | Major syllabus misalignment: The quiz covers Respiratory, Circulatory, and Skeletal systems, which are explicitly excluded from the P4 syllabus (noted as P5/P6 topics). The content is far too advanced for P4. Additionally, there is a math error: the header says 15 marks, but the total is 25. The question format lacks the standard Singapore Science paper structure (Section A is usually MCQ, Section B is structured, but the complexity here is more suited to upper primary). |
| 6.6 | Legacy generator | Science | Quiz | 5-1 | light | 8.0 | 2.0 | 5.0 | 9.0 | 10.0 | 10.0 | Major syllabus misalignment: The quiz covers refraction, the law of reflection, prisms, and spectrums, all of which are explicitly excluded from the P4 syllabus. The content is more aligned with P5/P6 or secondary school. Additionally, the total marks in the header (15) do not match the actual total (25). Missing diagrams for refraction and shadow questions. |
| 7.1 | Legacy generator | Mathematics | Quiz | 5-1 | data-analysis | 9.0 | 4.0 | 5.0 | 9.0 | 10.0 | 10.0 | Major syllabus misalignment: 'Average' (Mean) is a Primary 5/6 concept in Singapore, not Primary 4. P4 Data Analysis focuses on reading/interpreting tables and bar graphs, not calculating means. The quiz is also missing the actual bar graphs/line graphs described in the text, making it rely on text-based descriptions which is not standard for this topic. Total marks in header (15) contradicts the marking scheme (25). |
| 7.2 | Legacy generator | Higher Chinese | Quiz | 5-1 | reading | 8.0 | 7.0 | 6.0 | 9.0 | - | 9.0 | The content is significantly below the expected rigor for Higher Chinese P4. The passage is extremely simple, and most questions are direct literal retrieval rather than the inference and analysis required by the syllabus. There is a major scoring discrepancy: the header says 15 marks, but the breakdown and total sum to 25 marks. The exam format lacks standard instructions and time allocation. |
| 7.2 | Legacy generator | Science | Quiz | 5-1 | magnets | 9.0 | 6.0 | 5.0 | 8.0 | 10.0 | 9.0 | Syllabus mismatch: Magnets are not part of the P4 MOE Singapore Science syllabus (they are P5/P6 topics). Content difficulty is too high for P4; concepts like magnetic field lines, electromagnets, and magnetic induction (stroking a nail) are beyond the P4 level. Exam format is poor: total marks in header (15) contradicts the actual total (25), and it lacks standard MOE Section A/B structure. Missing diagrams for magnetic field and pole positioning questions. |
| 7.2 | Legacy generator | Science | Quiz | 5-1 | materials | 9.0 | 4.0 | 5.0 | 8.0 | 10.0 | 9.0 | Major syllabus misalignment: The quiz covers physical/chemical changes, dissolving, and separation techniques (filtration, evaporation), which are Primary 5/6 topics in the Singapore MOE syllabus. P4 Science focuses on states of matter, light, and heat. Additionally, the question on particle models (Q4, Q10) is too advanced for the P4 level. The total marks in the header (15) do not match the actual total (25). |
| 7.4 | Legacy generator | Chinese | Quiz | 5-1 | listening | 9.0 | 7.0 | 6.0 | 9.0 | - | 10.0 | The content is significantly below Primary 4 level; the vocabulary and sentence structures are more suited for Primary 1 or 2. There is a major internal inconsistency: the quiz header states a total score of 15, but the marking scheme calculates a total of 25. The question format lacks the complexity expected in P4 listening assessments (e.g., more nuanced inference). |
| 7.4 | Legacy generator | Science | Quiz | 3-0 | matter | 9.0 | 6.0 | 7.0 | 5.0 | 10.0 | 8.0 | Major misalignment between quiz and answer key. The quiz contains questions on density and volume calculations which are not in the P4 syllabus (density is typically P5/P6). Furthermore, the answer key does not correspond to the quiz questions (e.g., Quiz Q1-5 vs Answer Key Q1-5 are completely different topics). The answer key introduces complex concepts like particle energy and reversible changes not present in the quiz. High difficulty due to out-of-syllabus content. |
| 7.6 | Legacy generator | Science | Quiz | 5-1 | heat | 9.0 | 6.0 | 7.0 | 8.5 | 10.0 | 8.0 | The quiz content is significantly above Primary 4 level. Concepts like radiation, convection currents, land/sea breezes, and double-glazing are typically Secondary school topics in the Singapore syllabus. P4 heat should focus on basic heat flow (hot to cold), conductors/insulators, and expansion/contraction. There is also a math error: the header says 15 marks, but the total is 25. |
| 7.7 | Legacy generator | Chinese | Paper | 3-1 | wa1-paper-8 | 8.5 | 8.0 | 7.0 | 5.0 | 10.0 | 7.0 | The paper has significant issues: 1. Missing images for the composition section. 2. Section V (Sentence Correction) contains a 'trick' question where the sentence is already correct, which is unusual for P4. 3. Difficulty is uneven; Section I is very basic, while Section V requires higher-order logic. 4. Answer key for Section V provides corrections but lacks step-by-step reasoning. 5. Marks distribution is slightly inconsistent with standard MOE weighted formats. |
| 7.7 | Legacy generator | Mathematics | Quiz | 3-0 | fractions | 10.0 | 10.0 | 6.0 | 8.0 | 2.0 | 9.0 | Major issue: The answer key does not match the quiz questions. The questions in Section A and B are completely different from the questions and answers provided in the key. Additionally, the quiz uses plain text slashes instead of proper LaTeX notation for fractions. |
| 7.9 | Legacy generator | Science | Parents Guide | 2-9 | parents-guide | 9.5 | 6.0 | - | - | - | - | The guide includes several topics not present in the provided P4 syllabus (Magnets, Food Chains/Living Together). It also introduces advanced concepts like density and photosynthesis equations which are beyond the P4 scope defined in the syllabus. Language is excellent for parents. |
| 7.9 | Legacy generator | Chinese | Quiz | 5-1 | writing | 9.0 | 8.5 | 7.0 | 9.0 | - | 9.0 | The quiz is significantly too easy for Primary 4; Section A uses P1/P2 level vocabulary (sun, book, apple). Section B focuses on basic conjunctions rather than P4 level sentence structures. The exam format lacks a total marks/time allocation header in the main paper, though the answer key corrects the total to 20. Missing actual images for Section A and C. |
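
The review list is a threshold filter over the overall scores. The sketch below is illustrative only: `needs_review` and the tuple layout are hypothetical names, and the strict "less than" comparison is an assumption. It is consistent with the page, since the Chinese vocabulary quiz shows an overall score of 8.0 in the detailed rows and does not appear in this list, but the dashboard's actual comparison (and whether it runs on rounded or unrounded scores) is not shown.

```python
REVIEW_THRESHOLD = 8.0  # cutoff named in the section heading

# A few (artifact, overall score) pairs copied from the tables on this page.
artifacts = [
    ("Chinese / Quiz 5-1 reading", 6.2),
    ("Chinese / Quiz 5-1 vocabulary", 8.0),
    ("Science / Parents Guide 2-9", 7.9),
    ("English / Cheatsheet 2-7", 9.5),
    ("Chinese / Cheatsheet 2-7", 5.0),
]


def needs_review(items, threshold=REVIEW_THRESHOLD):
    """Return items scoring strictly below the threshold, lowest score first."""
    flagged = [item for item in items if item[1] < threshold]
    return sorted(flagged, key=lambda item: item[1])


for name, score in needs_review(artifacts):
    print(f"{score:.1f}  {name}")
```

With these sample rows, the Chinese cheatsheet (5.0), the Chinese reading quiz (6.2), and the Science parents guide (7.9) are flagged, while the 8.0 vocabulary quiz is not.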

Content Type Summary

Average scores grouped by content type.

Cheatsheet: 8.3 (5 artifacts, 1 with missing-image flags)
Paper: 9.1 (25 artifacts, 18 with missing-image flags)
Parents Guide: 9.2 (5 artifacts, 0 with missing-image flags)
Quiz: 8.5 (54 artifacts, 21 with missing-image flags)
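
These per-type figures can be cross-checked against the LLM Summary. The sketch below is a rough consistency check, not the dashboard's method: the tuple layout and `weighted_overall` are hypothetical, the model attribution comes from the LLM by Content Type table, and the inputs are already rounded to one decimal, so the weighted mean is only approximate.

```python
# (content type, generation model, average overall, artifacts, missing-image flags)
# Figures copied from this page; the tuple layout is an assumption.
content_types = [
    ("Cheatsheet", "Claude Sonnet 4", 8.3, 5, 1),
    ("Paper", "Legacy generator", 9.1, 25, 18),
    ("Parents Guide", "Legacy generator", 9.2, 5, 0),
    ("Quiz", "Legacy generator", 8.5, 54, 21),
]


def weighted_overall(rows):
    """Artifact-weighted mean of the per-type overall averages."""
    total = sum(count for _, _, _, count, _ in rows)
    return sum(score * count for _, _, score, count, _ in rows) / total


legacy = [row for row in content_types if row[1] == "Legacy generator"]

total_artifacts = sum(row[3] for row in content_types)
legacy_missing = sum(row[4] for row in legacy)
legacy_overall = round(weighted_overall(legacy), 1)

print(total_artifacts, legacy_missing, legacy_overall)  # 89 39 8.7
```

The totals line up with the rest of the page: 89 artifacts matches the report count, and the Legacy generator figures (39 missing-image flags, overall 8.7) match its LLM Summary row.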

Detailed Benchmark Rows

Topics, quiz variants, paper versions, cheatsheets, and parent guides listed individually.

Model Type Stage Subject Topic / Paper Overall Missing Images LanguageSyllabusTemplateCleanStep AnswersNotationPaper FormatDifficultyTime Fit3-Point SummaryParent Guide Difficulty Comments
Claude Sonnet 4 Cheatsheet 2-7 Chinese cheatsheet 5.0 No 3.02.0-10.0---2.0-8.0- too hard The content is highly inappropriate for Primary 4. It covers advanced topics like 'Classical Chinese (文言文)', 'Literary Appreciation (文学鉴赏)', and 'Argumentative Writing (议论文)', which are secondary school level concepts. The vocabulary and cognitive demands (e.g., critical thinking, logical rebuttal) far exceed the P4 MOE syllabus. While the three-point summary structure is good, the subject matter is wrong for the age group.
Claude Sonnet 4 Cheatsheet 2-7 English cheatsheet 9.5 No 9.510.0-10.0---9.0-9.0- appropriate Excellent syllabus alignment. The cheatsheet uses effective thematic grouping and provides concise, high-quality summaries for P4 level. Language is appropriate for the age group. No major issues found.
Claude Sonnet 4 Cheatsheet 2-7 Higher Chinese cheatsheet 8.9 No 8.59.0-10.0-10.0-7.0-9.0- uneven The cheatsheet is well-structured with useful three-point summaries (Key words, Sentence making, Common errors). However, the difficulty is uneven: topics like 'Classical Chinese' (文言文) and 'Argumentation' (论证) are quite advanced for P4, whereas 'Sentence Transformation' is very basic. It covers the syllabus well including Singapore culture.
Claude Sonnet 4 Cheatsheet 2-7 Mathematics cheatsheet 8.3 Yes 10.010.0-10.00.09.0-10.0-9.0- appropriate Excellent syllabus coverage for P4. Topic sections use effective bulleted summaries rather than generic text. Notation is clean, though some fractions use unicode instead of LaTeX. Missing diagrams for geometry and nets which are essential for this level.
Claude Sonnet 4 Cheatsheet 2-7 Science cheatsheet 9.8 No 10.010.0-10.0---10.0-9.0- appropriate Excellent cheatsheet. High syllabus adherence for P4 Science. Uses effective bulleted summaries for each topic. Language is perfectly pitched for 10-year-olds. No broken markdown or artifacts found.
Legacy generator Paper 3-1 Chinese sa1-paper-1 9.3 Yes 9.59.09.010.09.5-9.59.09.0-- appropriate High quality paper. Language and difficulty are well-aligned with P4 standards. The exam format (instructions, marks, timing) is professional. Major issue: The 'Picture Composition' (看图作文) section relies on image descriptions rather than actual images, which is a critical missing element for a visual-based task. Answer key is excellent, providing both correct answers and scoring rubrics.
Legacy generator Paper 3-1 Chinese sa1-paper-2 9.2 Yes 9.59.08.510.09.010.09.09.09.0-- appropriate The paper is well-structured and follows the P4 Chinese syllabus. Language level is accurate. Major issue: The 'Look and Write' (看图作文) section relies on four specific images that are described in text but not visually present. The answer key is excellent, providing scoring rubrics and model essays.
Legacy generator Paper 3-1 Chinese sa1-paper-3 9.3 Yes 9.59.08.510.09.010.09.59.09.0-- appropriate High quality paper. Language and difficulty are well-aligned with P4 standards. Note: The 'Picture Composition' section relies on text descriptions of images rather than actual images, which is a functional gap for a visual task.
Legacy generator Paper 3-1 Chinese sa1-paper-4 9.2 Yes 9.59.08.510.09.010.09.09.09.0-- appropriate The paper is well-structured and follows the P4 syllabus. Language is appropriate. Major issue: The 'Picture Composition' (看图作文) section relies on four images that are described in text but not visually present, making the artifact incomplete for actual use. Answer key is excellent with clear marking schemes and explanations.
Legacy generator Paper 3-1 Chinese sa1-paper-5 9.3 Yes 9.59.08.510.09.010.09.59.09.0-- appropriate High quality paper. Language and difficulty are well-aligned with P4 standards. Major issue: The 'Picture Composition' (看图作文) section relies entirely on text descriptions of images rather than actual images, which is a critical failure for this specific question type. Answer key is excellent, providing scoring rubrics and model essays.
Legacy generator Paper 3-1 Chinese sa2-paper-1 9.5 Yes 10.09.59.010.09.510.09.59.09.0-- appropriate High quality paper. Language and difficulty are well-aligned with P4 standards. The 'missing images' for the composition section are provided as text descriptions, which is acceptable for a text-based artifact but would require actual images in a real exam. Answer key is excellent, providing clear marking schemes and model answers.
Legacy generator Paper 3-1 Chinese sa2-paper-2 9.2 Yes 9.59.08.510.09.010.09.09.09.0-- appropriate High quality paper. Language and difficulty are well-aligned with P4 standards. The 'missing images' issue is significant for the composition section as it uses text descriptions instead of actual visual aids, though the content is logically sound. Answer key is excellent, providing marking schemes and sample essays.
Legacy generator Paper 3-1 Chinese sa2-paper-3 9.2 Yes 9.59.08.510.09.010.09.09.09.0-- appropriate High quality paper. Language and difficulty are well-aligned with P4 standards. Note: The 'Look at Pictures' composition section uses text descriptions instead of actual images, which is a major missing element for a visual-based task. Answer key is excellent with clear marking schemes and explanations.
Legacy generator Paper 3-1 Chinese sa2-paper-4 9.0 Yes 9.59.08.59.09.5-8.59.09.0-- appropriate Language and difficulty are well-aligned with P4 standards. The paper format is good, though the composition section uses text descriptions instead of actual images. A minor error was noted in the answer key for question 4 where the question text and answer explanation conflict regarding the character '惯' vs '贯'.
Legacy generator Paper 3-1 Chinese sa2-paper-5 9.3 Yes 9.59.08.510.09.510.09.09.09.0-- appropriate High quality paper. Language and difficulty are well-aligned with P4 standards. The 'missing images' for the composition section are provided as text descriptions, which is acceptable for a text-based artifact but would require actual images in a real exam. Answer key is excellent, providing both answers and scoring rubrics.
Legacy generator Paper 3-1 Chinese wa1-paper-1 8.7 Yes 9.08.58.010.09.010.09.07.08.0-- uneven The paper includes a placeholder for a picture in the composition section which is required. Difficulty is uneven: Section 1 is very easy for P4, but Section 5 (Sentence Correction) and the 150-word composition requirement are quite challenging for a 60-mark/50-minute paper. Answer key is high quality with explanations.
Legacy generator Paper 3-1 Chinese wa1-paper-2 8.8 Yes 9.08.58.010.09.510.09.07.58.0-- appropriate The paper follows a standard format. Language is suitable for P4. However, the 'Sentence Correction' section is slightly problematic as some 'errors' are actually just stylistic choices or minor punctuation issues rather than clear grammatical errors. The composition section relies on a text description instead of an actual image. The answer key is excellent with clear explanations.
Legacy generator Paper 3-1 Chinese wa1-paper-3 9.2 Yes 9.59.08.510.09.010.09.08.59.0-- appropriate Language and difficulty are well-aligned with P4 standards. The paper format is professional. Major issue: The composition section relies on a picture that is only provided as a text description, which is not suitable for a real exam. Answer key is high quality with explanations.
Legacy generator Paper 3-1 Chinese wa1-paper-4 9.2 Yes 9.59.08.510.09.010.09.08.59.0-- appropriate Language and difficulty are well-aligned with P4 standards. The paper follows a standard exam structure. Major issue: The composition section relies on a visual prompt that is only provided as a text description, making the actual paper unusable without the image. Answer key is high quality with explanations.
Legacy generator Paper 3-1 Chinese wa1-paper-5 8.8 Yes 9.08.58.010.09.0-9.08.09.0-- appropriate The paper follows the P4 Chinese syllabus well. Language is appropriate. Major issue: The composition section relies on a picture that is only provided as a text description, making it unusable as a real exam paper. The answer key is high quality with explanations. Marks and timing are realistic.
Legacy generator Paper 3-1 Chinese wa1-paper-6 8.1 Yes 9.08.57.010.05.010.08.06.09.0-- too easy The paper is significantly too easy for Primary 4; most vocabulary and sentence structures are at a P2/P3 level. The 'Sentence Correction' section has logical errors in the answer key (e.g., Q22 correction is semantically incorrect/unnatural). Missing images for the composition section. Answer key lacks step-by-step explanations for language logic.
Legacy generator Paper 3-1 Chinese wa1-paper-7 8.6 Yes 9.08.58.010.07.010.09.07.58.0-- appropriate The paper follows a standard format. However, the composition section relies on a text description instead of an actual image, which is a major flaw for a 'Look at the picture and write' task. The difficulty is generally appropriate for P4, though some vocabulary in the multiple-choice section might be slightly repetitive. Answer key is clear but lacks detailed explanations for the sentence correction section.
Legacy generator Paper 3-1 Chinese wa1-paper-8 7.7 Yes 8.58.07.010.05.010.08.06.07.0-- uneven The paper has significant issues: 1. Missing images for the composition section. 2. Section V (Sentence Correction) contains a 'trick' question where the sentence is already correct, which is unusual for P4. 3. Difficulty is uneven; Section I is very basic, while Section V requires higher-order logic. 4. Answer key for Section V provides corrections but lacks step-by-step reasoning. 5. Marks distribution is slightly inconsistent with standard MOE weighted formats.
Legacy generator Paper 3-1 Chinese wa2-paper-1 9.2 No 9.59.08.510.09.5-9.09.09.0-- appropriate High quality paper. Language and difficulty are well-aligned with P4 standards. Answer key is excellent, providing clear marking schemes and explanations. Format is professional. Minor note: Section 1C (word matching) is slightly simplified compared to some standard exam formats but remains pedagogically sound.
Legacy generator Paper 3-1 Chinese wa2-paper-2 9.3 No 9.59.08.510.09.510.09.09.09.0-- appropriate High quality paper. Language and difficulty are well-aligned with P4 standards. Answer key is excellent, providing clear marking schemes and explanations. Format is professional. Note: No actual images were required for this specific content, so no missing image issues found.
Legacy generator Paper 3-1 Chinese wa2-paper-3 9.6 No 10.09.59.010.010.010.09.59.09.0-- appropriate High quality paper. Language and difficulty are well-aligned with P4 standards. Answer key is excellent, providing clear marking schemes and explanations. Format follows standard WA structure. No major issues found.
Legacy generator Paper 3-1 Chinese wa2-paper-4 9.3 No 9.59.08.510.09.510.09.09.09.0-- appropriate High quality paper. Language and vocabulary are well-aligned with P4 standards. Answer key is excellent, providing clear marking schemes and explanations. Format follows standard WA structure. No major issues found.
Legacy generator Paper 3-1 Chinese wa2-paper-5 9.6 No 10.09.59.010.010.010.09.59.09.0-- appropriate High quality paper. Language and difficulty are well-aligned with P4 standards. Answer key is excellent, providing clear scoring rubrics for comprehension and writing. Format follows standard WA structure. No major issues found.
Legacy generator Paper 3-1 Chinese wa3-paper-1 9.3 No 9.59.08.510.09.510.09.09.09.0-- appropriate High quality paper. Language and vocabulary are well-aligned with P4 standards. Answer key is excellent, providing clear marking schemes and explanations. Format follows standard Singaporean WA structure. No major issues found.
Legacy generator Paper 3-1 Chinese wa3-paper-2 9.3 No 9.59.08.510.09.510.09.09.09.0-- appropriate High quality paper. Language and vocabulary are well-aligned with P4 standards. The answer key is excellent, providing clear marking schemes and explanations. Format follows standard Singapore school assessment styles. One minor note: Question 5 answer key says '快速' but the question context '总是第一个做完' might better suit '认真' or '仔细' depending on nuance, though '快速' is acceptable. Overall very strong.
Legacy generator Parents Guide 2-9 Chinese parents-guide 9.0 No 9.59.08.510.0--8.09.08.5-9.5 appropriate High quality guide. Language is appropriate for parents. Adheres well to P4 syllabus (character counts, composition types). Exam format analysis is helpful, though specific marks/minutes are estimates. Very practical for home support.
Legacy generator Parents Guide 2-9 English parents-guide 9.9 No 10.09.5-10.0---10.0--10.0 appropriate Excellent parent guide. Highly aligned with MOE P4 syllabus, covering grammar, comprehension, and writing expectations accurately. Practical home support strategies and assessment schedules are well-structured.
Legacy generator Parents Guide 2-9 Higher Chinese parents-guide 9.1 No 9.59.08.510.0--8.09.0--9.5 appropriate High quality guide. Excellent breakdown of Higher Chinese characteristics (idioms, classical elements). Syllabus alignment is strong, particularly regarding cultural literacy and character recognition targets. Exam format analysis is helpful for parents. No major issues found.
Legacy generator Parents Guide 2-9 Mathematics parents-guide 10.0 No 10.010.0-10.0---10.0--10.0 appropriate Excellent parent guide. Highly aligned with the MOE P4 syllabus, including specific topics like decimals and unlike fractions. Provides practical, actionable advice for parents and correctly identifies the transition from P3 to P4.
Legacy generator Parents Guide 2-9 Science parents-guide 7.9 No 9.56.0-10.0---8.0--6.0 appropriate The guide includes several topics not present in the provided P4 syllabus (Magnets, Food Chains/Living Together). It also introduces advanced concepts like density and photosynthesis equations which are beyond the P4 scope defined in the syllabus. Language is excellent for parents.
Legacy generator Quiz 3-0 Chinese general 8.6 No 9.08.57.010.09.0-7.58.010.0-- appropriate Language and syllabus alignment are good for P4. Question 3 is a bit weak as 'He does homework every day' is not grammatically incorrect, just simple. Exam format lacks standard header details (School/Name/Class) but includes marks and time. Answer key is helpful with explanations.
Legacy generator Quiz 5-1 Chinese listening 7.4 No 9.07.06.010.09.0-5.03.010.0-- too easy The content is significantly below Primary 4 level; the vocabulary and sentence structures are more suited for Primary 1 or 2. There is a major internal inconsistency: the quiz header states a total score of 15, but the marking scheme calculates a total of 25. The question format lacks the complexity expected in P4 listening assessments (e.g., more nuanced inference).
Legacy generator Quiz 5-1 Chinese reading 6.2 No 6.05.04.010.09.0-4.02.010.0-- too easy The content is far too simple for Primary 4; it reads like a Primary 1 or 2 level text. The vocabulary and sentence structures lack the complexity required by the P4 syllabus. There is a major discrepancy between the header score (15) and the actual total score (25). The question format does not reflect the standard Singapore P4 Chinese comprehension paper style.
Legacy generator Quiz 5-1 Chinese speaking 8.3 Yes 9.08.57.010.09.0-6.07.010.0-- too easy The quiz is significantly easier than actual P4 oral exams. Part B uses text descriptions instead of actual images, which defeats the purpose of a visual-based speaking test. Part C questions are too basic (P1/P2 level) and lack the 'analysis/explanation' depth required by the P4 syllabus. Exam format lacks official time/marks allocation structure.
Legacy generator Quiz 5-1 Chinese vocabulary 8.0 No 9.08.57.010.09.5-6.04.010.0-- too easy The content is significantly below Primary 4 level; it reads like a Primary 1 or 2 vocabulary drill. The total score in the header (15) contradicts the actual total (25). Exam format lacks formal instructions and time allocation.
Legacy generator Quiz 5-1 Chinese writing 7.9 Yes 9.08.57.010.09.0-6.05.09.0-- too easy The quiz is significantly too easy for Primary 4; Section A uses P1/P2 level vocabulary (sun, book, apple). Section B focuses on basic conjunctions rather than P4 level sentence structures. The exam format lacks a total marks/time allocation header in the main paper, though the answer key corrects the total to 20. Missing actual images for Section A and C.
Legacy generator Quiz 3-0 English cloze-passage 9.1 No 9.59.08.510.09.010.09.07.010.0-- too easy Language and syllabus alignment are strong. However, the difficulty is quite low for P4; the grammar questions (pronouns and basic SVA) are more aligned with P2/P3 levels. The vocabulary cloze is appropriate. Format is clean and follows exam-style instructions well.
Legacy generator Quiz 3-0 English composition 9.7 Yes 10.010.09.010.0--9.010.010.0-- appropriate The quiz follows the P4 guided composition format well. However, it relies on text descriptions of pictures rather than actual images, which is a significant drawback for a composition task. The sample answer and rubric are high quality and align with MOE standards.
Legacy generator Quiz 5-1 English composition 8.4 Yes 9.59.07.010.08.5-6.08.09.0-- appropriate Language and syllabus alignment are strong. However, Section C relies on a 'Picture Description' instead of an actual image, which is a major flaw for a composition quiz. There is also a mathematical inconsistency: the header states a total score of 15, but the marking scheme sums to 20.
Legacy generator Quiz 3-0 English comprehension 9.4 No 10.010.09.010.08.0-9.09.010.0-- appropriate High quality quiz. Language and difficulty are well-aligned with P4 standards. Question types cover literal, inferential, and vocabulary skills effectively. Answer key is clear, though step-by-step reasoning for inferential questions could be more explicit.
Legacy generator Quiz 5-1 English comprehension 8.6 No 9.59.07.010.08.510.06.08.59.0-- appropriate Language is well-suited for P4. The quiz structure deviates from standard Singapore exam formats (which usually separate MCQ and Comprehension into distinct sections with specific mark allocations). The total marks in the header (15) contradicts the actual total (25). Answer key provides good explanations but lacks formal marking rubrics for open-ended questions.
Legacy generator Quiz 3-0 English editing 9.8 No 10.010.010.010.09.0-10.09.010.0-- appropriate High quality quiz. Adheres well to P4 editing format (spelling and grammar). Answer key includes helpful error analysis and teaching tips. The difficulty is well-calibrated for Middle Primary.
Legacy generator Quiz 5-1 English grammar 8.5 No 9.59.07.010.09.0-6.08.59.0-- appropriate Language and syllabus alignment are strong for P4. However, there is a major discrepancy between the quiz header (Score: / 15) and the actual marking scheme (Total: 25 marks). The exam format lacks a time duration. Question 9 in Section B provides 'Because/Since' as answers, but 'Since' was not in the provided options list, which is a minor instructional error.
Legacy generator Quiz 3-0 English grammar-vocabulary 9.7 No 10.010.09.010.010.010.09.09.010.0-- appropriate High quality quiz. Grammar and vocabulary topics align perfectly with P4 syllabus. Answer key provides excellent concise explanations. Format is clean and professional.
Legacy generator Quiz 5-1 English oral 8.1 Yes 9.08.56.010.09.010.05.07.08.0-- appropriate The quiz lacks the actual visual stimulus required for an Oral exam; it only provides text descriptions of pictures, which is not how P4 Oral is conducted. The marking scheme is too generous (1 mark per question) compared to standard MOE weighted rubrics. Question structure for Part B and C is more like a written comprehension than a spoken stimulus-based conversation.
Legacy generator Quiz 3-0 English synthesis-transformation 9.3 No 10.010.09.010.07.010.09.09.010.0-- appropriate High quality quiz. Content aligns well with P4 syllabus (relative pronouns, conjunctions, and sentence rewriting). The answer key provides good pedagogical guidance and common error warnings, though it lacks a strict step-by-step breakdown for the transformations. Format is clean and professional.
Legacy generator Quiz 5-1 English vocabulary 8.6 No 9.59.07.010.08.010.06.08.010.0-- appropriate Language and vocabulary are well-suited for P4. Major issue: The total marks in the header (15) contradicts the actual marks calculated in the marking scheme (25). Section C contains a mix of question types (sentence construction, word forms, homophones) which is slightly uneven for a single section.
Legacy generator Quiz 3-0 Higher Chinese general 9.3 No 9.59.08.510.09.510.09.09.09.59.0- appropriate High quality quiz. Language and vocabulary are well-aligned with P4 Higher Chinese standards. Answer key provides excellent marking schemes and scoring rubrics. The inclusion of a skills summary and idiom table acts as a useful cheatsheet. The letter format uses ASCII boxes which is acceptable for markdown.
Legacy generator Quiz 5-1 Higher Chinese listening 8.1 No 9.08.57.010.09.0-6.05.010.0-- too easy The content is too simple for Higher Chinese P4; it reads more like Standard Chinese P2/P3. The questions focus on literal retrieval rather than the 'implied meaning' and 'analysis' required by the P4 Higher Chinese syllabus. There is a mathematical error in the header: the score is listed as /15, but the total marks calculated in the answer key is 25. Exam format lacks specific time duration.
Legacy generator Quiz 5-1 Higher Chinese reading 7.2 No 8.07.06.010.09.0-5.04.09.0-- too easy The content is significantly below the expected rigor for Higher Chinese P4. The passage is extremely simple, and most questions are direct literal retrieval rather than the inference and analysis required by the syllabus. There is a major scoring discrepancy: the header says 15 marks, but the breakdown and total sum to 25 marks. The exam format lacks standard instructions and time allocation.
Legacy generator Quiz 5-1 Higher Chinese speaking 8.9 Yes 9.09.58.010.09.0-7.08.510.0-- appropriate The quiz content is well-aligned with P4 Higher Chinese oral standards. However, Part B (Picture Description) is fundamentally broken because the actual image is missing, providing only a text description of a scene instead. The marking scheme is excellent and provides clear rubrics for oral assessment.
Legacy generator Quiz 5-1 Higher Chinese vocabulary 8.3 No 9.08.57.010.09.010.06.05.010.0-- too easy The quiz is significantly too easy for Higher Chinese P4; the vocabulary (e.g., 聪明, 高兴, 帮助) is more aligned with Standard Chinese P1/P2. The marking scheme has a calculation error: the header says 15 marks, but the breakdown and total sum to 25 marks. Section B is too simplistic for Higher Chinese level.
Legacy generator Quiz 5-1 Higher Chinese writing 8.2 Yes 9.0 8.5 7.0 10.0 9.0 - 6.0 7.0 9.0 - - too easy The quiz is too easy for Higher Chinese P4; Section A and B resemble standard Chinese (Lower) level. Major issue: Section C requires a picture but only provides a text description, making it a reading task rather than a true picture composition. Total marks in header (15) do not match the answer key total (18). Instructions lack time allocation.
Legacy generator Quiz 5-1 Mathematics addition-subtraction 8.8 No 10.0 10.0 7.0 10.0 9.0 10.0 6.0 7.0 10.0 - - too easy The quiz is too repetitive; Section B is just direct calculation without any variation in wording or complexity. There is a mathematical error in the total marks calculation: the header says 15 marks, while the marking scheme and the actual sum of the questions both give 25 marks. Difficulty is low for P4 as it lacks multi-step word problems or higher-order thinking.
Legacy generator Quiz 3-0 Mathematics area-perimeter 8.3 Yes 10.0 9.0 7.0 10.0 10.0 5.0 8.0 7.0 9.0 - - uneven Major issue: Geometry questions (triangles, composite shapes) are text-only and require diagrams to be valid for P4. Syllabus adherence is good, but includes triangle area which is typically P5. Notation lacks proper LaTeX for squared units and fractions. Question 8 contains a mathematical error in the original prompt (Row 3); the answer key correctly identifies it, but the quiz itself remains flawed. Difficulty is uneven due to the inclusion of triangle area in a P4 context.
Legacy generator Quiz 3-0 Mathematics data-analysis 8.4 Yes 10.0 9.0 8.0 10.0 10.0 5.0 9.0 7.0 8.0 - - uneven The quiz contains significant missing visual data (bar graphs, line graphs, pie charts) which are described in text but not rendered. Syllabus adherence is high, but the inclusion of Mean, Median, and Mode is slightly advanced for standard P4 (usually introduced in P5/P6), making the difficulty uneven. Notation lacks LaTeX for fractions and math operations.
Legacy generator Quiz 5-1 Mathematics data-analysis 7.1 Yes 9.0 4.0 5.0 10.0 9.0 10.0 4.0 3.0 10.0 - - too easy Major syllabus misalignment: 'Average' (Mean) is a Primary 5/6 concept in Singapore, not Primary 4. P4 Data Analysis focuses on reading/interpreting tables and bar graphs, not calculating means. The quiz is also missing the actual bar graphs/line graphs described in the text, making it rely on text-based descriptions which is not standard for this topic. Total marks in header (15) contradicts the marking scheme (25).
Legacy generator Quiz 3-0 Mathematics decimals 9.4 No 10.0 10.0 9.0 10.0 10.0 8.0 9.0 9.0 10.0 - - appropriate High quality quiz. Content aligns perfectly with P4 decimals syllabus. Notation is mostly plain text rather than LaTeX, but remains clear. Marks and timing are realistic. One minor discrepancy: the header says 20 questions but only 14 are provided.
Legacy generator Quiz 3-0 Mathematics factors-multiples 9.7 No 10.0 10.0 9.0 10.0 10.0 10.0 9.0 9.0 10.0 - - appropriate High quality quiz. Content aligns perfectly with P4 Factors and Multiples syllabus. Answer key provides excellent step-by-step working. Note: The metadata claims 20 questions but only 14 are provided in the text; however, the marks and sections are internally consistent for the 14 questions shown.
Legacy generator Quiz 3-0 Mathematics four-operations 9.3 No 10.0 10.0 9.0 10.0 10.0 8.0 9.0 9.0 9.0 - - appropriate High quality quiz. Content aligns perfectly with P4 Four Operations syllabus. Answer key provides excellent step-by-step working. Minor notation issue: uses standard text symbols instead of LaTeX for math expressions, but remains readable. Note: The metadata claims 20 questions, but the content only provides 14 questions.
Legacy generator Quiz 3-0 Mathematics fractions 7.7 No 10.0 10.0 6.0 9.0 8.0 2.0 7.0 8.0 9.0 - - appropriate Major issue: The answer key does not match the quiz questions. The questions in Section A and B are completely different from the questions and answers provided in the key. Additionally, the quiz uses plain text slashes instead of proper LaTeX notation for fractions.
Legacy generator Quiz 5-1 Mathematics fractions 8.2 No 10.0 10.0 8.0 10.0 9.0 2.0 7.0 8.0 10.0 - - appropriate Content is syllabus-aligned and appropriate for P4. Major issue: lacks LaTeX for fractions, using plain text slashes which is not standard for math papers. Total marks in header (15) contradicts the marking scheme summary (25).
Legacy generator Quiz 3-0 Mathematics geometry 9.0 Yes 10.0 10.0 8.0 9.0 10.0 7.0 9.0 9.0 9.0 - - appropriate The quiz covers P4 geometry topics well. Major issue: several questions (8, 12, 13, 14) rely on visual diagrams or shapes that are not rendered, making them impossible to solve as presented. Notation for angles is acceptable but could use more formal LaTeX. Answer key is excellent with clear working.
Legacy generator Quiz 5-1 Mathematics geometry 8.7 Yes 10.0 9.0 7.0 10.0 9.0 10.0 6.0 7.0 10.0 - - too easy The quiz is quite basic for P4; it lacks the complexity of typical Singapore school papers. Major issue: Geometry questions (angles, symmetry, shapes) almost always require diagrams/visuals in a real exam, which are missing here. The total marks in the header (15) contradicts the actual total (25). Question 4 (sum of angles in a triangle) is technically P5/P6 level in some contexts, but acceptable here.
Legacy generator Quiz 3-0 Mathematics measurement 9.6 No 10.0 10.0 8.0 10.0 10.0 10.0 9.0 9.0 10.0 - - appropriate High quality quiz. Content aligns well with P4 measurement syllabus. Note: The metadata claims 20 questions but only 14 are provided. The marking scheme correctly reflects the 14 questions present. Formatting is clean and answers are well-explained.
Legacy generator Quiz 5-1 Mathematics measurement 9.3 No 10.0 10.0 8.0 10.0 10.0 10.0 7.0 9.0 10.0 - - appropriate Content is highly accurate to P4 measurement syllabus. Note: The total marks in the header (15) contradicts the actual total (25) and the section breakdown. Question 5 introduces time, which is part of measurement but slightly different from length/mass/volume focus.
Legacy generator Quiz 3-0 Mathematics money 9.6 No 10.0 10.0 8.0 10.0 10.0 10.0 9.0 9.0 10.0 - - appropriate High quality quiz. Content aligns perfectly with P4 Money syllabus. Note: The header claims 20 questions but only 14 are provided; however, the marks and sections are internally consistent. Formatting and step-by-step working are excellent.
Legacy generator Quiz 5-1 Mathematics money 8.6 No 10.0 10.0 7.0 10.0 9.0 10.0 5.0 6.0 10.0 - - too easy The quiz is too simple for P4; it lacks multi-step word problems and higher-order thinking typical of Singapore Math. Major error: the total marks in the header (15) contradicts the marking scheme summary (25). Question 5 is slightly more complex than others but overall the difficulty is very low.
Legacy generator Quiz 5-1 Mathematics multiplication-division 8.4 No 10.0 10.0 7.0 10.0 5.0 10.0 6.0 8.0 10.0 - - appropriate Syllabus alignment is strong for P4 multiplication/division. However, there is a major discrepancy in the total marks: the quiz header says 15 marks, but the marking scheme and actual question count sum to 25 marks. Answer key provides single-line solutions rather than full step-by-step working required for P4 math. Question templates are generic and lack the specific structure of Singapore MOE papers (e.g., no 'Show your working' prompts in Section B).
Legacy generator Quiz 3-0 Mathematics whole-numbers 9.3 No 10.0 10.0 9.0 10.0 10.0 8.0 9.0 9.0 9.0 - - appropriate High quality quiz. Content aligns perfectly with P4 Whole Numbers syllabus. Answer key provides excellent step-by-step working. Minor note: LaTeX could be used for mathematical operations instead of plain text/ASCII, but it is clear. The question count in the header (20) contradicts the actual number of questions provided (14).
Legacy generator Quiz 5-1 Mathematics whole-numbers 9.2 No 10.0 10.0 8.0 10.0 9.0 10.0 7.0 9.0 10.0 - - appropriate Content is highly accurate to P4 syllabus. Major discrepancy found in the total marks: the header states 15 marks, but the marking scheme and question distribution sum to 25 marks. Question difficulty is well-calibrated for P4.
Legacy generator Quiz 5-1 Science diversity 8.7 No 9.5 10.0 7.0 10.0 8.5 10.0 6.0 8.0 9.0 - - appropriate Content aligns well with P4 Diversity syllabus. However, there is a major discrepancy in the total marks: the quiz header says 15 marks, but the actual questions sum to 25 marks. The exam format lacks the standard MOE structure (e.g., Section A usually has more MCQs, and Section B/C usually involves more data/diagram analysis). Answer key is high quality with clear marking criteria.
Legacy generator Quiz 3-0 Science heat 9.0 Yes 10.0 9.5 8.0 10.0 8.5 10.0 7.5 8.5 9.0 - - appropriate Content is syllabus-accurate. However, several questions (e.g., Q9, Q10, Q12) describe experiments that typically require diagrams in Singapore Science papers to aid comprehension. The exam format is slightly off: 20 questions are listed in metadata but only 12 are present in the artifact. Marks per question in Section A are high for P4 (2 marks each for MCQs).
Legacy generator Quiz 5-1 Science heat 7.6 No 9.0 6.0 7.0 10.0 8.5 10.0 6.0 4.0 8.0 - - too hard The quiz content is significantly above Primary 4 level. Concepts like radiation, convection currents, land/sea breezes, and double-glazing are typically Secondary school topics in the Singapore syllabus. P4 heat should focus on basic heat flow (hot to cold), conductors/insulators, and expansion/contraction. There is also a math error: the header says 15 marks, but the total is 25.
Legacy generator Quiz 5-1 Science life-cycles 8.7 Yes 9.5 10.0 7.0 10.0 8.5 10.0 6.0 8.5 9.0 - - appropriate Content is syllabus-accurate for P4 Life Cycles. Major issue: The total marks in the quiz header (15) contradicts the actual marks in the sections (25). Missing diagrams for flower parts and seed dispersal questions which are standard in Singapore Science papers. Answer key is high quality with marking guidance.
Legacy generator Quiz 3-0 Science light 9.1 Yes 9.5 9.0 8.5 10.0 9.0 10.0 8.0 8.5 9.0 - - appropriate Quiz is well-structured and aligns with P4 syllabus. However, several questions (e.g., Q8, Q10, Q12) describe experiments or setups that typically require diagrams in a standard Singapore Science paper. The inclusion of the law of reflection (Q4, Q12) is slightly advanced as the syllabus notes it is not strictly required, though it fits the topic.
Legacy generator Quiz 5-1 Science light 6.6 Yes 8.0 2.0 5.0 10.0 9.0 10.0 4.0 1.0 10.0 - - too hard Major syllabus misalignment: The quiz covers refraction, the law of reflection, prisms, and spectrums, all of which are explicitly excluded from the P4 syllabus. The content is more aligned with P5/P6 or secondary school. Additionally, the total marks in the header (15) do not match the actual total (25). Missing diagrams for refraction and shadow questions.
Legacy generator Quiz 3-0 Science living-together 9.4 Yes 10.0 10.0 8.0 10.0 9.0 - 9.0 9.0 10.0 - - appropriate High quality quiz. Content aligns well with P4 Science syllabus on food chains/webs. Note: Question 10(a) asks to draw a food web but lacks a provided diagram or clear workspace, and Section A marks are slightly high for simple MCQs (2 marks each). Missing images/diagrams for food chain visual aids.
Legacy generator Quiz 3-0 Science magnets 9.6 Yes 10.0 10.0 9.0 10.0 9.0 10.0 9.0 9.0 10.0 - - appropriate High quality quiz. Content aligns well with P4 magnet syllabus. Note: Several questions (e.g., Q12, Q10) would benefit from diagrams/visuals in a real exam setting. Answer key is excellent with clear explanations.
Legacy generator Quiz 5-1 Science magnets 7.2 Yes 9.0 6.0 5.0 10.0 8.0 10.0 4.0 4.0 9.0 - - too hard Syllabus mismatch: Magnets are not part of the P4 MOE Singapore Science syllabus (they are P5/P6 topics). Content difficulty is too high for P4; concepts like magnetic field lines, electromagnets, and magnetic induction (stroking a nail) are beyond the P4 level. Exam format is poor: total marks in header (15) contradicts the actual total (25), and it lacks standard MOE Section A/B structure. Missing diagrams for magnetic field and pole positioning questions.
Legacy generator Quiz 5-1 Science materials 7.2 No 9.0 4.0 5.0 10.0 8.0 10.0 6.0 4.0 9.0 - - too hard Major syllabus misalignment: The quiz covers physical/chemical changes, dissolving, and separation techniques (filtration, evaporation), which are Primary 5/6 topics in the Singapore MOE syllabus. P4 Science focuses on states of matter, light, and heat. Additionally, the question on particle models (Q4, Q10) is too advanced for the P4 level. The total marks in the header (15) do not match the actual total (25).
Legacy generator Quiz 3-0 Science matter 7.4 Yes 9.0 6.0 7.0 10.0 5.0 10.0 8.0 4.0 8.0 - - too hard Major misalignment between quiz and answer key. The quiz contains questions on density and volume calculations which are not in the P4 syllabus (density is typically P5/P6). Furthermore, the answer key does not correspond to the quiz questions (e.g., Quiz Q1-5 vs Answer Key Q1-5 are completely different topics). The answer key introduces complex concepts like particle energy and reversible changes not present in the quiz. High difficulty due to out-of-syllabus content.
Legacy generator Quiz 3-0 Science plants 8.8 Yes 9.5 9.0 7.0 10.0 9.0 10.0 8.0 7.5 9.0 - - appropriate The quiz is well-structured and follows the syllabus. However, it lacks diagrams for Section B and C which are standard in Singapore Science papers (e.g., for photosynthesis or plant parts). The difficulty is slightly on the easy side for P4, leaning towards P3 review, but appropriate for a topical quiz. Marks assigned to MCQs (2 marks each) are higher than standard P4 papers where MCQs are usually 1 mark.
Legacy generator Quiz 5-1 Science systems 6.4 No 9.0 3.0 5.0 10.0 8.0 10.0 4.0 2.0 7.0 - - too hard Major syllabus misalignment: The quiz covers Respiratory, Circulatory, and Skeletal systems, which are explicitly excluded from the P4 syllabus (noted as P5/P6 topics). The content is far too advanced for P4. Additionally, there is a math error: the header says 15 marks, but the total is 25. The question format lacks the standard Singapore Science paper structure (Section A is usually MCQ, Section B is structured, but the complexity here is more suited to upper primary).
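
Each detailed row carries eleven criterion cells between the missing-images flag and the difficulty label. The following is a minimal sketch for splitting that run of cells, assuming every cell is either `10.0`, a single-digit score with one decimal place, or `-` for a criterion that was not scored. That format is read off the rows above and is not a documented export format; the names `CELL` and `split_score_cells` are hypothetical.

```python
import re
from typing import Optional

# Assumed cell grammar: "10.0", a single-digit score such as "8.5", or "-".
# "10.0" is listed first so it is not read as "1" followed by "0.0".
CELL = re.compile(r"10\.0|\d\.\d|-")

def split_score_cells(run: str, expected: int = 11) -> list[Optional[float]]:
    """Split a run of score cells; works with or without spaces between cells."""
    cells = CELL.findall(run)
    if len(cells) != expected:
        raise ValueError(f"expected {expected} cells, found {len(cells)}")
    # A dash is kept as None so it is never mistaken for a score of zero.
    return [None if cell == "-" else float(cell) for cell in cells]

# Score run of the Higher Chinese "listening" quiz row.
print(split_score_cells("9.0 8.5 7.0 10.0 9.0 - 6.0 5.0 10.0 - -"))
# [9.0, 8.5, 7.0, 10.0, 9.0, None, 6.0, 5.0, 10.0, None, None]
```

Pass only the score run, not the whole row: hyphens in topic names or comments (for example "step-by-step") would otherwise be counted as unscored cells.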

Criteria

Each criterion is scored with 10.0 as the best fit. Missing images are tracked as a yes/no flag rather than as a score.

1. Language suitability
2. Syllabus adherence
3. Past-paper template adherence
4. No weird artefacts/symbols
5. Step-by-step answers
6. LaTeX/notation format
7. Exam paper format
8. Difficulty appropriateness
9. Doable within timeframe
10. Cheatsheet 3-point summaries
11. Parent guide syllabus fit
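
The overall values in the detailed quiz rows are consistent with an unweighted mean of the scored criteria, rounded to one decimal place, with `-` cells left out of the average. This is an inference from the rows in this view, not the documented Stage 9-0 scoring algorithm. The sketch below recomputes an overall score on that assumption and names the weakest criterion; it also assumes the cells follow the order of the list above. `CRITERIA`, `overall_from_cells`, and `weakest_criterion` are hypothetical names.

```python
from typing import Optional

# Criterion names in the order listed under "Criteria".
# Assumption: the score cells in each row follow this same order.
CRITERIA = [
    "Language suitability",
    "Syllabus adherence",
    "Past-paper template adherence",
    "No weird artefacts/symbols",
    "Step-by-step answers",
    "LaTeX/notation format",
    "Exam paper format",
    "Difficulty appropriateness",
    "Doable within timeframe",
    "Cheatsheet 3-point summaries",
    "Parent guide syllabus fit",
]

def overall_from_cells(cells: list[Optional[float]]) -> float:
    """Unweighted mean of the scored cells (None = not scored), to one decimal."""
    if len(cells) != len(CRITERIA):
        raise ValueError(f"expected {len(CRITERIA)} cells, got {len(cells)}")
    scored = [c for c in cells if c is not None]
    if not scored:
        raise ValueError("no scored criteria")
    return round(sum(scored) / len(scored), 1)

def weakest_criterion(cells: list[Optional[float]]) -> tuple[str, float]:
    """Name and score of the lowest scored criterion (first one on ties)."""
    scored = [(name, c) for name, c in zip(CRITERIA, cells) if c is not None]
    return min(scored, key=lambda pair: pair[1])

# Higher Chinese "listening" quiz row, reported overall 8.1.
listening = [9.0, 8.5, 7.0, 10.0, 9.0, None, 6.0, 5.0, 10.0, None, None]
print(overall_from_cells(listening))   # 8.1
print(weakest_criterion(listening))    # ('Difficulty appropriateness', 5.0)
```

Treating a dash as "not scored" rather than as zero matters here: counting the three dashes in the listening row as zeros would pull the mean down to about 5.9 instead of the reported 8.1.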