Stage 9-1 Level View
Primary 3 Benchmark Scores
Per-level benchmark view grouped by generation model and subject. Scores are derived from the Stage 9-0 evaluator reports without changing the underlying scoring algorithm.
Showing all Primary 3 subjects. Pick a subject to recalculate the LLM scores, low-score review list, and detailed rows for that subview.
LLM Summary
Average scores grouped by the model that generated the Primary 3 content.
| Generation Model | Artifacts | Overall | Missing Images | Language | Syllabus | Answers | Notation | Timing |
|---|---|---|---|---|---|---|---|---|
| Claude Sonnet 4 | 5 | 8.7 | 2 | 8.4 | 8.4 | - | 9.0 | - |
| Legacy generator | 142 | 7.8 | 88 | 8.3 | 7.8 | 7.5 | 8.1 | 7.9 |
Subject Summary
Average scores grouped by subject inside Primary 3.
| Subject | Artifacts | Overall | Missing Images | Language | Syllabus | Answers | Notation | Timing |
|---|---|---|---|---|---|---|---|---|
| Chinese | 18 | 8.7 | 13 | 9.1 | 8.6 | 8.4 | 10.0 | 9.0 |
| English | 38 | 4.5 | 7 | 4.3 | 4.1 | 3.5 | 4.8 | 3.9 |
| Higher Chinese | 8 | 8.8 | 2 | 9.2 | 8.8 | 9.1 | 10.0 | 9.2 |
| Mathematics | 43 | 9.2 | 33 | 10.0 | 9.8 | 9.1 | 8.3 | 9.6 |
| Science | 40 | 8.9 | 35 | 9.7 | 8.8 | 8.8 | 10.0 | 9.2 |
LLM by Content Type
Model scores split by quizzes, papers, cheatsheets, and parent guides.
| Generation Model | Type | Artifacts | Overall | Missing Images | Language | Syllabus | Answers | Notation | Timing |
|---|---|---|---|---|---|---|---|---|---|
| Claude Sonnet 4 | Cheatsheet | 5 | 8.7 | 2 | 8.4 | 8.4 | - | 9.0 | - |
| Legacy generator | Paper | 85 | 7.2 | 62 | 7.3 | 7.0 | 6.6 | 7.5 | 6.9 |
| Legacy generator | Parents Guide | 5 | 9.3 | 0 | 9.7 | 8.8 | - | - | 9.0 |
| Legacy generator | Quiz | 52 | 8.7 | 26 | 9.6 | 9.1 | 9.0 | 9.4 | 9.5 |
Needs Review: Scores Below 8.0
Artifacts with overall benchmark scores below 8.0 for the current level view.
| Overall | Model | Subject | Type | Stage | Topic / Paper | Language | Syllabus | Template | Answers | Notation | Timing | Comments |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.0 | Legacy generator | English | Paper | 3-1 | sa1-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | The artifact is a placeholder/metadata shell only. It contains no actual exam questions, content, or marking schemes, instead referring to a non-existent SA2 version. It is not a functional paper. |
| 0.0 | Legacy generator | English | Paper | 3-1 | sa1-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | The artifact is a placeholder/template only and contains no actual exam questions, content, or answers. It is non-functional for assessment. |
| 0.0 | Legacy generator | English | Paper | 3-1 | sa1-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | The artifact is a placeholder/template only. It contains no actual questions, content, or answers, making it impossible to benchmark for language, syllabus, or difficulty. It essentially points to a 'Version 1' which is not provided. |
| 0.0 | Legacy generator | English | Paper | 3-1 | sa1-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | The artifact is a placeholder template only. It contains no actual questions, content, or answers, making it impossible to evaluate language, syllabus adherence, or difficulty. It fails to provide a functional exam paper. |
| 0.0 | Legacy generator | English | Paper | 3-1 | sa2-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | The artifact is a placeholder/template only and contains no actual exam questions, content, or answers. It is unusable for benchmarking quality. |
| 0.0 | Legacy generator | English | Paper | 3-1 | sa2-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | The artifact is a placeholder template only. It contains no actual questions, content, or assessment material, making it impossible to evaluate language, syllabus adherence, or difficulty. It essentially fails to provide a paper. |
| 0.0 | Legacy generator | English | Paper | 3-1 | sa2-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | The artifact is a placeholder/template only and contains no actual exam questions, content, or answers. It is unusable for benchmarking quality. |
| 0.0 | Legacy generator | English | Paper | 3-1 | wa1-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | The artifact is a placeholder template only. It contains no actual questions, content, or assessment material to evaluate against the syllabus or difficulty standards. |
| 0.0 | Legacy generator | English | Paper | 3-1 | wa3-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | The artifact is a placeholder template containing no actual questions, content, or answers. It fails all pedagogical and examination criteria as it lacks the substance of a paper. |
| 1.3 | Legacy generator | English | Paper | 3-1 | wa2-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | The artifact is an empty template containing no actual questions or content. It lacks the exam questions, marks per question, and the answer key required for a functional paper. |
| 1.5 | Legacy generator | English | Paper | 3-1 | sa2-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | - | 0.0 | The artifact is a placeholder template only. It contains no actual questions, content, or answers, making it impossible to evaluate language, syllabus adherence, or difficulty. It fails as a functional exam paper. |
| 1.7 | Legacy generator | English | Paper | 3-1 | sa1-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | The artifact is a placeholder template rather than a complete exam paper. It contains no actual questions, content, or answers, making it impossible to assess syllabus adherence, difficulty, or language suitability. It only provides metadata and instructions. |
| 1.7 | Legacy generator | English | Paper | 3-1 | wa1-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | The artifact is a placeholder/template only and contains no actual exam questions or content. It cannot be evaluated for syllabus adherence, difficulty, or language suitability. It lacks the actual paper content required for a benchmark. |
| 1.7 | Legacy generator | English | Paper | 3-1 | wa1-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | The artifact is a placeholder template rather than a complete exam paper. It contains no actual questions, content, or answers, making it impossible to evaluate language, syllabus adherence, or difficulty. It only provides the header and footer metadata. |
| 1.7 | Legacy generator | English | Paper | 3-1 | wa2-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | The artifact is a placeholder template rather than a complete exam paper. It contains no actual questions, content, or answers, making it impossible to evaluate syllabus adherence, difficulty, or language suitability. It only provides header information. |
| 1.7 | Legacy generator | English | Paper | 3-1 | wa3-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | The artifact is a placeholder template containing no actual questions or content. It lacks the exam questions, marks per question, and the answer key required for a functional paper. |
| 1.9 | Legacy generator | English | Paper | 3-1 | wa1-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | - | 0.0 | The artifact is a placeholder template rather than a complete exam paper. It contains no actual questions, content, or answers, making it impossible to evaluate syllabus adherence, difficulty, or language suitability. It only provides the header and footer structure. |
| 1.9 | Legacy generator | English | Paper | 3-1 | wa2-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | - | 0.0 | The artifact is a placeholder template rather than a complete exam paper. It contains no actual questions, content, or answers, making it impossible to evaluate syllabus adherence, difficulty, or language suitability. It only provides the header and footer structure. |
| 1.9 | Legacy generator | English | Paper | 3-1 | wa3-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | - | 0.0 | The artifact is a placeholder template rather than a complete exam paper. It contains no actual questions, content, or answers, making it impossible to evaluate syllabus adherence, difficulty, or language suitability. It only provides the header and footer structure. |
| 1.9 | Legacy generator | English | Paper | 3-1 | wa3-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | - | 0.0 | The artifact is a placeholder template rather than a complete exam paper. It contains no actual questions, content, or answers, making it impossible to evaluate language, syllabus adherence, or difficulty. It only provides the header and footer structure. |
| 2.8 | Legacy generator | English | Paper | 3-1 | wa2-paper-1 | 0.0 | 0.0 | 0.0 | 0.0 | 10.0 | 0.0 | The artifact is a placeholder template only. It contains no actual questions, content, or answers to evaluate for syllabus adherence, difficulty, or language suitability. It only provides the header and footer structure. |
| 5.6 | Claude Sonnet 4 | Chinese | Cheatsheet | 2-7 | cheatsheet | 4.0 | 3.0 | - | - | - | - | The content is far too advanced for Primary 3. It covers literary appreciation (意境), rhetorical devices (排比), and complex sentence structures that are more suited for Upper Primary or Secondary levels. P3 syllabus focuses on basic character recognition, simple paragraph writing, and foundational reading/listening. The cheatsheet uses abstract meta-cognitive terms (e.g., 'literary appreciation') which are developmentally inappropriate for 9-year-olds. |
| 6.0 | Legacy generator | Mathematics | Paper | 3-1 | sa1-paper-4 | 9.0 | 6.0 | 7.0 | 2.0 | 5.0 | 5.0 | Critical failure: The answer key is completely hallucinated and does not match the questions in the paper. For example, Q1 asks about digit 8 in 2816, but the answer key discusses digit 9 in a different number. Q2, Q3, Q4, etc., all have answers that correspond to entirely different questions. Additionally, many questions require diagrams (clocks, shapes, bar charts) which are missing, and the difficulty is uneven, mixing P3 concepts with P4/P5 level arithmetic and logic. |
| 7.3 | Legacy generator | Science | Quiz | 5-1 | heat | 9.0 | 4.0 | 7.0 | 8.0 | 10.0 | 9.0 | Major syllabus misalignment: Heat is not a topic in the P3 Science syllabus (it is a P4 topic). The content is too advanced for P3. Additionally, the total marks in the header (15) do not match the actual total (25). Questions are very basic and lack the complexity of actual Singapore primary science papers. |
| 7.3 | Legacy generator | Science | Quiz | 5-1 | systems | 9.5 | 4.0 | 5.0 | 8.0 | 10.0 | 9.0 | Major syllabus misalignment: 'Systems' is not a standalone topic in the P3 MOE Science syllabus; it is a concept applied within Diversity, Cycles, and Interactions. The quiz introduces concepts like photosynthesis and human organs which are P5 level. The exam format is inconsistent: the header says 15 marks, but the total is 25. Lack of diagrams for science questions makes it less authentic to Singapore exam standards. |
| 7.8 | Legacy generator | Science | Quiz | 5-1 | life-cycles | 9.0 | 9.5 | 4.0 | 8.5 | 10.0 | 9.0 | The quiz lacks diagrams which are essential for P3 Science life cycle questions. The total marks in the header (15) do not match the actual total (25). Question 15 is speculative regarding the 'how long' part which is not syllabus-standard. Formatting of instructions and marks per question is inconsistent with MOE exam styles. |
| 7.9 | Legacy generator | Chinese | Quiz | 5-1 | listening | 9.0 | 8.5 | 7.0 | 9.0 | - | 10.0 | The content is significantly below P3 level; it reads like P1 or early P2 material with very simple sentence structures and basic vocabulary. There is a mathematical inconsistency: the header says 15 marks, but the breakdown and total sum to 25 marks. The format lacks the formal structure of a standard Singapore MOE listening paper (e.g., specific instructions for the audio component). |
Content Type Summary
Average scores grouped by content type.
Detailed Benchmark Rows
Topics, quiz variants, paper versions, cheatsheets, and parent guides listed individually.
| Model | Type | Stage | Subject | Topic / Paper | Overall | Missing Images | Language | Syllabus | Template | Clean | Step Answers | Notation | Paper Format | Difficulty | Time Fit | 3-Point Summary | Parent Guide | Difficulty | Comments |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Claude Sonnet 4 | Cheatsheet | 2-7 | Chinese | cheatsheet | 5.6 | No | 4.0 | 3.0 | - | 10.0 | - | - | - | 2.0 | - | 9.0 | - | too hard | The content is far too advanced for Primary 3. It covers literary appreciation (意境), rhetorical devices (排比), and complex sentence structures that are more suited for Upper Primary or Secondary levels. P3 syllabus focuses on basic character recognition, simple paragraph writing, and foundational reading/listening. The cheatsheet uses abstract meta-cognitive terms (e.g., 'literary appreciation') which are developmentally inappropriate for 9-year-olds. |
| Claude Sonnet 4 | Cheatsheet | 2-7 | English | cheatsheet | 9.5 | No | 9.5 | 10.0 | - | 10.0 | - | - | - | 9.0 | - | 9.0 | - | appropriate | Excellent cheatsheet. Highly aligned with MOE P3 syllabus, covering grammar, writing, and comprehension. Uses effective structured summaries rather than generic pointers. Language is perfectly pitched for 9-year-olds. |
| Claude Sonnet 4 | Cheatsheet | 2-7 | Higher Chinese | cheatsheet | 9.5 | No | 9.5 | 9.0 | - | 10.0 | - | - | - | 9.0 | - | 10.0 | - | appropriate | Excellent cheatsheet structure. Uses effective three-point summaries (Key words, Sentence making, Common mistakes) for each topic. High syllabus alignment with Singapore P3 Higher Chinese requirements, including local culture and specific language skills. |
| Claude Sonnet 4 | Cheatsheet | 2-7 | Mathematics | cheatsheet | 9.6 | Yes | 9.5 | 10.0 | - | 10.0 | - | 9.0 | - | 10.0 | - | 9.0 | - | appropriate | Excellent syllabus coverage for P3. Topic sections use effective bulleted summaries rather than generic text. Notation is mostly good, though some math symbols could benefit from full LaTeX for consistency. Missing diagrams for geometry and bar graphs which are essential for this level. |
| Claude Sonnet 4 | Cheatsheet | 2-7 | Science | cheatsheet | 9.5 | Yes | 9.5 | 10.0 | - | 10.0 | - | - | - | 9.0 | - | 9.0 | - | appropriate | Excellent syllabus alignment. Content is well-structured with clear, concise bullet points suitable for P3. Cheatsheet lacks diagrams which are crucial for Science (e.g., life cycles, magnet poles), but text content is high quality. |
| Legacy generator | Paper | 3-1 | Chinese | wa2-paper-1 | 8.8 | Yes | 9.0 | 8.5 | 8.0 | 10.0 | 9.0 | 10.0 | 9.0 | 7.0 | 9.0 | - | - | too easy | The paper is well-formatted and the answer key is excellent with explanations. However, Section III (Look at picture and choose word) is missing the actual images, making it impossible to complete as intended. The difficulty is slightly low for P3 WA2, as most multiple-choice options are very obvious and the reading comprehension is quite literal. |
| Legacy generator | Paper | 3-1 | Chinese | wa2-paper-2 | 9.2 | Yes | 9.5 | 9.0 | 8.5 | 10.0 | 9.0 | 10.0 | 9.0 | 8.5 | 9.0 | - | - | appropriate | Language and difficulty are well-aligned with P3 standards. Format is professional with clear marks and instructions. Section III (Look at picture and choose words) is missing the actual images required for the task. Answer key is high quality with explanations. |
| Legacy generator | Paper | 3-1 | Chinese | wa2-paper-3 | 9.1 | Yes | 9.5 | 9.0 | 8.5 | 10.0 | 9.0 | - | 9.5 | 8.5 | 9.0 | - | - | appropriate | The paper follows the P3 syllabus well. However, Section III (Look at picture and choose word) is logically flawed because there are no actual images provided, making the section title misleading. The answer key is high quality with explanations. |
| Legacy generator | Paper | 3-1 | Chinese | wa2-paper-4 | 9.2 | Yes | 9.5 | 9.0 | 8.5 | 10.0 | 9.0 | 10.0 | 9.0 | 8.5 | 9.0 | - | - | appropriate | Language and difficulty are well-aligned with P3 standards. Section III (Look at picture and choose word) is logically flawed as there are no actual images provided, making the section impossible to complete as intended. The answer key is high quality with explanations. |
| Legacy generator | Paper | 3-1 | Chinese | wa2-paper-5 | 9.2 | Yes | 9.5 | 9.0 | 8.5 | 10.0 | 9.0 | 10.0 | 9.0 | 8.5 | 9.0 | - | - | appropriate | Language and difficulty are well-aligned with P3 standards. Section III (Look at picture and choose word) is logically flawed as there are no actual images provided, making the section impossible to complete as intended. Answer key is high quality with explanations. |
| Legacy generator | Paper | 3-1 | Chinese | wa3-paper-1 | 8.9 | Yes | 9.5 | 9.0 | 8.5 | 10.0 | 7.0 | 10.0 | 9.0 | 8.5 | 9.0 | - | - | appropriate | Language and syllabus alignment are strong for P3. Section III (Look at picture and choose words) is logically broken because the required images are missing. Answer key for comprehension is provided as reference answers rather than step-by-step logic, which is acceptable for this level. |
| Legacy generator | Paper | 3-1 | Chinese | wa3-paper-2 | 8.8 | Yes | 9.0 | 8.5 | 8.0 | 10.0 | 7.0 | 10.0 | 9.0 | 8.5 | 9.0 | - | - | appropriate | The paper follows a standard exam structure. Section III (Look at picture and choose words) is logically broken because the required images are missing. Answer key for comprehension is provided as reference answers rather than step-by-step, which is acceptable for language subjects. Difficulty is well-aligned with P3 level. |
| Legacy generator | Paper | 3-1 | Chinese | wa3-paper-3 | 8.9 | Yes | 9.5 | 9.0 | 8.5 | 10.0 | 7.0 | 10.0 | 9.0 | 8.5 | 9.0 | - | - | appropriate | Language and difficulty are well-aligned with P3 standards. Section III (Look at picture and choose words) is broken because the required images are missing. Answer key for comprehension is provided as reference answers rather than step-by-step logic, which is acceptable for this level. |
| Legacy generator | Paper | 3-1 | Chinese | wa3-paper-4 | 8.9 | Yes | 9.5 | 9.0 | 8.5 | 10.0 | 7.0 | 10.0 | 9.0 | 8.5 | 9.0 | - | - | appropriate | Language and difficulty are well-aligned with P3 level. Section III (看图选词) is logically flawed as it requires images to be functional but no images or placeholders are provided. Answer key for open-ended questions is provided as reference rather than step-by-step logic. |
| Legacy generator | Paper | 3-1 | Chinese | wa3-paper-5 | 8.9 | Yes | 9.5 | 9.0 | 8.5 | 10.0 | 7.0 | 10.0 | 9.0 | 8.5 | 9.0 | - | - | appropriate | Language and syllabus alignment are strong for P3. Section III (Look at picture and choose words) is logically flawed because there are no actual images provided, making it impossible for a student to complete as intended. Answer key for comprehension is good but lacks detailed marking rubrics for open-ended questions. |
| Legacy generator | Paper | 3-1 | English | sa1-paper-1 | 0.0 | No | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder/metadata shell only. It contains no actual exam questions, content, or marking schemes, instead referring to a non-existent SA2 version. It is not a functional paper. |
| Legacy generator | Paper | 3-1 | English | sa1-paper-1 | 0.0 | No | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder/template only and contains no actual exam questions, content, or answers. It is non-functional for assessment. |
| Legacy generator | Paper | 3-1 | English | sa1-paper-1 | 0.0 | No | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder/template only. It contains no actual questions, content, or answers, making it impossible to benchmark for language, syllabus, or difficulty. It essentially points to a 'Version 1' which is not provided. |
| Legacy generator | Paper | 3-1 | English | sa1-paper-1 | 0.0 | Yes | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder template only. It contains no actual questions, content, or answers, making it impossible to evaluate language, syllabus adherence, or difficulty. It fails to provide a functional exam paper. |
| Legacy generator | Paper | 3-1 | English | sa1-paper-1 | 1.7 | Yes | 0.0 | 0.0 | 0.0 | 10.0 | 0.0 | 0.0 | 5.0 | 0.0 | 0.0 | - | - | too easy | The artifact is a placeholder template rather than a complete exam paper. It contains no actual questions, content, or answers, making it impossible to assess syllabus adherence, difficulty, or language suitability. It only provides metadata and instructions. |
| Legacy generator | Paper | 3-1 | English | sa2-paper-1 | 8.9 | No | 9.5 | 9.0 | 8.5 | 10.0 | 7.0 | 10.0 | 9.0 | 8.5 | 9.0 | - | - | appropriate | High quality paper. Language is well-calibrated for P3. Marks and timing are realistic. Answer key is good but lacks step-by-step reasoning for comprehension; it provides direct answers only. Editing section has 11 errors instead of the instructed 10, which may confuse students. |
| Legacy generator | Paper | 3-1 | English | sa2-paper-1 | 1.5 | No | 0.0 | 0.0 | 0.0 | 10.0 | 0.0 | - | 2.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder template only. It contains no actual questions, content, or answers, making it impossible to evaluate language, syllabus adherence, or difficulty. It fails as a functional exam paper. |
| Legacy generator | Paper | 3-1 | English | sa2-paper-1 | 0.0 | No | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder/template only and contains no actual exam questions, content, or answers. It is unusable for benchmarking quality. |
| Legacy generator | Paper | 3-1 | English | sa2-paper-1 | 0.0 | Yes | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder template only. It contains no actual questions, content, or assessment material, making it impossible to evaluate language, syllabus adherence, or difficulty. It essentially fails to provide a paper. |
| Legacy generator | Paper | 3-1 | English | sa2-paper-1 | 0.0 | No | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder/template only and contains no actual exam questions, content, or answers. It is unusable for benchmarking quality. |
| Legacy generator | Paper | 3-1 | English | wa1-paper-1 | 9.2 | No | 9.5 | 9.0 | 8.5 | 10.0 | 9.0 | 10.0 | 9.0 | 8.5 | 9.0 | - | - | appropriate | High quality paper. Grammar and vocabulary levels are well-aligned with P3 syllabus. Note: Answer key for Q24 contains a gender error (His vs Her) which should be corrected in the source text. Comprehension questions are well-structured. |
| Legacy generator | Paper | 3-1 | English | wa1-paper-1 | 0.0 | No | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder template only. It contains no actual questions, content, or assessment material to evaluate against the syllabus or difficulty standards. |
| Legacy generator | Paper | 3-1 | English | wa1-paper-1 | 1.7 | No | 0.0 | 0.0 | 0.0 | 10.0 | 0.0 | 0.0 | 5.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder/template only and contains no actual exam questions or content. It cannot be evaluated for syllabus adherence, difficulty, or language suitability. It lacks the actual paper content required for a benchmark. |
| Legacy generator | Paper | 3-1 | English | wa1-paper-1 | 1.7 | No | 0.0 | 0.0 | 0.0 | 10.0 | 0.0 | 0.0 | 5.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder template rather than a complete exam paper. It contains no actual questions, content, or answers, making it impossible to evaluate language, syllabus adherence, or difficulty. It only provides the header and footer metadata. |
| Legacy generator | Paper | 3-1 | English | wa1-paper-1 | 1.9 | No | 0.0 | 0.0 | 0.0 | 10.0 | 0.0 | - | 5.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder template rather than a complete exam paper. It contains no actual questions, content, or answers, making it impossible to evaluate syllabus adherence, difficulty, or language suitability. It only provides the header and footer structure. |
| Legacy generator | Paper | 3-1 | English | wa2-paper-1 | 9.2 | No | 9.5 | 9.0 | 8.5 | 10.0 | 9.0 | 10.0 | 9.0 | 8.5 | 9.0 | - | - | appropriate | High quality paper. Language and difficulty are well-calibrated for P3. Note: Answer key for Q24 contains a logic error (suggests 'His' but notes it should be 'Her'), which contradicts the text context. Format is professional. |
| Legacy generator | Paper | 3-1 | English | wa2-paper-1 | 1.7 | No | 0.0 | 0.0 | 0.0 | 10.0 | 0.0 | 0.0 | 5.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder template rather than a complete exam paper. It contains no actual questions, content, or answers, making it impossible to evaluate syllabus adherence, difficulty, or language suitability. It only provides header information. |
| Legacy generator | Paper | 3-1 | English | wa2-paper-1 | 1.3 | No | 0.0 | 0.0 | 0.0 | 10.0 | 0.0 | 0.0 | 2.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is an empty template containing no actual questions or content. It lacks the exam questions, marks per question, and the answer key required for a functional paper. |
| Legacy generator | Paper | 3-1 | English | wa2-paper-1 | 1.9 | No | 0.0 | 0.0 | 0.0 | 10.0 | 0.0 | - | 5.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder template rather than a complete exam paper. It contains no actual questions, content, or answers, making it impossible to evaluate syllabus adherence, difficulty, or language suitability. It only provides the header and footer structure. |
| Legacy generator | Paper | 3-1 | English | wa2-paper-1 | 2.8 | No | 0.0 | 0.0 | 0.0 | 10.0 | 0.0 | 10.0 | 5.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder template only. It contains no actual questions, content, or answers to evaluate for syllabus adherence, difficulty, or language suitability. It only provides the header and footer structure. |
| Legacy generator | Paper | 3-1 | English | wa3-paper-1 | 9.2 | No | 9.5 | 9.0 | 8.5 | 10.0 | 9.0 | 10.0 | 9.0 | 8.5 | 9.0 | - | - | appropriate | High quality paper. Grammar and vocabulary levels are well-aligned with P3 MOE syllabus. Note: Answer key for Q24 contains a gender error (His vs Her) which should be corrected in the question or key. Comprehension questions are standard for this level. |
| Legacy generator | Paper | 3-1 | English | wa3-paper-1 | 0.0 | Yes | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | - | - | too easy | The artifact is a placeholder template containing no actual questions, content, or answers. It fails all pedagogical and examination criteria as it lacks the substance of a paper. |
| Legacy generator | Paper | 3-1 | English | wa3-paper-1 | 1.9 | No | 0.0 | 0.0 | 0.0 | 10.0 | 0.0 | - | 5.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder template rather than a complete exam paper. It contains no actual questions, content, or answers, making it impossible to evaluate syllabus adherence, difficulty, or language suitability. It only provides the header and footer structure. |
| Legacy generator | Paper | 3-1 | English | wa3-paper-1 | 1.7 | No | 0.0 | 0.0 | 0.0 | 10.0 | 0.0 | 0.0 | 5.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder template containing no actual questions or content. It lacks the exam questions, marks per question, and the answer key required for a functional paper. |
| Legacy generator | Paper | 3-1 | English | wa3-paper-1 | 1.9 | No | 0.0 | 0.0 | 0.0 | 10.0 | 0.0 | - | 5.0 | 0.0 | 0.0 | - | - | not applicable | The artifact is a placeholder template rather than a complete exam paper. It contains no actual questions, content, or answers, making it impossible to evaluate language, syllabus adherence, or difficulty. It only provides the header and footer structure. |
| Legacy generator | Paper | 3-1 | Mathematics | sa1-paper-1 | 9.4 | Yes | 10.0 | 10.0 | 9.5 | 9.0 | 10.0 | 8.5 | 10.0 | 9.0 | 9.0 | - | - | appropriate | High quality paper. Adheres well to P3 syllabus. Major issue: several questions (20, 23, 35, 37) rely on diagrams that are only described in text placeholders rather than being rendered. Notation is mostly good but uses standard text for some math instead of full LaTeX. Answer key is excellent with clear working. |
| Legacy generator | Paper | 3-1 | Mathematics | sa1-paper-2 | 8.7 | Yes | 10.0 | 9.0 | 8.5 | 7.0 | 9.0 | 8.0 | 9.5 | 8.5 | 9.0 | - | - | appropriate | Major issue: The answer key does not match the question paper. The questions in Section A and B of the paper are completely different from the questions and answers provided in the key. Placeholder text for images is present but needs actual diagrams. Notation is generally good but inconsistent in some math expressions. |
| Legacy generator | Paper | 3-1 | Mathematics | sa1-paper-3 | 9.3 | Yes | 10.0 | 9.5 | 9.0 | 10.0 | 9.5 | 8.0 | 10.0 | 9.0 | 9.0 | - | - | appropriate | High quality paper. Missing visual diagrams for symmetry (Q33), clock (Q36), and bar charts (Q22). Notation is mostly plain text rather than full LaTeX for fractions and units. Syllabus coverage is excellent. |
| Legacy generator | Paper | 3-1 | Mathematics | sa1-paper-4 | 6.0 | Yes | 9.0 | 6.0 | 7.0 | 8.0 | 2.0 | 5.0 | 9.0 | 3.0 | 5.0 | - | - | uneven | Critical failure: The answer key is completely hallucinated and does not match the questions in the paper. For example, Q1 asks about digit 8 in 2816, but the answer key discusses digit 9 in a different number. Q2, Q3, Q4, etc., all have answers that correspond to entirely different questions. Additionally, many questions require diagrams (clocks, shapes, bar charts) which are missing, and the difficulty is uneven, mixing P3 concepts with P4/P5 level arithmetic and logic. |
| Legacy generator | Paper | 3-1 | Mathematics | sa1-paper-5 | 9.2 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 7.0 | 10.0 | 9.0 | 9.0 | - | - | appropriate | High quality paper. Missing visual diagrams for geometry (symmetry, clock, rectangle) and pictograms. Notation is mostly plain text rather than LaTeX. Answer key is clear and provides working steps. |
| Legacy generator | Paper | 3-1 | Mathematics | sa2-paper-1 | 9.3 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 7.0 | 10.0 | 9.5 | 9.0 | - | - | appropriate | High quality paper. Syllabus coverage is excellent. Major issue: missing diagrams for geometry (angles/rectangles) and clock questions. Notation uses plain text instead of LaTeX for fractions and units. Answer key is very helpful with clear steps. |
| Legacy generator | Paper | 3-1 | Mathematics | sa2-paper-2 | 9.4 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 9.5 | 8.0 | 10.0 | 9.5 | 9.0 | - | - | appropriate | High quality paper. Adheres well to P3 syllabus. Missing visual diagrams for geometry (angles/shapes) and the pictograph/bar graph (though text-based approximations are provided). Notation is mostly clean but could use more formal LaTeX for fractions and units. Step-by-step answers are excellent. |
| Legacy generator | Paper | 3-1 | Mathematics | sa2-paper-3 | 9.2 | No | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 7.0 | 10.0 | 9.0 | 9.0 | - | - | appropriate | High quality paper. Note: Question 3 in the answer key contains a logical error/self-correction loop regarding rounding. Notation for fractions and units is mostly plain text rather than LaTeX, which is acceptable for P3 but less professional. Bar graph and pictograph use ASCII/Markdown which is functional. |
| Legacy generator | Paper | 3-1 | Mathematics | sa2-paper-4 | 9.5 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 8.0 | 10.0 | 9.5 | 10.0 | - | - | appropriate | High quality paper. Adheres well to P3 syllabus. Missing visual diagrams for geometry and bar graph questions (though ASCII/Markdown approximations are provided). Notation is mostly clean but could use more formal LaTeX for fractions and units. |
| Legacy generator | Paper | 3-1 | Mathematics | sa2-paper-5 | 9.4 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 8.0 | 10.0 | 9.5 | 9.0 | - | - | appropriate | High quality paper. Syllabus coverage is excellent. Missing diagrams for Question 18 (angle) and Question 50 (bar graph uses ASCII instead of actual image). Notation is mostly good but could use more formal LaTeX for fractions and units. Answer key is very thorough with step-by-step working. |
| Legacy generator | Paper | 3-1 | Mathematics | wa1-paper-1 | 9.7 | Yes | 10.0 | 10.0 | 9.5 | 9.0 | 10.0 | 9.0 | 10.0 | 9.5 | 10.0 | - | - | appropriate | High quality paper. Adheres well to P3 syllabus and MOE exam structure. Includes clear marking schemes and step-by-step solutions. Note: Several questions (5, 15, 19, 21, 24) rely on diagrams that are currently represented by text placeholders. |
| Legacy generator | Paper | 3-1 | Mathematics | wa1-paper-2 | 9.4 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 8.0 | 10.0 | 9.0 | 10.0 | - | - | appropriate | High quality paper. Adheres well to P3 syllabus and MOE exam structure. Note: Several questions (5, 19, 21, 24) rely on diagrams that are currently text placeholders. LaTeX usage is generally good but inconsistent in Section A (some math uses $ symbols, others do not). |
| Legacy generator | Paper | 3-1 | Mathematics | wa1-paper-3 | 9.7 | Yes | 10.0 | 10.0 | 9.5 | 10.0 | 9.0 | 9.0 | 10.0 | 9.5 | 10.0 | - | - | appropriate | High quality paper. Adheres well to P3 syllabus (fractions, money, measurement, area/perimeter). Missing images are clearly marked with descriptive placeholders. Answer key provides good step-by-step working. Minor note: LaTeX usage is inconsistent (some math uses $ symbols, some does not), but overall very clean. |
| Legacy generator | Paper | 3-1 | Mathematics | wa1-paper-4 | 9.6 | Yes | 10.0 | 10.0 | 9.5 | 10.0 | 9.0 | 8.5 | 10.0 | 9.0 | 10.0 | - | - | appropriate | High quality paper. Follows MOE P3 syllabus well. Missing images are clearly marked with descriptive placeholders. Notation is mostly good, though some math symbols could use more consistent LaTeX. Answer key provides clear working steps. |
| Legacy generator | Paper | 3-1 | Mathematics | wa1-paper-5 | 9.7 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 9.5 | 9.0 | 10.0 | 9.5 | 10.0 | - | - | appropriate | High quality paper. Follows MOE P3 syllabus well. Includes necessary sections (MCQ, Short Answer, Word Problems). Note: Several questions (5, 19, 21, 24) rely on diagrams that are currently text placeholders. Answer key provides good step-by-step working. |
| Legacy generator | Paper | 3-1 | Mathematics | wa2-paper-1 | 9.4 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 9.5 | 8.0 | 9.5 | 9.0 | 10.0 | - | - | appropriate | High quality paper. Syllabus coverage is excellent for P3 WA2. Missing images for Question 10 (bar graph) and Question 20 (rectangle diagram) which are standard in Singapore math papers. Notation is mostly plain text; using LaTeX for fractions and units would improve professional look. |
| Legacy generator | Paper | 3-1 | Mathematics | wa2-paper-2 | 8.9 | Yes | 10.0 | 9.0 | 8.5 | 9.0 | 10.0 | 8.0 | 10.0 | 7.0 | 9.0 | - | - | uneven | Missing diagrams for bar graph (Q9) and bar graph drawing space (Q20). Syllabus adherence is good, but Q11 (Mean) is technically a P4/P5 concept in the MOE syllabus, making the difficulty uneven. Notation is mostly plain text rather than full LaTeX for fractions. |
| Legacy generator | Paper | 3-1 | Mathematics | wa2-paper-3 | 9.7 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 10.0 | - | - | appropriate | High quality paper. Major issue: Question 9 and 18 refer to a bar graph that is not visually present, and Question 10 refers to a pictograph that is missing. Notation is generally good, though some fractions in Section A could use LaTeX for consistency. Answer key is excellent with clear step-by-step working. |
| Legacy generator | Paper | 3-1 | Mathematics | wa2-paper-4 | 9.7 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 10.0 | - | - | appropriate | High quality paper. Follows MOE P3 syllabus well. Note: Question 9 and 10 describe graphs/pictographs but lack the actual visual assets. LaTeX usage is generally good but some fractions use plain text slashes instead of formal notation. |
| Legacy generator | Paper | 3-1 | Mathematics | wa2-paper-5 | 9.7 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 10.0 | 9.5 | 10.0 | - | - | appropriate | High quality paper. Adheres well to P3 syllabus. Missing images for the bar graph (Q18) and pictograph (Q10) which are required for these question types. Notation is generally good, though some fractions could use LaTeX for better rendering. |
| Legacy generator | Paper | 3-1 | Mathematics | wa3-paper-1 | 9.2 | Yes | 10.0 | 9.5 | 9.0 | 10.0 | 9.0 | 8.0 | 10.0 | 8.5 | 9.0 | - | - | appropriate | High quality paper. Missing visual diagrams for angles, clock hands, and the bar chart grid. Notation is mostly good but uses standard text for some math instead of full LaTeX. Difficulty is well-aligned with P3 WA3 expectations. |
| Legacy generator | Paper | 3-1 | Mathematics | wa3-paper-2 | 9.6 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 10.0 | 8.0 | 10.0 | 9.0 | 10.0 | - | - | appropriate | High quality paper. Missing visual diagrams for the clock face (Q21), the bar chart grid (Q23), and the pizza (Q24). Notation is generally good, though some math symbols could use more consistent LaTeX formatting. Difficulty is well-calibrated for P3 WA3. |
| Legacy generator | Paper | 3-1 | Mathematics | wa3-paper-3 | 9.6 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 10.0 | 10.0 | 8.5 | 10.0 | - | - | appropriate | High quality paper. Adheres well to P3 syllabus. Missing diagrams for angles and bar graphs which are essential for this level. Difficulty is appropriate but Section B is slightly repetitive on basic multiplication. |
| Legacy generator | Paper | 3-1 | Mathematics | wa3-paper-4 | 9.1 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 8.0 | 10.0 | 6.0 | 10.0 | - | - | uneven | Paper lacks visual diagrams for geometry and data questions, which are essential for P3. Difficulty is uneven: Section A and B are very basic recall, while Section C contains logic errors (Question 23 has no valid answer based on the table, and Question 22 is a trick question where the answer is 0). Notation is generally good but uses standard text for some math symbols. |
| Legacy generator | Paper | 3-1 | Mathematics | wa3-paper-5 | 9.7 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 10.0 | - | - | appropriate | High quality paper. Follows MOE P3 syllabus well. Missing diagrams for angles and bar graphs which are essential for this level. Notation is generally good, though some currency and fraction formatting could use more consistent LaTeX. |
| Legacy generator | Paper | 3-1 | Science | sa1-paper-1 | 8.9 | Yes | 9.5 | 9.0 | 8.5 | 9.0 | 8.5 | 10.0 | 9.0 | 8.5 | 8.0 | - | - | appropriate | The paper follows the P3 Science syllabus well. However, it relies heavily on ASCII/text-based diagrams (e.g., plant parts, life cycles) which are poor substitutes for actual exam diagrams. The total marks (100) and duration (75 mins) are slightly high for P3 compared to standard school papers, but the difficulty level is appropriate. |
| Legacy generator | Paper | 3-1 | Science | sa1-paper-2 | 8.9 | Yes | 9.5 | 9.0 | 8.5 | 9.0 | 9.0 | 10.0 | 8.5 | 8.5 | 8.0 | - | - | appropriate | The paper is well-structured and aligns closely with the P3 Science syllabus. However, several questions (e.g., Q27, Q29, Q34) rely on diagrams or visual data that are represented only by text-based placeholders or ASCII art, which is insufficient for a real Science exam. The marking scheme is excellent and provides clear breakdown of marks. |
| Legacy generator | Paper | 3-1 | Science | sa1-paper-3 | 9.4 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 10.0 | 9.0 | 9.0 | 9.0 | - | - | appropriate | High quality paper. Content aligns perfectly with P3 Science syllabus (Diversity, Cycles, Interactions). Note: Several questions (26, 28, 30, 31, 32, 34, 35, 36) rely on diagrams or tables that are represented via text/markdown rather than actual images, which is a limitation for a Science paper. Answer key is excellent with clear mark allocation. |
| Legacy generator | Paper | 3-1 | Science | sa1-paper-4 | 8.9 | Yes | 9.5 | 9.0 | 8.5 | 9.0 | 9.0 | 10.0 | 8.5 | 8.5 | 8.0 | - | - | appropriate | The paper is well-structured and aligns with P3 Science syllabus themes. However, several questions (28, 29, 30, 34) rely on diagrams or life cycle stages that are represented by text-based placeholders rather than actual images, which is a major issue for a Science paper. The answer key is excellent, providing clear marking schemes. The timeframe is slightly tight for the volume of short-answer questions provided. |
| Legacy generator | Paper | 3-1 | Science | sa1-paper-5 | 9.1 | Yes | 9.5 | 9.0 | 8.5 | 9.0 | 8.5 | 10.0 | 9.0 | 9.0 | 9.0 | - | - | appropriate | The paper is high quality and aligns well with the P3 Science syllabus. It uses appropriate terminology. Major issue: several questions (e.g., Q28, Q34, Q36) rely on diagrams or experimental setups that are described in text but lack actual visual diagrams, which are essential for Science papers. The answer key is detailed and provides marking schemes. |
| Legacy generator | Paper | 3-1 | Science | sa2-paper-1 | 9.2 | Yes | 10.0 | 9.5 | 8.5 | 9.0 | 9.0 | 10.0 | 9.0 | 9.0 | 9.0 | - | - | appropriate | High quality paper. Content aligns well with P3 Science syllabus. Note: Several questions (10, 21, 28, 30, 34, 37) rely on diagrams that are represented by ASCII/text placeholders rather than actual images, which is a major issue for a real exam. The answer key is excellent and provides clear marking schemes. |
| Legacy generator | Paper | 3-1 | Science | sa2-paper-2 | 8.9 | Yes | 9.5 | 9.0 | 8.5 | 9.0 | 8.5 | 10.0 | 8.0 | 9.0 | 8.5 | - | - | appropriate | High quality paper. Content aligns well with P3 syllabus. Major issue: several questions (e.g., Q8, Q21, Q30, Q34) rely on diagrams that are represented by text-based ASCII art or placeholders rather than actual images, which is not suitable for a formal exam. The paper is truncated at the end. |
| Legacy generator | Paper | 3-1 | Science | sa2-paper-3 | 8.9 | Yes | 9.5 | 9.0 | 8.5 | 9.0 | 9.0 | 10.0 | 8.5 | 8.5 | 8.0 | - | - | appropriate | High quality paper. Language is perfect for P3. Syllabus coverage is excellent. Major issue: several questions (e.g., Q2, Q21, Q28, Q29, Q30) rely on diagrams/flowcharts that are represented by ASCII/text-based placeholders rather than actual images. While the text descriptions are clear, a real exam would require proper diagrams. Marks and timing are generally well-structured. |
| Legacy generator | Paper | 3-1 | Science | sa2-paper-4 | 9.1 | Yes | 9.5 | 9.0 | 8.5 | 9.0 | 9.0 | 10.0 | 9.0 | 8.5 | 9.0 | - | - | appropriate | The paper is well-structured and aligns closely with the P3 Science syllabus. However, several questions (e.g., Q28, Q30, Q35) rely on diagrams that are represented only by text-based ASCII/markdown placeholders rather than actual images, which is a significant drawback for a Science paper. The difficulty is appropriate for the level. |
| Legacy generator | Paper | 3-1 | Science | sa2-paper-5 | 8.4 | Yes | 9.5 | 8.5 | 7.0 | 9.0 | 8.5 | 10.0 | 7.5 | 8.0 | 8.0 | - | - | appropriate | The paper includes topics slightly beyond the P3 syllabus (e.g., photosynthesis details, human body systems, and viruses) which are typically P4-P6. Significant missing images for diagrams in Section B (plant diagram, skeleton, etc.) and Section C. Format is generally good but lacks the standard MOE Section B/C structure for P3. Answer key is high quality with clear explanations. |
| Legacy generator | Paper | 3-1 | Science | wa1-paper-1 | 9.7 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 10.0 | - | - | appropriate | High quality paper. Adheres well to P3 Science syllabus themes (Diversity, Cycles, Materials). Format is professional with clear marks and instructions. Note: Question 18 relies on a diagram that is represented by text placeholders; actual images are missing. |
| Legacy generator | Paper | 3-1 | Science | wa1-paper-2 | 9.7 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 10.0 | - | - | appropriate | High quality paper. Adheres well to P3 Science syllabus. Note: Several questions (e.g., Q12, Q15, Q18) rely on text-based descriptions of diagrams or lists that would typically be visual in a real exam; missing actual image assets/placeholders for life cycles and classification tasks. |
| Legacy generator | Paper | 3-1 | Science | wa1-paper-3 | 9.4 | Yes | 9.5 | 10.0 | 8.5 | 10.0 | 9.0 | 10.0 | 9.0 | 9.0 | 9.5 | - | - | appropriate | High quality paper. Content aligns perfectly with P3 Science syllabus (Diversity, Cycles, Materials). Major issue: several questions (13, 16, 18) rely on visual data or diagrams that are represented only by text placeholders, making them impossible to solve as intended without the actual images. Marks and timing are realistic. |
| Legacy generator | Paper | 3-1 | Science | wa1-paper-4 | 9.2 | Yes | 9.5 | 9.0 | 8.5 | 10.0 | 8.5 | 10.0 | 9.5 | 9.0 | 9.0 | - | - | appropriate | High quality paper. Language is perfect for P3. Syllabus coverage is accurate. Major issue: several questions (15, 16, 18) rely on visual diagrams or lists that are represented by text placeholders but lack the actual diagrams required for a Science paper. Answer key is excellent and provides clear marking schemes. |
| Legacy generator | Paper | 3-1 | Science | wa1-paper-5 | 9.7 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 10.0 | 10.0 | 9.0 | 10.0 | - | - | appropriate | High quality paper. Adheres well to P3 Science syllabus (Diversity, Cycles, Materials). Note: Question 15 requires a diagram for 'Draw arrows' which is missing. Answer key is excellent with clear marking schemes. |
| Legacy generator | Paper | 3-1 | Science | wa2-paper-1 | 8.6 | Yes | 9.5 | 8.5 | 7.0 | 9.0 | 8.0 | 10.0 | 8.5 | 7.5 | 9.0 | - | - | appropriate | The paper relies heavily on text-based diagrams (ASCII/Markdown) which are poor substitutes for actual Science diagrams. While the content is syllabus-aligned, the lack of real images for life cycles and human body parts is a major issue for a P3 Science paper. Difficulty is appropriate for WA2. |
| Legacy generator | Paper | 3-1 | Science | wa2-paper-2 | 8.9 | Yes | 10.0 | 8.5 | 7.0 | 10.0 | 9.0 | 10.0 | 9.0 | 7.0 | 10.0 | - | - | too easy | The paper relies heavily on low-order recall questions (e.g., identifying sense organs) which is slightly below the expected application/inference level for P3 Science. Significant issue: several questions (16, 18, 24) use ASCII/text-based diagrams instead of actual images, which is not suitable for a formal science paper. Syllabus adherence is good, though it includes human body systems (heart/lungs) which are often secondary to the core P3 themes of Diversity, Cycles, and Interactions. |
| Legacy generator | Paper | 3-1 | Science | wa2-paper-3 | 9.2 | Yes | 10.0 | 9.0 | 8.0 | 10.0 | 9.0 | 10.0 | 9.0 | 8.5 | 9.0 | - | - | appropriate | The paper is well-structured and follows the P3 syllabus closely. However, it relies heavily on text-based placeholders (e.g., [Dragonfly], [Flower]) instead of actual diagrams, which is a significant drawback for a Science paper. The difficulty is appropriate for a Weighted Assessment (WA2). |
| Legacy generator | Paper | 3-1 | Science | wa2-paper-4 | 8.6 | Yes | 9.5 | 8.5 | 7.0 | 9.0 | 8.0 | 10.0 | 8.5 | 7.5 | 9.0 | - | - | appropriate | The paper relies heavily on ASCII/text-based diagrams (plant parts, human body, life cycles) which are poor substitutes for actual scientific diagrams required in P3 Science. While the content is syllabus-aligned, the lack of real images makes it feel like a low-fidelity draft. Difficulty is appropriate for P3, though some Section C questions are slightly more descriptive than typical exam formats. |
| Legacy generator | Paper | 3-1 | Science | wa2-paper-5 | 8.1 | Yes | 9.5 | 6.0 | 8.0 | 9.0 | 7.0 | 10.0 | 9.0 | 5.0 | 9.0 | - | - | uneven | Major syllabus misalignment: The paper includes human body systems (kidneys, heart, lungs, brain, teeth) which are P4/P5 topics, not P3. P3 syllabus focuses on Diversity, Cycles, and Magnets. Difficulty is uneven because it mixes P3 life cycles with P5 human anatomy. Missing diagrams are represented by text placeholders, which is acceptable for a draft but requires actual images for a real paper. |
| Legacy generator | Paper | 3-1 | Science | wa3-paper-1 | 9.1 | Yes | 10.0 | 9.5 | 8.5 | 10.0 | 9.0 | 10.0 | 9.0 | 7.0 | 9.0 | - | - | too easy | The paper is very easy for P3; it focuses heavily on rote recall rather than application or inquiry-based questions. It relies on ASCII art for diagrams (Question 18) which is a poor substitute for actual scientific diagrams. Missing actual images for Section B/C. Syllabus adherence is good, but the depth of thinking required is below the expected standard for Singapore Science. |
| Legacy generator | Paper | 3-1 | Science | wa3-paper-2 | 9.6 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 10.0 | 10.0 | 8.5 | 10.0 | - | - | appropriate | High quality paper. Adheres well to P3 Science syllabus. Missing diagrams for life cycles and plant parts which are standard in Science papers. Difficulty is appropriate for WA3 level. |
| Legacy generator | Paper | 3-1 | Science | wa3-paper-3 | 9.2 | Yes | 10.0 | 9.5 | 8.0 | 10.0 | 9.0 | 10.0 | 9.0 | 7.0 | 10.0 | - | - | too easy | The paper is very easy for P3; most questions are simple recall rather than application. Significant issue: several questions (e.g., Q1, Q17, Q18, Q21) imply the presence of diagrams or visual aids that are missing, making them difficult to solve as intended. Format and marking scheme are excellent. |
| Legacy generator | Paper | 3-1 | Science | wa3-paper-4 | 9.6 | No | 10.0 | 10.0 | 9.0 | 10.0 | 8.0 | 10.0 | 10.0 | 9.0 | 10.0 | - | - | appropriate | High quality paper. Content aligns perfectly with P3 Science syllabus (Diversity, Cycles, Senses). Format is professional with clear marks and timing. Answer key is detailed. Note: Section B Q17 is a matching table which is fine, but some science papers prefer visual matching. |
| Legacy generator | Paper | 3-1 | Science | wa3-paper-5 | 9.6 | Yes | 10.0 | 9.5 | 9.0 | 10.0 | 9.0 | 10.0 | 10.0 | 8.5 | 10.0 | - | - | appropriate | High quality paper. Adheres well to P3 syllabus themes (Diversity, Cycles). Note: Section B Q18 and Section C Q23 rely on visual diagrams/labels which are missing in the text-only generation. Difficulty is appropriate for a WA3 assessment. |
| Legacy generator | Parents Guide | 2-9 | Chinese | parents-guide | 9.4 | No | 9.5 | 9.0 | - | 10.0 | - | - | - | 9.0 | - | - | 9.5 | appropriate | High quality guide. Language is appropriate for parents. Content aligns well with P3 syllabus milestones (character counts and skill progression). Practical advice on reading and writing is useful. Note: Character recognition targets in the guide (1200-1500) are slightly higher than the syllabus minimum (1300-1350 cumulative), but acceptable for a support guide. |
| Legacy generator | Parents Guide | 2-9 | English | parents-guide | 10.0 | No | 10.0 | 10.0 | - | 10.0 | - | - | - | 10.0 | - | - | 10.0 | appropriate | Excellent parent guide. Highly aligned with MOE P3 English syllabus, covering grammar, vocabulary, reading, and writing. Provides practical, age-appropriate home activities and clear assessment expectations. |
| Legacy generator | Parents Guide | 2-9 | Higher Chinese | parents-guide | 9.4 | No | 9.5 | 9.0 | - | 10.0 | - | - | - | 9.0 | - | - | 9.5 | appropriate | Excellent parent guide. It provides practical language conversion examples (simple to advanced) and clear learning milestones. It aligns well with the P3 Higher Chinese syllabus, specifically regarding character recognition targets and cultural literacy. No major issues found. |
| Legacy generator | Parents Guide | 2-9 | Mathematics | parents-guide | 9.7 | No | 10.0 | 10.0 | - | 10.0 | - | - | 9.0 | 10.0 | 9.0 | - | 10.0 | appropriate | Excellent parent guide. It aligns perfectly with the MOE P3 syllabus, including specific topics like equivalent fractions, area/perimeter, and 4-digit numbers. The term-by-term breakdown and practical home support activities are highly useful for the target audience. |
| Legacy generator | Parents Guide | 2-9 | Science | parents-guide | 8.1 | No | 9.5 | 6.0 | - | 10.0 | - | - | - | 9.0 | - | - | 6.0 | appropriate | The guide is well-written for parents but fails syllabus adherence by including topics not in the P3 syllabus (Plant Systems, Human Systems) while omitting or de-emphasizing others. It introduces photosynthesis and respiratory/digestive systems which are typically P5 topics in Singapore. |
| Legacy generator | Quiz | 3-0 | Chinese | general | 8.2 | Yes | 9.5 | 9.0 | 8.5 | 10.0 | 9.0 | - | 8.0 | 7.0 | 5.0 | - | - | uneven | The content is high quality and syllabus-aligned, but the artifact is fundamentally broken for a 'Picture Composition' quiz because it uses text descriptions instead of actual images. The sheer volume of questions (13 compositions) makes the 90-minute timeframe impossible to achieve, making the difficulty/workload unevenly high for a single sitting. |
| Legacy generator | Quiz | 5-1 | Chinese | listening | 7.9 | No | 9.0 | 8.5 | 7.0 | 10.0 | 9.0 | - | 6.0 | 4.0 | 10.0 | - | - | too easy | The content is significantly below P3 level; it reads like P1 or early P2 material with very simple sentence structures and basic vocabulary. There is a mathematical inconsistency: the header says 15 marks, but the breakdown and total sum to 25 marks. The format lacks the formal structure of a standard Singapore MOE listening paper (e.g., specific instructions for the audio component). |
| Legacy generator | Quiz | 5-1 | Chinese | reading | 8.7 | No | 9.5 | 9.0 | 8.5 | 10.0 | 9.5 | - | 7.0 | 6.0 | 10.0 | - | - | too easy | Language and syllabus alignment are good for P3. However, the content is too simplistic (resembles P1/P2 level) and lacks the analytical depth expected in P3 reading comprehension. There is a major scoring discrepancy: the header says 15 marks, but the total calculated in the answer key is 25 marks. Exam format lacks specific time allocation. |
| Legacy generator | Quiz | 5-1 | Chinese | speaking | 9.1 | Yes | 9.5 | 9.0 | 8.5 | 10.0 | 9.0 | - | 8.0 | 9.0 | 10.0 | - | - | appropriate | Language and difficulty are well-aligned with P3 standards. The quiz structure (Read Aloud, Picture Description, Conversation) correctly mirrors the Singapore oral exam format. Major issue: Part B (Picture Description) relies on a text-based description of a picture rather than an actual image, which is a critical missing component for a speaking assessment. The answer key is high quality with clear marking rubrics. |
| Legacy generator | Quiz | 5-1 | Chinese | vocabulary | 9.1 | No | 10.0 | 9.0 | 8.0 | 10.0 | 9.0 | 10.0 | 7.0 | 9.0 | 10.0 | - | - | appropriate | Language and vocabulary are highly appropriate for P3. The quiz structure is logical. Major issue: The total score in the header (15) contradicts the actual total score in the marking scheme (25). The exam format lacks a duration/time limit instruction. |
| Legacy generator | Quiz | 5-1 | Chinese | writing | 8.9 | Yes | 9.5 | 9.0 | 8.5 | 10.0 | 9.0 | - | 8.0 | 8.5 | 9.0 | - | - | appropriate | The quiz is well-structured for P3 level, covering vocabulary, sentence patterns, and paragraph writing. However, Section A relies entirely on text descriptions of images (e.g., 'Picture: Sun') rather than actual images, which is a major flaw for a primary language assessment. The marking rubric is excellent and follows pedagogical standards. |
| Legacy generator | Quiz | 3-0 | English | cloze-passage | 8.2 | No | 9.0 | 8.5 | 7.0 | 10.0 | 8.0 | 10.0 | 6.0 | 5.0 | 10.0 | - | - | uneven | The quiz has significant logic issues in the answer key. Section A: 'on the morning sun' is semantically incorrect (should be 'in'). Section B: The word bank is exhausted poorly; 'prepared' is used twice (12 and 19), and 'surprised' (18) does not fit the sentence structure. The difficulty is uneven because the grammar is too simple but the vocabulary section has broken logic/mapping. |
| Legacy generator | Quiz | 3-0 | English | composition | 8.1 | Yes | 9.5 | 9.0 | 7.0 | 10.0 | - | - | 6.0 | 8.5 | 7.0 | - | - | appropriate | Language and model answers are excellent for P3. However, Task 1 is a 'Picture Composition' but the actual images are replaced by text descriptions, which is a major failure for this task type. The exam format lacks a total time allocation and a clear total marks header for the whole paper. The word count requirement (80 words) is slightly high for some P3 students but acceptable for a challenge. |
| Legacy generator | Quiz | 5-1 | English | composition | 8.6 | Yes | 9.5 | 9.0 | 7.0 | 10.0 | 9.0 | 10.0 | 6.0 | 8.0 | 9.0 | - | - | appropriate | Language and syllabus alignment are strong. However, Section C is a 'Picture Composition' but provides a text description instead of an actual image, which is a major deviation from standard P3 English formats. There is also a discrepancy in total marks: the header says 15, but the marking scheme sums to 20. |
| Legacy generator | Quiz | 3-0 | English | comprehension | 9.4 | No | 10.0 | 9.5 | 9.0 | 10.0 | 9.0 | 10.0 | 8.5 | 9.0 | 10.0 | - | - | appropriate | High quality quiz. Language is perfect for P3. Question types (MCQ, sequencing, open-ended, true/false, and inference) align well with Singapore primary standards. Answer key provides excellent textual evidence. Minor deduction for missing formal exam header (time/marks/instructions) usually found in actual papers. |
| Legacy generator | Quiz | 5-1 | English | comprehension | 8.6 | No | 9.5 | 9.0 | 7.0 | 10.0 | 8.5 | 10.0 | 6.0 | 8.0 | 9.0 | - | - | appropriate | Language is well-suited for P3. Major issue: The total marks in the header (15) contradicts the actual marks in the content (25). Question types follow standard comprehension patterns, though Section C is slightly heavy on subjective reasoning for P3. Answer key is excellent with clear justifications. |
| Legacy generator | Quiz | 3-0 | English | editing | 8.1 | No | 9.0 | 9.5 | 6.0 | 10.0 | 8.5 | - | 5.0 | 8.0 | 9.0 | - | - | appropriate | The content is syllabus-aligned for P3 editing. However, the format deviates from standard Singapore exam papers; editing sections in P3 usually involve a single passage with numbered errors rather than split sections of spelling, grammar, and punctuation. Marks assigned in the text (e.g., [9 marks] vs [10 marks]) are inconsistent. Answer key is high quality with helpful rules. |
| Legacy generator | Quiz | 5-1 | English | grammar | 9.0 | No | 10.0 | 10.0 | 7.0 | 10.0 | 10.0 | 10.0 | 5.0 | 9.0 | 10.0 | - | - | appropriate | Grammar topics align well with P3 syllabus. Major issue: The total marks in the header (15) contradicts the actual marks calculated in the marking scheme (25). Question format is slightly generic compared to standard Singaporean school papers. |
| Legacy generator | Quiz | 3-0 | English | grammar-vocabulary | 9.2 | No | 10.0 | 9.5 | 8.0 | 10.0 | 10.0 | 10.0 | 7.0 | 8.5 | 10.0 | - | - | appropriate | Content is well-aligned with P3 syllabus. Grammar and vocabulary questions are age-appropriate. Answer key provides excellent step-by-step explanations. Exam format is slightly lacking in formal elements like time duration and specific instructions for marks per question, but the structure is clear. |
| Legacy generator | Quiz | 5-1 | English | oral | 8.6 | Yes | 9.5 | 9.0 | 7.0 | 10.0 | 8.0 | 10.0 | 6.0 | 8.5 | 9.0 | - | - | appropriate | The content is linguistically appropriate for P3. However, for an Oral exam, the 'Picture Discussion' section is fundamentally broken because the actual image is missing, providing only a text description which defeats the purpose of visual stimulus. The exam format lacks a specified duration and the marking scheme for oral is better presented as a rubric rather than a point-per-question system used in written papers. |
| Legacy generator | Quiz | 3-0 | English | synthesis-transformation | 9.0 | No | 9.5 | 9.0 | 8.5 | 10.0 | 9.0 | 10.0 | 7.0 | 8.0 | 10.0 | - | - | appropriate | Language is well-suited for P3. Content aligns with the introduction of compound and complex sentences. Exam format is slightly lacking as it misses a total time duration and formal header, though marks are clearly assigned. Answer key is excellent with clear explanations. |
| Legacy generator | Quiz | 5-1 | English | vocabulary | 8.2 | No | 9.0 | 8.5 | 6.0 | 10.0 | 9.0 | 10.0 | 5.0 | 6.0 | 10.0 | - | - | too easy | The quiz is too simple for P3; many questions are P1/P2 level (e.g., synonyms for happy, opposites of brave). Major error: The header states a total score of 15, but the marking scheme and actual questions sum to 25. Question 13 (plural forms) is more of a grammar task than pure vocabulary. Format lacks standard MOE exam structure (e.g., no specific time duration provided). |
| Legacy generator | Quiz | 3-0 | Higher Chinese | general | 9.0 | No | 9.0 | 8.5 | 8.0 | 10.0 | 9.0 | 10.0 | 9.0 | 8.5 | 9.0 | - | - | appropriate | The quiz is well-structured and aligns with P3 Higher Chinese expectations. Vocabulary and idioms are appropriately challenging. The answer key includes useful scoring rubrics for open-ended questions. The reading passage is slightly short for a 30-minute timeframe but suitable for a topical drill. |
| Legacy generator | Quiz | 5-1 | Higher Chinese | listening | 8.9 | No | 9.5 | 9.0 | 8.5 | 10.0 | 9.0 | - | 8.0 | 8.5 | 9.0 | - | - | appropriate | High quality listening quiz. Language is well-suited for P3 Higher Chinese. The inclusion of the script (录音文本) is excellent for teacher use. Note: The total marks in the header (15) contradicts the actual total (25) calculated in the answer key. The format is clean and the vocabulary list is a helpful addition. |
| Legacy generator | Quiz | 5-1 | Higher Chinese | reading | 8.4 | No | 9.0 | 8.5 | 7.0 | 10.0 | 9.5 | 10.0 | 6.0 | 7.0 | 9.0 | - | - | too easy | The content is slightly too simple for Higher Chinese P3, leaning towards Standard Chinese. Major issue: The total marks in the header (15) contradicts the actual total marks (25) calculated in the answer key. The question format lacks the formal structure of MOE papers (e.g., specific instruction styles). |
| Legacy generator | Quiz | 5-1 | Higher Chinese | speaking | 9.1 | Yes | 9.5 | 9.0 | 8.5 | 10.0 | 9.0 | - | 8.5 | 8.0 | 10.0 | - | - | appropriate | The quiz structure is excellent and follows the standard oral exam format (Read Aloud, Picture Description, Conversation). However, Part B is unusable without the actual image, as it relies on a text-based description of a scene. The difficulty is appropriate for P3 Higher Chinese, though the conversation questions are slightly generic. |
| Legacy generator | Quiz | 5-1 | Higher Chinese | vocabulary | 8.3 | No | 9.0 | 8.5 | 7.0 | 10.0 | 9.0 | - | 6.0 | 8.0 | 9.0 | - | - | appropriate | The quiz content is appropriate for P3 Higher Chinese. However, there is a major discrepancy in the total marks: the header states 15 marks, but the marking scheme and actual questions sum to 25 marks. The exam format lacks a time limit and formal instructions typical of MOE papers. |
| Legacy generator | Quiz | 5-1 | Higher Chinese | writing | 8.2 | Yes | 9.0 | 8.5 | 7.0 | 10.0 | 9.0 | - | 6.0 | 7.0 | 9.0 | - | - | appropriate | The quiz is well-structured for P3 level. However, Section C (Picture Composition) is fundamentally broken because it provides a text description of a picture instead of an actual image, which is the standard for Chinese composition exams. There is also a mathematical inconsistency: the header states a total of 15 marks, but the sum of sections is 18 marks. The difficulty is appropriate for Higher Chinese P3. |
| Legacy generator | Quiz | 3-0 | Mathematics | addition-subtraction | 9.4 | No | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | 8.0 | 9.0 | 10.0 | 10.0 | - | - | appropriate | High quality quiz. Adheres well to P3 syllabus for addition/subtraction up to 4 digits. Section A uses standard MCQ format. Section B and C provide appropriate working space. Answer key includes clear step-by-step working for word problems. Minor note: could use more formal LaTeX for mathematical expressions instead of plain text/code blocks. |
| Legacy generator | Quiz | 5-1 | Mathematics | addition-subtraction | 8.9 | No | 10.0 | 10.0 | 8.0 | 10.0 | 7.0 | 10.0 | 6.0 | 9.0 | 10.0 | - | - | appropriate | Content is syllabus-accurate for P3 addition/subtraction. Major issue: The total marks in the header (15) contradicts the marking scheme and actual total (25). Answer key provides simple verification but lacks full step-by-step working for Section B and C. |
| Legacy generator | Quiz | 3-0 | Mathematics | data-analysis | 8.6 | Yes | 10.0 | 9.0 | 7.0 | 8.0 | 10.0 | 10.0 | 8.0 | 6.0 | 9.0 | - | - | uneven | Major issue: Question 13 asks for 'average', which is a Primary 5/6 concept and not in the P3 syllabus. Question 6 contains a placeholder text instead of an actual image. Difficulty is uneven due to the inclusion of advanced concepts like averages alongside basic data reading. |
| Legacy generator | Quiz | 5-1 | Mathematics | data-analysis | 8.8 | Yes | 10.0 | 10.0 | 7.0 | 10.0 | 10.0 | 10.0 | 5.0 | 8.0 | 9.0 | - | - | appropriate | The quiz content is syllabus-aligned for P3 Data Analysis. However, there is a major internal inconsistency: the header states the total score is 15, but the marking scheme and actual question values sum to 25. Additionally, while the picture graph uses emojis, a formal exam would require actual diagrams or clear placeholders for bar graphs mentioned in Section B. |
| Legacy generator | Quiz | 3-0 | Mathematics | fractions | 8.8 | Yes | 10.0 | 10.0 | 8.0 | 9.0 | 10.0 | 5.0 | 8.0 | 9.0 | 10.0 | - | - | appropriate | Content is syllabus-accurate for P3 fractions. Major issue: lack of LaTeX for fractions (uses plain text slashes) and missing diagrams for Q1. Question count in header (20) does not match actual questions (14). |
| Legacy generator | Quiz | 5-1 | Mathematics | fractions | 8.3 | Yes | 10.0 | 10.0 | 8.0 | 10.0 | 9.0 | 2.0 | 7.0 | 9.0 | 10.0 | - | - | appropriate | Question 1 requires a visual diagram of a circle which is missing. Notation is poor; fractions are written as plain text (e.g., 3/8) instead of using proper LaTeX or mathematical formatting. Total marks in header (15) contradicts the marking scheme summary (25). |
| Legacy generator | Quiz | 3-0 | Mathematics | geometry | 9.3 | Yes | 10.0 | 10.0 | 9.0 | 9.0 | 10.0 | 8.0 | 9.0 | 9.0 | 10.0 | - | - | appropriate | High quality quiz. Content aligns perfectly with P3 geometry syllabus (angles, perimeter, area, parallel/perpendicular lines). Answer key is excellent with clear step-by-step working and teaching notes. Note: Multiple questions rely on diagrams that are currently placeholders. Notation is mostly standard, though some LaTeX could be used for units like cm2. |
| Legacy generator | Quiz | 5-1 | Mathematics | geometry | 8.9 | Yes | 10.0 | 10.0 | 7.0 | 10.0 | 9.0 | 10.0 | 6.0 | 8.0 | 10.0 | - | - | appropriate | Geometry questions for P3 often require visual diagrams (e.g., for angles and shapes), which are missing here. The total marks in the header (15) do not match the actual total (25). Language and syllabus alignment are excellent. |
| Legacy generator | Quiz | 3-0 | Mathematics | measurement | 9.7 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 10.0 | 10.0 | 9.0 | 9.0 | 10.0 | - | - | appropriate | High quality quiz. Content aligns perfectly with P3 measurement syllabus. Major issue: Question 3 requires a clock image which is only provided as a text placeholder. The question count in the header (20) does not match the actual number of questions (14). |
| Legacy generator | Quiz | 5-1 | Mathematics | measurement | 9.2 | No | 10.0 | 10.0 | 8.0 | 10.0 | 10.0 | 9.0 | 7.0 | 9.0 | 10.0 | - | - | appropriate | Content aligns well with P3 measurement syllabus. Note: The total marks in the header (15) contradicts the actual total (25) and the marking scheme summary. Notation for litres (ℓ) is used instead of standard LaTeX, but is acceptable for this level. |
| Legacy generator | Quiz | 3-0 | Mathematics | money | 9.3 | No | 10.0 | 10.0 | 8.0 | 10.0 | 9.0 | 9.0 | 9.0 | 9.0 | 10.0 | - | - | appropriate | Quiz aligns well with P3 Money syllabus (decimal notation, addition/subtraction). Question count in header (20) does not match actual questions (14). Answer key provides good step-by-step working. |
| Legacy generator | Quiz | 5-1 | Mathematics | money | 8.8 | No | 10.0 | 10.0 | 7.0 | 10.0 | 9.0 | 10.0 | 5.0 | 8.0 | 10.0 | - | - | appropriate | The quiz content is syllabus-accurate for P3 Money. However, there is a major discrepancy in the total marks: the header states 15 marks, but the marking scheme and actual question values sum to 25 marks. The question structure (Section A, B, C) is good, but lacks the formal instruction/timeframe block typical of Singapore exam papers. |
| Legacy generator | Quiz | 3-0 | Mathematics | multiplication-division | 9.4 | No | 10.0 | 10.0 | 8.0 | 10.0 | 9.0 | 10.0 | 9.0 | 9.0 | 10.0 | - | - | appropriate | Content aligns well with P3 syllabus (multiplication tables, division with remainders, and word problems). Format is standard for Singapore primary school quizzes. Note: The metadata says 20 questions, but only 14 are provided in the content. |
| Legacy generator | Quiz | 5-1 | Mathematics | multiplication-division | 8.4 | No | 10.0 | 10.0 | 7.0 | 10.0 | 5.0 | 10.0 | 6.0 | 8.0 | 10.0 | - | - | appropriate | Content is syllabus-accurate for P3. Major discrepancy: the quiz header states a total score of 15, but the marking scheme and actual questions sum to 25. Answer key provides single-line solutions rather than full step-by-step working required for P3 math. Exam format lacks time duration and standard instructions. |
| Legacy generator | Quiz | 3-0 | Mathematics | whole-numbers | 8.7 | Yes | 10.0 | 10.0 | 8.0 | 9.0 | 9.0 | 5.0 | 9.0 | 8.0 | 10.0 | - | - | appropriate | The quiz is well-aligned with P3 Whole Numbers syllabus. However, the answer key contains significant logical errors: Question 3 has two correct options (2 and 4), and Question 4 contains a self-correction loop that is confusing. Question 8 uses ASCII art for a number line which should ideally be an image. LaTeX is not used for mathematical notation. |
| Legacy generator | Quiz | 5-1 | Mathematics | whole-numbers | 9.3 | No | 10.0 | 10.0 | 8.0 | 10.0 | 10.0 | 10.0 | 7.0 | 9.0 | 10.0 | - | - | appropriate | Content is highly aligned with P3 Whole Numbers syllabus. The exam paper format has a discrepancy: the header states a total score of 15, but the marking scheme and actual questions sum to 25. Questions and step-by-step solutions are clear and well-structured. |
| Legacy generator | Quiz | 5-1 | Science | diversity | 8.2 | Yes | 9.5 | 9.0 | 6.0 | 10.0 | 8.5 | 10.0 | 5.0 | 7.0 | 9.0 | - | - | appropriate | Language is suitable for P3. Syllabus coverage is good. Major issue: Total marks in header (15) does not match the actual total (25). Question template lacks standard MOE Science structure (usually Section A is MCQ and Section B is Open-Ended, but the marks/weighting here is inconsistent with typical exam papers). Missing diagrams for questions like Q12 or Q13 which would typically use visual aids in a real exam. |
| Legacy generator | Quiz | 3-0 | Science | diversity-living-nonliving | 9.3 | No | 10.0 | 9.5 | 8.0 | 10.0 | 9.0 | 10.0 | 8.5 | 9.0 | 10.0 | - | - | appropriate | High quality quiz. Content aligns well with P3 Diversity syllabus. Section A uses standard MCQ format, though Section B/C could benefit from more visual stimuli (diagrams) common in Science papers. Answer key is excellent with clear marking rubrics for open-ended questions. |
| Legacy generator | Quiz | 3-0 | Science | diversity-materials | 9.4 | Yes | 10.0 | 10.0 | 8.0 | 10.0 | 9.0 | - | 9.0 | 9.0 | 10.0 | - | - | appropriate | High quality quiz. Content aligns perfectly with P3 Science syllabus. Question 15 and 16 are excellent for application. Note: Question 15 and 16 would benefit from diagrams/images in a real exam setting. Format is consistent with Singapore primary school standards. |
| Legacy generator | Quiz | 5-1 | Science | heat | 7.3 | No | 9.0 | 4.0 | 7.0 | 10.0 | 8.0 | 10.0 | 6.0 | 3.0 | 9.0 | - | - | too easy | Major syllabus misalignment: Heat is not a topic in the P3 Science syllabus (it is a P4 topic). The content is too advanced for P3. Additionally, the total marks in the header (15) do not match the actual total (25). Questions are very basic and lack the complexity of actual Singapore primary science papers. |
| Legacy generator | Quiz | 3-0 | Science | human-system | 8.1 | Yes | 10.0 | 4.0 | 7.0 | 10.0 | 9.0 | 10.0 | 8.0 | 5.0 | 10.0 | - | - | uneven | Major syllabus misalignment: The P3 Science syllabus focuses on Diversity, Cycles, and Interactions (Magnets). Human Body Systems (Respiratory, Circulatory, Nervous, etc.) is not part of the P3 syllabus; it is typically introduced in P4 or P5. While the quiz quality is high, it is testing content outside the specified level's scope. Missing diagrams for the heart rate and sense investigations. |
| Legacy generator | Quiz | 3-0 | Science | life-cycles | 9.5 | Yes | 9.5 | 10.0 | 8.5 | 10.0 | 9.5 | 10.0 | 9.0 | 9.0 | 10.0 | - | - | appropriate | High quality quiz. Content aligns perfectly with P3 Life Cycles syllabus. Missing diagrams for life cycle stages and investigation scenarios which are standard in Science papers. Answer key is excellent with clear explanations and mark allocation. |
| Legacy generator | Quiz | 5-1 | Science | life-cycles | 7.8 | Yes | 9.0 | 9.5 | 4.0 | 10.0 | 8.5 | 10.0 | 3.0 | 7.0 | 9.0 | - | - | appropriate | The quiz lacks diagrams which are essential for P3 Science life cycle questions. The total marks in the header (15) do not match the actual total (25). Question 15 is speculative regarding the 'how long' part which is not syllabus-standard. Formatting of instructions and marks per question is inconsistent with MOE exam styles. |
| Legacy generator | Quiz | 5-1 | Science | light | 8.8 | Yes | 9.5 | 10.0 | 7.0 | 10.0 | 9.0 | 10.0 | 6.0 | 8.5 | 9.0 | - | - | appropriate | Content is syllabus-accurate for P3 Light. Major issue: The total marks in the header (15) contradicts the actual total (25). Missing diagrams for shadow and light path questions which are standard for this topic. Answer key is high quality with marking schemes. |
| Legacy generator | Quiz | 3-0 | Science | magnets | 9.6 | Yes | 10.0 | 10.0 | 8.0 | 10.0 | 10.0 | 10.0 | 9.0 | 9.0 | 10.0 | - | - | appropriate | High quality quiz. Content aligns perfectly with P3 syllabus. Note: Question 13 uses text-based diagrams which should ideally be replaced with actual images for this level. Marks and timing are well-structured. |
| Legacy generator | Quiz | 5-1 | Science | magnets | 8.4 | Yes | 9.5 | 10.0 | 6.0 | 10.0 | 9.0 | - | 5.0 | 8.5 | 9.0 | - | - | appropriate | The quiz content is syllabus-accurate for P3. However, it fails on exam format: the total marks in the header (15) do not match the actual total (25), and there are no marks assigned per question in the body. Question 15 relies on a 'diagram description' rather than an actual image, which is poor practice for Science. Answer key is high quality with clear marking schemes. |
| Legacy generator | Quiz | 5-1 | Science | materials | 8.6 | No | 9.5 | 8.5 | 7.0 | 10.0 | 9.0 | 10.0 | 6.0 | 8.0 | 9.0 | - | - | appropriate | Language is well-suited for P3. Syllabus adherence is good, though it introduces 'conductors/insulators' which is slightly advanced for the core P3 materials topic but acceptable. Major issue: The total marks in the header (15) contradicts the actual total (25). Exam format lacks specific time allocation and standard MOE sectioning style. |
| Legacy generator | Quiz | 3-0 | Science | plant-system | 9.2 | Yes | 10.0 | 9.5 | 8.0 | 10.0 | 9.0 | 10.0 | 8.5 | 9.0 | 9.0 | - | - | appropriate | Content is highly accurate to P3 syllabus. Missing diagrams for the experiment (Q12) and plant parts (Q7/Q8) which are standard in Science papers. Format is good but lacks the specific Section A/B/C layout typical of MOE papers (usually MCQ and Open-Ended). Answer key is excellent with clear explanations. |
| Legacy generator | Quiz | 5-1 | Science | systems | 7.3 | Yes | 9.5 | 4.0 | 5.0 | 10.0 | 8.0 | 10.0 | 4.0 | 6.0 | 9.0 | - | - | uneven | Major syllabus misalignment: 'Systems' is not a standalone topic in the P3 MOE Science syllabus; it is a concept applied within Diversity, Cycles, and Interactions. The quiz introduces concepts like photosynthesis and human organs which are P5 level. The exam format is inconsistent: the header says 15 marks, but the total is 25. Lack of diagrams for science questions makes it less authentic to Singapore exam standards. |
Criteria
Scores use 10.0 as best fit. Missing images are tracked as a yes/no flag.
Language suitability
Syllabus adherence
Past-paper template adherence
No weird artefacts/symbols
Step-by-step answers
Latex/notation format
Exam paper format
Difficulty appropriateness
Doable within timeframe
Cheatsheet 3-point summaries
Parent guide syllabus fit