Stage 9-1 Level View
Primary 6 PSLE Benchmark Scores
Per-level benchmark view grouped by generation model and subject. Scores are derived from the Stage 9-0 evaluator reports without changing the underlying scoring algorithm.
Showing all Primary 6 PSLE subjects. Pick a subject to recalculate the LLM scores, low-score review list, and detailed rows for that subview.
LLM Summary
Average scores grouped by the model that generated the Primary 6 PSLE content.
| Generation Model | Artifacts | Overall | Missing Images | Language | Syllabus | Answers | Notation | Timing |
|---|---|---|---|---|---|---|---|---|
| Claude Sonnet 4 | 5 | 9.2 | 2 | 9.4 | 9.0 | - | 9.7 | - |
| Legacy generator | 67 | 7.5 | 26 | 8.9 | 7.9 | 6.9 | 8.3 | 7.0 |
Subject Summary
Average scores grouped by subject inside Primary 6 PSLE.
| Subject | Artifacts | Overall | Missing Images | Language | Syllabus | Answers | Notation | Timing |
|---|---|---|---|---|---|---|---|---|
| Chinese | 7 | 7.8 | 2 | 8.6 | 7.5 | 9.1 | 10.0 | 7.8 |
| English | 7 | 8.9 | 2 | 9.4 | 9.3 | 9.2 | 10.0 | 8.2 |
| Higher Chinese | 27 | 7.1 | 2 | 8.2 | 7.1 | 6.4 | 10.0 | 6.2 |
| Mathematics | 22 | 7.2 | 14 | 9.5 | 8.2 | 5.7 | 6.7 | 7.3 |
| Science | 9 | 8.8 | 8 | 9.5 | 9.6 | 9.1 | 10.0 | 7.9 |
LLM by Content Type
Model scores split by quizzes, papers, cheatsheets, and parent guides.
| Generation Model | Type | Artifacts | Overall | Missing Images | Language | Syllabus | Answers | Notation | Timing |
|---|---|---|---|---|---|---|---|---|---|
| Claude Sonnet 4 | Cheatsheet | 5 | 9.2 | 2 | 9.4 | 9.0 | - | 9.7 | - |
| Legacy generator | Parents Guide | 5 | 9.4 | 0 | 9.6 | 9.1 | - | 10.0 | - |
| Legacy generator | Quiz | 62 | 7.3 | 26 | 8.9 | 7.8 | 6.9 | 8.3 | 7.0 |
Needs Review: Scores Below 8.0
Artifacts with overall benchmark scores below 8.0 for the current level view.
| Overall | Model | Subject | Type | Stage | Topic / Paper | Language | Syllabus | Template | Answers | Notation | Timing | Comments |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4.3 | Legacy generator | Mathematics | Quiz | 3-0 | angles-geometry | 9.0 | 4.0 | 5.0 | 1.0 | 5.0 | 4.0 | Critical failure: The answer key is a hallucinated mess. The answers provided do not correspond to the questions asked (e.g., MCQ 1 answer is b, but the correct answer is c). The model's internal monologue/corrections are included in the output, making it unusable. Geometry questions are impossible to solve without the missing diagrams. Content includes advanced polygon properties (interior/exterior angles of n-gons) not typically required at P6 level. The difficulty is uneven and the logic is broken. |
| 4.4 | Legacy generator | Higher Chinese | Quiz | 3-0 | composition | 6.0 | 3.0 | 2.0 | 0.0 | 10.0 | 3.0 | Major hallucination/mismatch: The quiz is titled 'Argumentative Essay' (议论文) but the answer key provides answers for 'Narrative Essay' (记叙文). The content is also fundamentally incorrect for PSLE Higher Chinese; PSLE composition is a long-form creative writing task, not a multiple-choice or short-answer theory quiz on essay elements. The difficulty is far too low for P6 Higher Chinese. |
| 4.4 | Legacy generator | Mathematics | Quiz | 3-0 | problem-solving-heuristics | 9.0 | 6.0 | 5.0 | 1.0 | 0.0 | 5.0 | Critical failure: The answer key is completely hallucinated and does not correspond to the questions provided. For example, Answer 1 refers to stickers and a different math problem, while Question 1 is about Lisa's savings. Answer 2 is a geometric/algebraic solution for a problem not in the quiz. The quiz itself is too easy for P6 PSLE level, and the answer key contains internal contradictions and incorrect logic. No LaTeX used for math notation. |
| 4.6 | Legacy generator | Mathematics | Quiz | 3-0 | volume | 9.0 | 2.0 | 5.0 | 2.0 | 4.0 | 5.0 | Major failure in syllabus adherence: includes cylinders, cones, spheres, and pyramids which are not in the P6 MOE syllabus. The answer key is completely hallucinated and disconnected from the questions (e.g., Q1 answer refers to a different question's values). The difficulty is inappropriate as it introduces secondary school geometry. Answer key contains internal monologue and incorrect calculations. |
| 4.9 | Legacy generator | Higher Chinese | Quiz | 3-0 | vocabulary | 6.0 | 4.0 | 2.0 | 0.0 | 10.0 | 5.0 | Major hallucination/mismatch: The question paper content (Traditional Culture, Idioms, Poetry) does not match the answer key (Vocabulary, Synonyms, Multiple Choice). The question format is essay-style/open-ended, which is not typical for PSLE Higher Chinese vocabulary sections. The difficulty is uneven; poetry analysis is too hard for a simple vocabulary quiz, while the answer key's synonym section is too easy. |
| 5.1 | Legacy generator | Higher Chinese | Quiz | 3-0 | grammar | 4.0 | 3.0 | 5.0 | 7.0 | - | 4.0 | The content is highly inappropriate for P6 Higher Chinese. It includes advanced Classical Chinese (Wenyanwen) grammar, tonal analysis (Pingze), and rhyming composition, which are secondary school topics, not PSLE. Additionally, the provided answer key does not match the generated quiz content at all, making it useless for the artifact. |
| 5.4 | Legacy generator | Higher Chinese | Quiz | 3-0 | grammar | 8.0 | 5.0 | 4.0 | 2.0 | - | 6.0 | Major misalignment: The quiz content (sentence component analysis, rhetorical devices, formal vs informal) is more aligned with Mainland China middle school grammar than Singapore PSLE Higher Chinese. The answer key is completely disconnected from the quiz questions (e.g., quiz asks for sentence analysis, answer key provides sentence type identification). Difficulty is too low for P6 Higher Chinese; it lacks the depth of comprehension and nuanced language use required for PSLE. |
| 5.5 | Legacy generator | Higher Chinese | Quiz | 3-0 | comprehension | 8.0 | 7.0 | 4.0 | 2.0 | - | 5.0 | Critical failure: The answer key is completely hallucinated and does not correspond to the provided text. The quiz uses a Classical Chinese text (Wenyanwen) which is appropriate for Higher Chinese, but the answer key discusses a text about 'Springtime'. The difficulty is uneven because the content and answers are disconnected. |
| 5.7 | Legacy generator | Mathematics | Quiz | 3-0 | area-perimeter | 9.0 | 9.0 | 5.0 | 2.0 | 4.0 | 5.0 | Critical failure: The answer key does not match the quiz questions. The quiz has 16 questions (10 MCQ, 4 Short, 2 Problem Solving), but the answer key provides answers for a completely different set of 10 questions. The answer key also contains internal monologue/thinking process and incorrect calculations. Difficulty is uneven as MCQ is very basic while Section B/C requires higher-order thinking. Missing diagrams for geometry questions. |
| 5.9 | Legacy generator | Higher Chinese | Quiz | 3-0 | composition | 9.0 | 7.0 | 4.0 | 8.0 | - | 2.0 | The artifact is a theory quiz about composition rather than a PSLE-style composition practice. PSLE Higher Chinese composition requires writing a full 1400-1500 character essay based on a prompt/picture, not answering multiple-choice or short-answer questions about literary devices. The metadata claims 20 questions in 35 minutes, but only 10 are provided. The difficulty is too low for P6 Higher Chinese as it focuses on basic definitions rather than high-level application. |
| 5.9 | Legacy generator | Higher Chinese | Quiz | 3-0 | hanyu-pinyin | 8.0 | 6.0 | 4.0 | 0.0 | 10.0 | 7.0 | Major failure in answer key alignment: the answers provided do not match the questions asked (e.g., Part 1 asks for pinyin of specific words like 辉煌, but the key provides pinyin for different words like 雄伟). The difficulty is uneven; Part 1 uses high-level vocabulary, but Part 2 and 3 are basic phonetic rules. The exam format lacks standard PSLE sectioning and marks distribution logic. |
| 5.9 | Legacy generator | Mathematics | Quiz | 3-0 | psle-revision | 10.0 | 9.0 | 8.0 | 0.0 | 2.0 | 5.0 | Critical failure: The answer key is completely hallucinated and does not match the questions provided. For example, Q1 asks for a percentage but the answer key provides a mixed fraction; Q2 asks for triangle area but the answer key provides a different calculation entirely. Notation is poor, using plain text for fractions instead of LaTeX. Difficulty is uneven as questions are basic but the answer key is nonsensical. |
| 6.0 | Legacy generator | Higher Chinese | Quiz | 3-0 | vocabulary | 6.0 | 5.0 | 4.0 | 7.0 | 10.0 | 4.0 | The content is far too academic and philosophical for P6 level; it reads like a secondary school or university entrance exam for Chinese literature. The question types (explaining philosophical concepts like 'Tian Ren He Yi') are not aligned with PSLE Higher Chinese formats which focus on comprehension and language use. The total marks and time allocation are inconsistent between the quiz and the answer key provided. |
| 6.0 | Legacy generator | Mathematics | Quiz | 3-0 | angles-geometry | 9.0 | 8.0 | 6.0 | 3.0 | 5.0 | 8.0 | Major issues: 1. Multiple questions (12, 13, 15) require diagrams that are missing. 2. The answer key contains 'thinking aloud' artifacts where the model realizes its own errors and provides incorrect/unstable solutions. 3. Question 14 has a mathematical mismatch between the question and the provided answer. 4. Notation is inconsistent; some geometry terms are used without proper LaTeX. 5. Difficulty is uneven due to the broken logic in the answer key. |
| 6.1 | Legacy generator | Mathematics | Quiz | 5-1 | geometry | 9.0 | 7.0 | 5.0 | 9.0 | 2.0 | 7.0 | The quiz is significantly below PSLE standard; it focuses on basic properties rather than the complex composite figure and circle geometry required by the P6 syllabus. Question 14 uses a formula (n-2)*180 not taught in P6. Lack of LaTeX for math notation and reliance on ASCII art for geometry is poor. Total marks in marking scheme (55) do not match the header (50). |
| 6.2 | Legacy generator | Chinese | Quiz | 5-1 | listening | 6.0 | 5.0 | 4.0 | 9.0 | - | 9.0 | The content is significantly below P6 PSLE standards; the vocabulary and sentence structures are more aligned with P2 or P3 levels. The listening passages lack the complexity, inference requirements, and critical analysis expected in the P6 syllabus. Exam format is missing specific time allocations and standard PSLE instruction phrasing. |
| 6.4 | Legacy generator | Mathematics | Quiz | 3-0 | volume | 9.0 | 4.0 | 7.0 | 5.0 | 6.0 | 8.0 | Major syllabus violation: P6 Singapore syllabus only covers volume of cubes and cuboids; questions on cylinders, cones, spheres, and pyramids are not in the P6 Standard syllabus. The answer key contains internal monologue/recalculations which is unprofessional. Difficulty is uneven due to out-of-syllabus content. |
| 6.8 | Legacy generator | Higher Chinese | Quiz | 3-0 | comprehension | 8.0 | 7.0 | 5.0 | 9.0 | - | 5.0 | The quiz is excessively difficult for P6 Higher Chinese; it includes advanced classical Chinese grammar (sentence type identification) and creative writing in classical style which exceeds PSLE standards. The answer key provided is completely mismatched, referring to a text about 'Spring' instead of the '弈秋' text used in the quiz. The paper format is also inconsistent with PSLE weighting and structure. |
| 6.9 | Legacy generator | Chinese | Quiz | 5-1 | vocabulary | 8.0 | 7.0 | 5.0 | 9.0 | - | 8.0 | The content is significantly below P6 PSLE standards; it resembles P3/P4 level vocabulary. The exam format is inconsistent, specifically the total marks calculation (50 vs 74) is broken. Question types like 'fill in the blanks' for basic classifiers and simple sentence structures are too elementary for a P6 candidate. |
| 6.9 | Legacy generator | Higher Chinese | Quiz | 3-0 | composition | 9.0 | 8.0 | 7.0 | 2.0 | 10.0 | 4.0 | Major misalignment between the quiz content and the answer key. The quiz is an argumentative writing (议论文) practice, but the answer key provides answers for a narrative writing (记叙文) quiz. The word count requirement (300-350 words) is too low for Higher Chinese P6 standards, which expect 1400-1500 characters for full compositions. The timeframe is unrealistic for two full essays plus analysis tasks. |
| 6.9 | Legacy generator | Higher Chinese | Quiz | 3-0 | grammar | 7.0 | 6.0 | 5.0 | 8.0 | - | 9.0 | The content is too basic for Higher Chinese P6; it reads more like a P4/P5 primary Chinese grammar worksheet. PSLE Higher Chinese grammar usually focuses on more nuanced vocabulary usage, idioms, and complex sentence structures rather than simple sentence transformations like 'changing affirmative to negative'. The exam format lacks the standard PSLE paper structure (e.g., Section A: Vocabulary, Section B: Grammar/Cloze). The answer key is clear but the questions lack the depth required for the Higher Chinese syllabus. |
| 7.2 | Legacy generator | Higher Chinese | Quiz | 3-0 | composition | 9.0 | 8.5 | 7.0 | 5.0 | - | 5.0 | The artifact is highly inconsistent. The main quiz asks for two long compositions (350-400 words each) plus a skills section, which is impossible to complete in 60 minutes. Furthermore, the provided answer key does not match the quiz content; the quiz asks for compositions, but the answer key provides answers to a completely different set of multiple-choice and short-answer questions. The difficulty of the writing prompts is appropriate for Higher Chinese, but the structure is logically flawed. |
| 7.2 | Legacy generator | Higher Chinese | Quiz | 3-0 | comprehension | 8.5 | 7.0 | 6.0 | 9.0 | 10.0 | 5.0 | The text is too simplistic for Higher Chinese P6; it reads like a P3/P4 level primary text. The question count (20) contradicts the actual number of questions provided (7). The exam format lacks standard PSLE components like specific mark allocations per question and proper instruction headers. Timing is unrealistic for the depth of questions vs the text length. |
| 7.2 | Legacy generator | Higher Chinese | Quiz | 5-1 | listening | 8.5 | 7.0 | 6.0 | 9.0 | - | 8.0 | The content is too simple for Higher Chinese (HCL) P6; it reads more like Standard Chinese (CL) level. HCL requires critical analysis and sophisticated inference, whereas these questions only test literal comprehension. The exam format is missing total marks (total is 39, but header says 50) and lacks formal PSLE-style instructions/timing. The answer key is well-structured with explanations. |
| 7.2 | Legacy generator | Mathematics | Quiz | 3-0 | data-analysis | 10.0 | 10.0 | 6.0 | 0.0 | 10.0 | 8.0 | Critical failure: The answer key is completely disconnected from the questions. The questions ask about ages, spinners, and football goals, but the answers discuss pets, books, and temperature. Additionally, Section A questions require visual aids (graphs/pie charts) that are missing, making them impossible to solve as written. |
| 7.3 | Legacy generator | Higher Chinese | Quiz | 3-0 | grammar | 8.5 | 7.0 | 6.0 | 9.0 | - | 7.0 | The content is more aligned with secondary school grammar (sentence component analysis, rhetorical devices) than the standard PSLE Higher Chinese format, which focuses more on comprehension and vocabulary in context. The exam format is inconsistent: the quiz header claims 75 marks, but the answer key claims 50 marks. The difficulty is uneven, mixing very basic sentence types with advanced linguistic analysis. |
| 7.3 | Legacy generator | Higher Chinese | Quiz | 5-1 | writing | 9.0 | 8.5 | 7.0 | 9.0 | - | 4.0 | The artifact is problematic for a single quiz. It asks for two separate compositions (one visual, one topical) totaling 50 marks, which is impossible to complete within a standard exam timeframe. The word count requirements (200-250 words per essay) are actually too low for Higher Chinese P6, which typically requires much longer, more sophisticated compositions (1400-1500 characters for the whole paper). The 'images' are provided only as text descriptions, which is a major missing-image issue for a visual composition task. The answer key is excellent, providing high-quality model essays and analysis. |
| 7.3 | Legacy generator | Mathematics | Quiz | 3-0 | speed-distance-time | 10.0 | 10.0 | 7.0 | 2.0 | 10.0 | 5.0 | Critical failure: The answer key does not correspond to the generated quiz. The quiz has 16 questions (10 MCQ, 4 Short, 2 Problem Solving), but the answer key provides solutions for a completely different set of 10 questions. Additionally, the answer key contains internal monologue/corrections and incorrect math in Question 7. Difficulty is uneven; Section A is too basic for P6, while Section C is appropriate. |
| 7.4 | Legacy generator | Mathematics | Quiz | 5-1 | fractions | 10.0 | 10.0 | 6.0 | 9.0 | 2.0 | 8.0 | The quiz content is syllabus-aligned and the difficulty progression is good. However, it fails significantly on technical formatting: it uses plain text slashes for fractions instead of LaTeX, which is unacceptable for P6 level. The exam format is also inconsistent; the total marks in the header (50) does not match the marking scheme total (55). Question 4 has a weird 'Both B and C' option which is rare in PSLE. |
| 7.5 | Legacy generator | Higher Chinese | Quiz | 3-0 | vocabulary | 9.0 | 8.5 | 5.0 | 7.0 | 10.0 | 8.0 | The vocabulary level is appropriate for Higher Chinese, but the question format deviates significantly from PSLE standards. PSLE vocabulary questions are typically multiple-choice (MCQ) or fill-in-the-blanks within a passage, rather than open-ended 'define and make a sentence' tasks. Section 3 (Synonyms) is too simplistic for P6 Higher Chinese. The difficulty is uneven: Section 1 is high-level, while Section 3 is very basic. |
| 7.6 | Legacy generator | Higher Chinese | Quiz | 3-0 | vocabulary | 9.0 | 8.5 | 6.0 | 7.0 | 10.0 | 6.0 | The vocabulary level is appropriate for Higher Chinese, but the exam format is non-standard for PSLE (e.g., includes open-ended definition/sentence construction which is rare in standard MCQ/Cloze papers). The total marks in the quiz (70+15) do not match the answer key (50). The timeframe of 40 minutes is likely too short for the volume of writing and definition tasks required. |
| 7.7 | Legacy generator | Mathematics | Quiz | 5-1 | addition-subtraction | 10.0 | 6.0 | 7.0 | 9.0 | 10.0 | 8.0 | The content is far too simple for Primary 6 PSLE level; it focuses on basic large number arithmetic which is a P3/P4 topic. P6 should involve multi-step problems integrating ratio, percentage, or fractions. Also, the total marks in the header (50) contradicts the marking scheme total (55). |
| 7.7 | Legacy generator | Mathematics | Quiz | 5-1 | whole-numbers | 9.0 | 10.0 | 7.0 | 6.0 | 10.0 | 8.0 | The quiz is significantly too easy for P6 PSLE; it focuses on P3-P4 level place value and rounding rather than P6 level whole number applications. Major error in MCQ Q2 where no correct option is provided. The total marks in the marking scheme (55) do not match the header (50). Answer key for Q10 is vague and mathematically inconsistent. |
| 7.8 | Legacy generator | Chinese | Quiz | 5-1 | writing | 9.0 | 8.5 | 7.0 | 9.0 | - | 5.0 | The word count requirement (150-200 words) is significantly lower than the P6 PSLE standard, which typically requires 120-150 words for lower primary but much longer/more complex compositions for P6. The content is more aligned with P4/P5 level. Missing actual images for the picture composition; only text descriptions are provided. The marking scheme and model essays are high quality and helpful. |
| 7.8 | Legacy generator | English | Quiz | 5-1 | composition | 9.0 | 8.5 | 7.0 | - | 10.0 | 5.0 | The artifact uses text descriptions instead of actual images for the picture composition, which is a major flaw for a P6 student. The word counts for continuous writing (150-200 words) are significantly lower than the PSLE standard (approx 150-300 words). The total marks (50) and structure do not match the actual PSLE Paper 1 format. The difficulty is slightly low for P6 mastery level. |
| 7.8 | Legacy generator | Higher Chinese | Quiz | 3-0 | hanyu-pinyin | 9.0 | 7.0 | 6.0 | 8.0 | 10.0 | 9.0 | The content is far too simple for Primary 6 Higher Chinese; Pinyin fundamentals are typically mastered much earlier in the primary years. While the formatting is clean and the answer key is good, the level of cognitive demand does not match the PSLE Higher Chinese syllabus which focuses on critical analysis and sophisticated expression. |
| 7.9 | Legacy generator | Chinese | Quiz | 5-1 | reading | 9.0 | 8.5 | 7.0 | 9.5 | - | 8.0 | The content is too simple for P6 PSLE level; the texts and questions resemble P3/P4 level comprehension. The exam format lacks official PSLE structure (e.g., specific marks per question in the header, lack of time allocation, and incorrect total marks calculation: 50 vs 45). Answer key is excellent with clear explanations. |
| 7.9 | Legacy generator | Mathematics | Quiz | 5-1 | measurement | 10.0 | 9.0 | 7.0 | 9.0 | 8.0 | 8.0 | The quiz is significantly below P6 PSLE standard; it focuses on P3-P4 level conversions and basic perimeter/area rather than P6 topics like circles, volume of composite solids, or complex multi-step measurement problems. The total marks in the marking scheme (55) do not match the header (50). Missing diagrams for geometry questions. |
| 7.9 | Legacy generator | Mathematics | Quiz | 5-1 | multiplication-division | 9.0 | 8.0 | 7.0 | 8.0 | 10.0 | 9.0 | The quiz is significantly below P6 PSLE standard; it focuses on basic arithmetic rather than the complex multi-step word problems or ratio/percentage integration expected at this level. Major errors in the answer key: Q3 has no correct option (remainder is 5, not in A-D) and Q4 has two correct options (C and D). The total marks in the marking scheme (55) do not match the header (50). |
| 7.9 | Legacy generator | Mathematics | Quiz | 3-0 | problem-solving-heuristics | 9.5 | 9.0 | 7.0 | 8.5 | 5.0 | 9.0 | The quiz covers relevant heuristics. However, it lacks proper LaTeX for mathematical expressions (uses plain text instead). The answer key contains internal 'thinking' artifacts (e.g., 'Wait, let me recalculate') which should be cleaned. Question 8 requires a diagram to be clear. The exam format is slightly simplified compared to actual PSLE papers. |
Content Type Summary
Average scores grouped by content type.
Detailed Benchmark Rows
Topics, quiz variants, paper versions, cheatsheets, and parent guides listed individually.
| Model | Type | Stage | Subject | Topic / Paper | Overall | Missing Images | Language | Syllabus | Template | Clean | Step Answers | Notation | Paper Format | Difficulty | Time Fit | 3-Point Summary | Parent Guide | Difficulty | Comments |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Claude Sonnet 4 | Cheatsheet | 2-7 | Chinese | cheatsheet | 8.0 | No | 9.0 | 6.0 | - | 10.0 | - | 10.0 | - | 5.0 | - | 8.0 | - | uneven | The cheatsheet includes content that is not part of the Primary 6 PSLE Chinese syllabus, specifically 'Classical Chinese' (文言文) and 'Modern Poetry Appreciation' (现代诗歌鉴赏), which are typically secondary school topics. While the three-point summaries are useful, the inclusion of irrelevant academic topics makes the difficulty and syllabus fit uneven. |
| Claude Sonnet 4 | Cheatsheet | 2-7 | English | cheatsheet | 9.6 | No | 9.5 | 10.0 | - | 10.0 | - | - | - | 9.0 | - | 9.5 | - | appropriate | Excellent cheatsheet. Highly aligned with the P6 English syllabus, covering all key components from Writing to Oral. Uses effective three-point summaries for each chapter rather than generic pointers. Language is appropriate for the level. |
| Claude Sonnet 4 | Cheatsheet | 2-7 | Higher Chinese | cheatsheet | 9.3 | No | 9.5 | 9.0 | - | 10.0 | - | 10.0 | 8.5 | 9.0 | - | 9.0 | - | appropriate | High quality cheatsheet. Excellent use of tables and structured summaries for different sections (composition, reading, oral). Content aligns well with Higher Chinese requirements, including classical Chinese (文言文) and sophisticated rhetorical analysis. The exam structure overview is accurate for PSLE context. |
| Claude Sonnet 4 | Cheatsheet | 2-7 | Mathematics | cheatsheet | 9.5 | Yes | 9.5 | 10.0 | - | 10.0 | - | 9.0 | - | 9.5 | - | 9.0 | - | appropriate | High quality cheatsheet. Excellent syllabus coverage including P6 specific topics like circle properties and algebra. Uses effective three-point summaries for most chapters. Notation is clear, though some mathematical symbols like cube roots could use formal LaTeX. Missing diagrams for geometry and model drawing sections. |
| Claude Sonnet 4 | Cheatsheet | 2-7 | Science | cheatsheet | 9.5 | Yes | 9.5 | 10.0 | - | 10.0 | - | - | - | 9.0 | - | 9.0 | - | appropriate | High quality cheatsheet. Excellent syllabus coverage including all P6 themes. Uses effective bulleted summaries rather than generic text. Note: As a cheatsheet, it lacks diagrams which are crucial for Science (e.g., energy conversion diagrams or food webs), but the text content is highly accurate for PSLE prep. |
| Legacy generator | Parents Guide | 2-9 | Chinese | parents-guide | 8.8 | No | 9.5 | 8.5 | - | 10.0 | - | - | 7.0 | 9.0 | - | - | 9.0 | appropriate | High quality guide. Syllabus adherence is strong regarding character counts and skills. Note: The exam structure section (time/marks) is slightly generic and does not perfectly mirror the exact current PSLE weightage/minutes, but serves the purpose of a parent guide well. |
| Legacy generator | Parents Guide | 2-9 | English | parents-guide | 9.4 | No | 9.5 | 9.0 | - | 10.0 | - | - | 8.5 | 10.0 | - | - | 9.5 | appropriate | High quality guide. The paper breakdown table is slightly inaccurate regarding the exact marks/sections of the current PSLE format (e.g., Section G/H/I naming conventions), but the pedagogical advice is excellent and syllabus-aligned. |
| Legacy generator | Parents Guide | 2-9 | Higher Chinese | parents-guide | 9.4 | No | 9.5 | 9.0 | - | 10.0 | - | - | - | 9.0 | - | - | 9.5 | appropriate | High quality guide. Language is professional and suitable for parents. Adheres well to Higher Chinese syllabus requirements (character counts, literary appreciation). Provides practical, actionable advice for home learning and exam preparation. |
| Legacy generator | Parents Guide | 2-9 | Mathematics | parents-guide | 10.0 | No | 10.0 | 10.0 | - | 10.0 | - | 10.0 | 10.0 | 10.0 | - | - | 10.0 | not applicable | Excellent parent guide. Highly structured, follows the MOE syllabus accurately, and provides practical, actionable advice for parents. Exam format and AL grading are correctly represented. |
| Legacy generator | Parents Guide | 2-9 | Science | parents-guide | 9.5 | No | 9.5 | 9.0 | - | 10.0 | - | - | 9.0 | 10.0 | - | - | 9.5 | appropriate | High quality guide. Language is professional yet accessible for parents. Excellent alignment with the P6 Science syllabus and PSLE format. Provides practical home support activities and a clear assessment timeline. |
| Legacy generator | Quiz | 5-1 | Chinese | listening | 6.2 | No | 6.0 | 5.0 | 4.0 | 10.0 | 9.0 | - | 4.0 | 3.0 | 9.0 | - | - | too easy | The content is significantly below P6 PSLE standards; the vocabulary and sentence structures are more aligned with P2 or P3 levels. The listening passages lack the complexity, inference requirements, and critical analysis expected in the P6 syllabus. Exam format is missing specific time allocations and standard PSLE instruction phrasing. |
| Legacy generator | Quiz | 5-1 | Chinese | reading | 7.9 | No | 9.0 | 8.5 | 7.0 | 10.0 | 9.5 | - | 6.0 | 5.0 | 8.0 | - | - | too easy | The content is too simple for P6 PSLE level; the texts and questions resemble P3/P4 level comprehension. The exam format lacks official PSLE structure (e.g., specific marks per question in the header, lack of time allocation, and incorrect total marks calculation: 50 vs 45). Answer key is excellent with clear explanations. |
| Legacy generator | Quiz | 5-1 | Chinese | speaking | 9.0 | Yes | 9.5 | 9.0 | 8.5 | 10.0 | 9.0 | - | 8.0 | 9.0 | 9.0 | - | - | appropriate | The content is highly relevant to PSLE Oral (Reading and Stimulus-based Conversation). Language is appropriate for P6. Major issue: The 'Picture Description' section relies entirely on text descriptions of images rather than actual images, which is a critical failure for an oral exam artifact. The answer key provides excellent model responses and scoring rubrics. |
| Legacy generator | Quiz | 5-1 | Chinese | vocabulary | 6.9 | No | 8.0 | 7.0 | 5.0 | 10.0 | 9.0 | - | 4.0 | 4.0 | 8.0 | - | - | too easy | The content is significantly below P6 PSLE standards; it resembles P3/P4 level vocabulary. The exam format is inconsistent, specifically the total marks calculation (50 vs 74) is broken. Question types like 'fill in the blanks' for basic classifiers and simple sentence structures are too elementary for a P6 candidate. |
| Legacy generator | Quiz | 5-1 | Chinese | writing | 7.8 | Yes | 9.0 | 8.5 | 7.0 | 10.0 | 9.0 | - | 7.5 | 6.0 | 5.0 | - | - | too easy | The word count requirement (150-200 words) is significantly lower than the P6 PSLE standard, which typically requires 120-150 words for lower primary but much longer/more complex compositions for P6. The content is more aligned with P4/P5 level. Missing actual images for the picture composition; only text descriptions are provided. The marking scheme and model essays are high quality and helpful. |
| Legacy generator | Quiz | 5-1 | English | composition | 7.8 | Yes | 9.0 | 8.5 | 7.0 | 10.0 | - | 10.0 | 6.0 | 7.0 | 5.0 | - | - | too easy | The artifact uses text descriptions instead of actual images for the picture composition, which is a major flaw for a P6 student. The word counts for continuous writing (150-200 words) are significantly lower than the PSLE standard (approx 150-300 words). The total marks (50) and structure do not match the actual PSLE Paper 1 format. The difficulty is slightly low for P6 mastery level. |
| Legacy generator | Quiz | 5-1 | English | comprehension | 8.3 | No | 9.0 | 8.5 | 7.0 | 10.0 | 9.0 | 10.0 | 6.0 | 7.5 | 8.0 | - | - | appropriate | Language and content are well-suited for P6. The marking scheme is excellent with clear breakdown. However, the exam format is slightly off: the total marks in the header (50) do not match the sum of the marking scheme (41), and the structure (Section A-E) deviates from the standard PSLE Paper 2 layout which typically separates Visual Text, Comprehension Cloze, and Comprehension Open-Ended. |
| Legacy generator | Quiz | 5-1 | English | grammar | 9.3 | No | 9.5 | 10.0 | 8.5 | 10.0 | 10.0 | 10.0 | 7.5 | 9.0 | 9.0 | - | - | appropriate | High quality grammar quiz. Content aligns well with P6 PSLE requirements (subject-verb agreement, reported speech, etc.). The marking scheme total (45) does not match the header total (50), which is a minor inconsistency. Question templates are good but slightly deviate from the exact PSLE Paper 2 Booklet A/B structure. |
| Legacy generator | Quiz | 5-1 | English | oral | 9.6 | Yes | 10.0 | 10.0 | 9.0 | 10.0 | 9.0 | - | 9.0 | 10.0 | 10.0 | - | - | appropriate | High quality oral practice. Content aligns perfectly with PSLE Stimulus-Based Conversation and Reading Aloud formats. Note: The stimulus image is described in text but the actual image is missing, which is expected for text-only generation but requires a placeholder in a real artifact. |
| Legacy generator | Quiz | 5-1 | English | vocabulary | 8.6 | No | 9.5 | 9.0 | 7.0 | 10.0 | 9.0 | 10.0 | 6.0 | 8.0 | 9.0 | - | - | appropriate | Vocabulary level is appropriate for P6. Question types (synonyms, antonyms, word formation) align well with PSLE prep. However, the exam paper format is slightly off: it lacks specific instructions for each section, does not include a total time duration, and the marks/weightage distribution is a bit artificial compared to actual Paper 2 booklets. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | composition | 5.9 | No | 9.0 | 7.0 | 4.0 | 10.0 | 8.0 | - | 3.0 | 4.0 | 2.0 | - | - | too easy | The artifact is a theory quiz about composition rather than a PSLE-style composition practice. PSLE Higher Chinese composition requires writing a full 1400-1500 character essay based on a prompt/picture, not answering multiple-choice or short-answer questions about literary devices. The metadata claims 20 questions in 35 minutes, but only 10 are provided. The difficulty is too low for P6 Higher Chinese as it focuses on basic definitions rather than high-level application. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | composition | 4.4 | No | 6.0 | 3.0 | 2.0 | 10.0 | 0.0 | 10.0 | 4.0 | 2.0 | 3.0 | - | - | too easy | Major hallucination/mismatch: The quiz is titled 'Argumentative Essay' (议论文) but the answer key provides answers for 'Narrative Essay' (记叙文). The content is also fundamentally incorrect for PSLE Higher Chinese; PSLE composition is a long-form creative writing task, not a multiple-choice or short-answer theory quiz on essay elements. The difficulty is far too low for P6 Higher Chinese. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | composition | 7.2 | No | 9.0 | 8.5 | 7.0 | 10.0 | 5.0 | - | 7.5 | 6.0 | 5.0 | - | - | uneven | The artifact is highly inconsistent. The main quiz asks for two long compositions (350-400 words each) plus a skills section, which is impossible to complete in 60 minutes. Furthermore, the provided answer key does not match the quiz content; the quiz asks for compositions, but the answer key provides answers to a completely different set of multiple-choice and short-answer questions. The difficulty of the writing prompts is appropriate for Higher Chinese, but the structure is logically flawed. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | composition | 6.9 | No | 9.0 | 8.0 | 7.0 | 10.0 | 2.0 | 10.0 | 7.0 | 5.0 | 4.0 | - | - | uneven | Major misalignment between the quiz content and the answer key. The quiz is an argumentative writing (议论文) practice, but the answer key provides answers for a narrative writing (记叙文) quiz. The word count requirement (300-350 words) is too low for Higher Chinese P6 standards, which expect 1400-1500 characters for full compositions. The timeframe is unrealistic for two full essays plus analysis tasks. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | comprehension | 7.2 | No | 8.5 | 7.0 | 6.0 | 10.0 | 9.0 | 10.0 | 5.0 | 4.0 | 5.0 | - | - | too easy | The text is too simplistic for Higher Chinese P6; it reads like a P3/P4 level primary text. The question count (20) contradicts the actual number of questions provided (7). The exam format lacks standard PSLE components like specific mark allocations per question and proper instruction headers. Timing is unrealistic for the depth of questions vs the text length. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | comprehension | 5.5 | No | 8.0 | 7.0 | 4.0 | 10.0 | 2.0 | - | 5.0 | 3.0 | 5.0 | - | - | uneven | Critical failure: The answer key is completely hallucinated and does not correspond to the provided text. The quiz uses a Classical Chinese text (Wenyanwen) which is appropriate for Higher Chinese, but the answer key discusses a text about 'Springtime'. The difficulty is uneven because the content and answers are disconnected. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | comprehension | 8.2 | No | 9.0 | 8.5 | 7.0 | 10.0 | 8.0 | 10.0 | 6.0 | 8.0 | 7.0 | - | - | appropriate | Language and literary depth are well-suited for Higher Chinese. However, the exam format is inconsistent: the main quiz lists a total of 90 marks, but the provided answer key only accounts for 50 marks. The question structure deviates from standard PSLE paper formats which usually separate comprehension into specific sections with distinct mark allocations. Timeframe is slightly tight for the volume of literary analysis requested. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | comprehension | 6.8 | No | 8.0 | 7.0 | 5.0 | 10.0 | 9.0 | - | 6.0 | 4.0 | 5.0 | - | - | too hard | The quiz is excessively difficult for P6 Higher Chinese; it includes advanced classical Chinese grammar (sentence type identification) and creative writing in classical style which exceeds PSLE standards. The answer key provided is completely mismatched, referring to a text about 'Spring' instead of the '弈秋' text used in the quiz. The paper format is also inconsistent with PSLE weighting and structure. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | grammar | 6.9 | No | 7.0 | 6.0 | 5.0 | 10.0 | 8.0 | - | 6.0 | 4.0 | 9.0 | - | - | too easy | The content is too basic for Higher Chinese P6; it reads more like a P4/P5 primary Chinese grammar worksheet. PSLE Higher Chinese grammar usually focuses on more nuanced vocabulary usage, idioms, and complex sentence structures rather than simple sentence transformations like 'changing affirmative to negative'. The exam format lacks the standard PSLE paper structure (e.g., Section A: Vocabulary, Section B: Grammar/Cloze). The answer key is clear but the questions lack the depth required for the Higher Chinese syllabus. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | grammar | 5.4 | No | 8.0 | 5.0 | 4.0 | 10.0 | 2.0 | - | 5.0 | 3.0 | 6.0 | - | - | too easy | Major misalignment: The quiz content (sentence component analysis, rhetorical devices, formal vs informal) is more aligned with Mainland China middle school grammar than Singapore PSLE Higher Chinese. The answer key is completely disconnected from the quiz questions (e.g., quiz asks for sentence analysis, answer key provides sentence type identification). Difficulty is too low for P6 Higher Chinese; it lacks the depth of comprehension and nuanced language use required for PSLE. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | grammar | 7.3 | No | 8.5 | 7.0 | 6.0 | 10.0 | 9.0 | - | 5.0 | 6.0 | 7.0 | - | - | uneven | The content is more aligned with secondary school grammar (sentence component analysis, rhetorical devices) than the standard PSLE Higher Chinese format, which focuses more on comprehension and vocabulary in context. The exam format is inconsistent: the quiz header claims 75 marks, but the answer key claims 50 marks. The difficulty is uneven, mixing very basic sentence types with advanced linguistic analysis. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | grammar | 5.1 | No | 4.0 | 3.0 | 5.0 | 10.0 | 7.0 | - | 6.0 | 2.0 | 4.0 | - | - | too hard | The content is highly inappropriate for P6 Higher Chinese. It includes advanced Classical Chinese (Wenyanwen) grammar, tonal analysis (Pingze), and rhyming composition, which are secondary school topics, not PSLE. Additionally, the provided answer key does not match the generated quiz content at all, making it useless for the artifact. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | hanyu-pinyin | 8.1 | No | 9.0 | 8.0 | 6.0 | 10.0 | 10.0 | 10.0 | 7.0 | 4.0 | 9.0 | - | - | too easy | The content is far too simple for Primary 6 Higher Chinese; Hanyu Pinyin is a foundational skill usually mastered much earlier. The quiz lacks the complexity expected at the PSLE level. Exam format is decent but lacks the formal structure of a standard MOE paper. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | hanyu-pinyin | 5.9 | No | 8.0 | 6.0 | 4.0 | 10.0 | 0.0 | 10.0 | 5.0 | 3.0 | 7.0 | - | - | uneven | Major failure in answer key alignment: the answers provided do not match the questions asked (e.g., Part 1 asks for pinyin of specific words like 辉煌, but the key provides pinyin for different words like 雄伟). The difficulty is uneven; Part 1 uses high-level vocabulary, but Part 2 and 3 are basic phonetic rules. The exam format lacks standard PSLE sectioning and marks distribution logic. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | hanyu-pinyin | 7.8 | No | 9.0 | 7.0 | 6.0 | 10.0 | 8.0 | 10.0 | 7.0 | 4.0 | 9.0 | - | - | too easy | The content is far too simple for Primary 6 Higher Chinese; Pinyin fundamentals are typically mastered much earlier in the primary years. While the formatting is clean and the answer key is good, the level of cognitive demand does not match the PSLE Higher Chinese syllabus which focuses on critical analysis and sophisticated expression. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | hanyu-pinyin | 8.2 | No | 9.0 | 8.5 | 7.0 | 10.0 | 6.0 | 10.0 | 8.0 | 7.5 | 8.0 | - | - | appropriate | The quiz content is high quality and uses sophisticated vocabulary suitable for Higher Chinese. However, there is a major discrepancy between the main quiz and the provided answer key; the answer key contains entirely different questions and marks than the quiz itself. The quiz format is generally good, but the answer key lacks the step-by-step explanation requested for complex rules. |
| Legacy generator | Quiz | 5-1 | Higher Chinese | listening | 7.2 | No | 8.5 | 7.0 | 6.0 | 10.0 | 9.0 | - | 5.0 | 4.0 | 8.0 | - | - | too easy | The content is too simple for Higher Chinese (HCL) P6; it reads more like Standard Chinese (CL) level. HCL requires critical analysis and sophisticated inference, whereas these questions only test literal comprehension. The exam format is missing total marks (total is 39, but header says 50) and lacks formal PSLE-style instructions/timing. The answer key is well-structured with explanations. |
| Legacy generator | Quiz | 5-1 | Higher Chinese | reading | 8.1 | No | 8.5 | 9.0 | 7.5 | 10.0 | 9.0 | - | 7.0 | 6.0 | 8.0 | - | - | too easy | The content is too simple for Higher Chinese P6; the texts and questions resemble Standard Chinese or even P4/P5 level. The 'Classical Chinese' section is extremely basic. Marks total 48 instead of the stated 50. Question templates lack the rigor of actual PSLE Higher Chinese comprehension papers. |
| Legacy generator | Quiz | 5-1 | Higher Chinese | speaking | 9.0 | Yes | 9.5 | 9.0 | 8.5 | 10.0 | 9.0 | - | 8.0 | 9.0 | 9.0 | - | - | appropriate | High quality content for Higher Chinese. The language and depth of discussion questions are well-aligned with P6 expectations. Major issue: The 'Picture Description' section uses text descriptions instead of actual images, making it impossible to conduct a real oral exam. The answer key provides excellent model responses. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | vocabulary | 7.5 | No | 9.0 | 8.5 | 5.0 | 10.0 | 7.0 | 10.0 | 4.0 | 6.0 | 8.0 | - | - | uneven | The vocabulary level is appropriate for Higher Chinese, but the question format deviates significantly from PSLE standards. PSLE vocabulary questions are typically multiple-choice (MCQ) or fill-in-the-blanks within a passage, rather than open-ended 'define and make a sentence' tasks. Section 3 (Synonyms) is too simplistic for P6 Higher Chinese. The difficulty is uneven: Section 1 is high-level, while Section 3 is very basic. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | vocabulary | 4.9 | No | 6.0 | 4.0 | 2.0 | 10.0 | 0.0 | 10.0 | 3.0 | 4.0 | 5.0 | - | - | uneven | Major hallucination/mismatch: The question paper content (Traditional Culture, Idioms, Poetry) does not match the answer key (Vocabulary, Synonyms, Multiple Choice). The question format is essay-style/open-ended, which is not typical for PSLE Higher Chinese vocabulary sections. The difficulty is uneven; poetry analysis is too hard for a simple vocabulary quiz, while the answer key's synonym section is too easy. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | vocabulary | 7.6 | No | 9.0 | 8.5 | 6.0 | 10.0 | 7.0 | 10.0 | 5.0 | 7.0 | 6.0 | - | - | uneven | The vocabulary level is appropriate for Higher Chinese, but the exam format is non-standard for PSLE (e.g., includes open-ended definition/sentence construction which is rare in standard MCQ/Cloze papers). The total marks in the quiz (70+15) do not match the answer key (50). The timeframe of 40 minutes is likely too short for the volume of writing and definition tasks required. |
| Legacy generator | Quiz | 3-0 | Higher Chinese | vocabulary | 6.0 | No | 6.0 | 5.0 | 4.0 | 10.0 | 7.0 | 10.0 | 5.0 | 3.0 | 4.0 | - | - | too hard | The content is far too academic and philosophical for P6 level; it reads like a secondary school or university entrance exam for Chinese literature. The question types (explaining philosophical concepts like 'Tian Ren He Yi') are not aligned with PSLE Higher Chinese formats which focus on comprehension and language use. The total marks and time allocation are inconsistent between the quiz and the answer key provided. |
| Legacy generator | Quiz | 5-1 | Higher Chinese | vocabulary | 8.1 | No | 9.0 | 8.5 | 7.0 | 9.0 | 9.0 | - | 6.0 | 8.0 | 8.0 | - | - | appropriate | Vocabulary level is appropriate for Higher Chinese. Major issue: The total marks in the header (50) do not match the calculated total in the answer key (65), and the scoring table contains a manual correction note that should have been resolved in the artifact itself. Question format is a mix of styles rather than a strict PSLE paper simulation. |
| Legacy generator | Quiz | 5-1 | Higher Chinese | writing | 7.3 | Yes | 9.0 | 8.5 | 7.0 | 10.0 | 9.0 | - | 6.0 | 5.0 | 4.0 | - | - | too hard | The artifact is problematic for a single quiz. It asks for two separate compositions (one visual, one topical) totaling 50 marks, which is impossible to complete within a standard exam timeframe. The word count requirements (200-250 words per essay) are actually too low for Higher Chinese P6, which typically requires much longer, more sophisticated compositions (1400-1500 characters for the whole paper). The 'images' are provided only as text descriptions, which is a major missing-image issue for a visual composition task. The answer key is excellent, providing high-quality model essays and analysis. |
| Legacy generator | Quiz | 5-1 | Mathematics | addition-subtraction | 7.7 | No | 10.0 | 6.0 | 7.0 | 10.0 | 9.0 | 10.0 | 6.0 | 3.0 | 8.0 | - | - | too easy | The content is far too simple for Primary 6 PSLE level; it focuses on basic large number arithmetic which is a P3/P4 topic. P6 should involve multi-step problems integrating ratio, percentage, or fractions. Also, the total marks in the header (50) contradicts the marking scheme total (55). |
| Legacy generator | Quiz | 3-0 | Mathematics | angles-geometry | 6.0 | Yes | 9.0 | 8.0 | 6.0 | 4.0 | 3.0 | 5.0 | 7.0 | 4.0 | 8.0 | - | - | uneven | Major issues: 1. Multiple questions (12, 13, 15) require diagrams that are missing. 2. The answer key contains 'thinking aloud' artifacts where the model realizes its own errors and provides incorrect/unstable solutions. 3. Question 14 has a mathematical mismatch between the question and the provided answer. 4. Notation is inconsistent; some geometry terms are used without proper LaTeX. 5. Difficulty is uneven due to the broken logic in the answer key. |
| Legacy generator | Quiz | 3-0 | Mathematics | angles-geometry | 4.3 | Yes | 9.0 | 4.0 | 5.0 | 3.0 | 1.0 | 5.0 | 6.0 | 2.0 | 4.0 | - | - | too hard | Critical failure: The answer key is a hallucinated mess. The answers provided do not correspond to the questions asked (e.g., MCQ 1 answer is b, but the correct answer is c). The model's internal monologue/corrections are included in the output, making it unusable. Geometry questions are impossible to solve without the missing diagrams. Content includes advanced polygon properties (interior/exterior angles of n-gons) not typically required at P6 level. The difficulty is uneven and the logic is broken. |
| Legacy generator | Quiz | 3-0 | Mathematics | area-perimeter | 5.7 | Yes | 9.0 | 9.0 | 5.0 | 7.0 | 2.0 | 4.0 | 6.0 | 4.0 | 5.0 | - | - | uneven | Critical failure: The answer key does not match the quiz questions. The quiz has 16 questions (10 MCQ, 4 Short, 2 Problem Solving), but the answer key provides answers for a completely different set of 10 questions. The answer key also contains internal monologue/thinking process and incorrect calculations. Difficulty is uneven as MCQ is very basic while Section B/C requires higher-order thinking. Missing diagrams for geometry questions. |
| Legacy generator | Quiz | 3-0 | Mathematics | data-analysis | 9.0 | Yes | 10.0 | 10.0 | 7.0 | 10.0 | 10.0 | 10.0 | 8.0 | 6.0 | 10.0 | - | - | too easy | The quiz is too easy for P6 PSLE level; it focuses on basic definitions and simple calculations rather than the complex, multi-step word problems typical of the exam. Missing visual representations for the bar chart and pie chart mentioned in the text. Answer key is excellent and provides clear steps. |
| Legacy generator | Quiz | 3-0 | Mathematics | data-analysis | 7.2 | Yes | 10.0 | 10.0 | 6.0 | 10.0 | 0.0 | 10.0 | 7.0 | 4.0 | 8.0 | - | - | uneven | Critical failure: The answer key is completely disconnected from the questions. The questions ask about ages, spinners, and football goals, but the answers discuss pets, books, and temperature. Additionally, Section A questions require visual aids (graphs/pie charts) that are missing, making them impossible to solve as written. |
| Legacy generator | Quiz | 5-1 | Mathematics | data-analysis | 8.7 | Yes | 10.0 | 10.0 | 7.0 | 10.0 | 9.0 | 10.0 | 6.0 | 8.0 | 8.0 | - | - | appropriate | Content is mathematically sound and syllabus-aligned. However, several questions (pie charts, bar graphs) rely on visual data that is described in text but lacks the actual diagrams/images required for a real exam experience. The total marks in the header (50) do not match the marking scheme total (55). Section D is slightly more challenging, providing good progression. |
| Legacy generator | Quiz | 5-1 | Mathematics | fractions | 7.4 | No | 10.0 | 10.0 | 6.0 | 10.0 | 9.0 | 2.0 | 5.0 | 7.0 | 8.0 | - | - | appropriate | The quiz content is syllabus-aligned and the difficulty progression is good. However, it fails significantly on technical formatting: it uses plain text slashes for fractions instead of LaTeX, which is unacceptable for P6 level. The exam format is also inconsistent; the total marks in the header (50) does not match the marking scheme total (55). Question 4 has a weird 'Both B and C' option which is rare in PSLE. |
| Legacy generator | Quiz | 5-1 | Mathematics | geometry | 6.1 | Yes | 9.0 | 7.0 | 5.0 | 8.0 | 9.0 | 2.0 | 4.0 | 4.0 | 7.0 | - | - | too easy | The quiz is significantly below PSLE standard; it focuses on basic properties rather than the complex composite figure and circle geometry required by the P6 syllabus. Question 14 uses a formula (n-2)*180 not taught in P6. Lack of LaTeX for math notation and reliance on ASCII art for geometry is poor. Total marks in marking scheme (55) do not match the header (50). |
| Legacy generator | Quiz | 5-1 | Mathematics | measurement | 7.9 | Yes | 10.0 | 9.0 | 7.0 | 10.0 | 9.0 | 8.0 | 6.0 | 4.0 | 8.0 | - | - | too easy | The quiz is significantly below P6 PSLE standard; it focuses on P3-P4 level conversions and basic perimeter/area rather than P6 topics like circles, volume of composite solids, or complex multi-step measurement problems. The total marks in the marking scheme (55) do not match the header (50). Missing diagrams for geometry questions. |
| Legacy generator | Quiz | 5-1 | Mathematics | money | 9.2 | No | 10.0 | 10.0 | 8.0 | 10.0 | 10.0 | 10.0 | 7.0 | 8.5 | 9.0 | - | - | appropriate | Good coverage of money topics including percentage and GST. Note: The total marks in the marking scheme (55) contradicts the header (50). Question 4 requires rounding which is slightly ambiguous for P6 without specific instruction. Section A/B/C/D structure is good but lacks the standard PSLE Paper 1/Paper 2 distinction. |
| Legacy generator | Quiz | 5-1 | Mathematics | multiplication-division | 7.9 | No | 9.0 | 8.0 | 7.0 | 10.0 | 8.0 | 10.0 | 6.0 | 4.0 | 9.0 | - | - | too easy | The quiz is significantly below P6 PSLE standard; it focuses on basic arithmetic rather than the complex multi-step word problems or ratio/percentage integration expected at this level. Major errors in the answer key: Q3 has no correct option (remainder is 5, not in A-D) and Q4 has two correct options (C and D). The total marks in the marking scheme (55) do not match the header (50). |
| Legacy generator | Quiz | 3-0 | Mathematics | problem-solving-heuristics | 7.9 | Yes | 9.5 | 9.0 | 7.0 | 10.0 | 8.5 | 5.0 | 6.0 | 7.0 | 9.0 | - | - | appropriate | The quiz covers relevant heuristics. However, it lacks proper LaTeX for mathematical expressions (uses plain text instead). The answer key contains internal 'thinking' artifacts (e.g., 'Wait, let me recalculate') which should be cleaned. Question 8 requires a diagram to be clear. The exam format is slightly simplified compared to actual PSLE papers. |
| Legacy generator | Quiz | 3-0 | Mathematics | problem-solving-heuristics | 4.4 | Yes | 9.0 | 6.0 | 5.0 | 4.0 | 1.0 | 0.0 | 7.0 | 3.0 | 5.0 | - | - | uneven | Critical failure: The answer key is completely hallucinated and does not correspond to the questions provided. For example, Answer 1 refers to stickers and a different math problem, while Question 1 is about Lisa's savings. Answer 2 is a geometric/algebraic solution for a problem not in the quiz. The quiz itself is too easy for P6 PSLE level, and the answer key contains internal contradictions and incorrect logic. No LaTeX used for math notation. |
| Legacy generator | Quiz | 3-0 | Mathematics | psle-revision | 8.2 | Yes | 10.0 | 10.0 | 7.0 | 10.0 | 10.0 | 5.0 | 8.0 | 4.0 | 10.0 | - | - | too easy | The quiz is significantly easier than actual PSLE standard; most questions are direct one-step calculations. Question 12 and 15 require diagrams for standard exam presentation. Notation uses plain text fractions instead of proper LaTeX. Marks distribution is slightly inconsistent with PSLE weighting. |
| Legacy generator | Quiz | 3-0 | Mathematics | psle-revision | 5.9 | No | 10.0 | 9.0 | 8.0 | 10.0 | 0.0 | 2.0 | 7.0 | 2.0 | 5.0 | - | - | uneven | Critical failure: The answer key is completely hallucinated and does not match the questions provided. For example, Q1 asks for a percentage but the answer key provides a mixed fraction; Q2 asks for triangle area but the answer key provides a different calculation entirely. Notation is poor, using plain text for fractions instead of LaTeX. Difficulty is uneven as questions are basic but the answer key is nonsensical. |
| Legacy generator | Quiz | 3-0 | Mathematics | speed-distance-time | 7.3 | No | 10.0 | 10.0 | 7.0 | 10.0 | 2.0 | 10.0 | 8.0 | 4.0 | 5.0 | - | - | uneven | Critical failure: The answer key does not correspond to the generated quiz. The quiz has 16 questions (10 MCQ, 4 Short, 2 Problem Solving), but the answer key provides solutions for a completely different set of 10 questions. Additionally, the answer key contains internal monologue/corrections and incorrect math in Question 7. Difficulty is uneven; Section A is too basic for P6, while Section C is appropriate. |
| Legacy generator | Quiz | 3-0 | Mathematics | volume | 6.4 | Yes | 9.0 | 4.0 | 7.0 | 8.0 | 5.0 | 6.0 | 8.0 | 3.0 | 8.0 | - | - | too hard | Major syllabus violation: P6 Singapore syllabus only covers volume of cubes and cuboids; questions on cylinders, cones, spheres, and pyramids are not in the P6 Standard syllabus. The answer key contains internal monologue/recalculations which is unprofessional. Difficulty is uneven due to out-of-syllabus content. |
| Legacy generator | Quiz | 3-0 | Mathematics | volume | 4.6 | Yes | 9.0 | 2.0 | 5.0 | 7.0 | 2.0 | 4.0 | 6.0 | 1.0 | 5.0 | - | - | too hard | Major failure in syllabus adherence: includes cylinders, cones, spheres, and pyramids which are not in the P6 MOE syllabus. The answer key is completely hallucinated and disconnected from the questions (e.g., Q1 answer refers to a different question's values). The difficulty is inappropriate as it introduces secondary school geometry. Answer key contains internal monologue and incorrect calculations. |
| Legacy generator | Quiz | 5-1 | Mathematics | whole-numbers | 7.7 | No | 9.0 | 10.0 | 7.0 | 10.0 | 6.0 | 10.0 | 5.0 | 4.0 | 8.0 | - | - | too easy | The quiz is significantly too easy for P6 PSLE; it focuses on P3-P4 level place value and rounding rather than P6 level whole number applications. Major error in MCQ Q2 where no correct option is provided. The total marks in the marking scheme (55) do not match the header (50). Answer key for Q10 is vague and mathematically inconsistent. |
| Legacy generator | Quiz | 5-1 | Science | diversity | 8.3 | Yes | 9.5 | 9.0 | 7.0 | 10.0 | 8.5 | 10.0 | 6.0 | 8.0 | 7.0 | - | - | appropriate | Content is syllabus-accurate for P6 Diversity. However, the exam format is flawed: the total marks in the header (50) do not match the marking scheme total (46), and the time limit is missing. Several questions (e.g., Q11, Q13) rely on diagrams or tables that are text-based rather than visual, which is atypical for PSLE Science. Answer key is high quality with good explanations. |
| Legacy generator | Quiz | 5-1 | Science | heat | 9.0 | Yes | 9.5 | 10.0 | 8.5 | 9.0 | 9.5 | 10.0 | 7.5 | 9.0 | 8.0 | - | - | appropriate | Content is high quality and syllabus-aligned. Uses ASCII art for diagrams which is a placeholder for real images. Exam format lacks specific time allocation (e.g., 1 hour) and total marks in the header (header says 50, marking scheme says 47). Questions are well-structured for PSLE prep. |
| Legacy generator | Quiz | 5-1 | Science | life-cycles | 8.6 | Yes | 9.5 | 9.0 | 7.0 | 10.0 | 9.0 | 10.0 | 6.0 | 8.5 | 8.0 | - | - | appropriate | The quiz content is high quality and syllabus-aligned. However, it relies heavily on diagrams (mosquito life cycle, flower structure) which are currently represented by text-based ASCII/placeholders rather than actual images. The exam format lacks standard PSLE components like specific time duration (e.g., 1 hour 30 mins) and the total marks in the header (50) does not match the marking scheme total (47). |
| Legacy generator | Quiz | 5-1 | Science | light | 8.6 | Yes | 9.5 | 10.0 | 7.0 | 9.0 | 9.0 | 10.0 | 6.0 | 8.5 | 8.0 | - | - | appropriate | Content is high quality and syllabus-aligned. However, the exam format is flawed: the total marks in the header (50) do not match the marking scheme total (47), and the section-based mark distribution is inconsistent with standard PSLE Science papers (which usually consist of Section A: MCQ and Section B: Open-Ended). ASCII diagrams are used as placeholders for missing actual diagrams. |
| Legacy generator | Quiz | 5-1 | Science | magnets | 9.0 | Yes | 9.5 | 10.0 | 8.5 | 9.0 | 9.5 | 10.0 | 7.5 | 9.0 | 8.0 | - | - | appropriate | Content is high quality and syllabus-aligned. Major issue: relies heavily on ASCII diagrams (magnetic fields, magnet orientation) which are poor substitutes for actual scientific diagrams required in PSLE. Total marks in key (47) does not match header (50). Section A/B marks are slightly high for the content depth. |
| Legacy generator | Quiz | 5-1 | Science | materials | 8.2 | Yes | 9.5 | 9.0 | 6.0 | 10.0 | 9.0 | 10.0 | 5.0 | 7.0 | 8.0 | - | - | appropriate | Content is scientifically accurate and level-appropriate. However, it fails to follow the PSLE Science paper format: Section A should be MCQ only (no True/False), and Section B should be open-ended (no Section C/D/E division). The total marks in the key (47) do not match the header (50). Several questions (e.g., Q14, Q15) imply diagrams or visual objects that are missing. |
| Legacy generator | Quiz | 5-1 | Science | systems | 8.5 | Yes | 9.5 | 10.0 | 7.0 | 10.0 | 9.0 | - | 6.0 | 8.5 | 8.0 | - | - | appropriate | Content is high quality and syllabus-aligned. However, it relies heavily on ASCII diagrams which are poor substitutes for actual biological diagrams required in PSLE. The total marks in the marking scheme (47) do not match the header (50). Exam format lacks specific instructions for Section A (e.g., 'Choose the most suitable answer'). |
Criteria
Scores use 10.0 as best fit. Missing images are tracked as a yes/no flag.
Language suitability
Syllabus adherence
Past-paper template adherence
No weird artefacts/symbols
Step-by-step answers
Latex/notation format
Exam paper format
Difficulty appropriateness
Doable within timeframe
Cheatsheet 3-point summaries
Parent guide syllabus fit