ADS Online Problem-Solving Exam Skill Bundle Spec

Objective

Create a Codex/agent skill bundle that helps solve ADS/ADS2 online exam questions using the user’s local course materials as the source of truth. The bundle should make an AI agent faster and more reliable at recognizing question types, selecting the correct statistical method, writing correct R code, interpreting output, and formatting final exam answers.

The bundle is not intended to copy entire course documents into prompts. It should distill reusable workflows, decision rules, code patterns, interpretation templates, and common exam traps from the provided materials.

Source Material

Primary folders:

Important observed file groups:

Non-ADS files in the second folder, such as IFBS, GP2, and DST2 PDFs, should be ignored unless they are explicitly referenced by ADS files.

Second-pass review sources added on 2026-05-20:

Extraction notes from the second review pass:

Review Findings: Actual Common Exam Tasks

The skill bundle should prioritize the following task families because they recur in past papers, mock papers, ICA work, or course practicals.

1. Exam Report Workflow

Every coding challenge expects a knitted RMarkdown-style response, not just a numeric answer. The skill must force this answer shape:

  1. import and inspect data;
  2. clean or reshape only as needed;
  3. make a useful plot;
  4. state the method and why it fits;
  5. state null and alternative hypotheses;
  6. check assumptions or justify non-parametric/simulation alternative;
  7. run the test/model;
  8. interpret the p-value/effect/estimate in context;
  9. make a practical recommendation and mention limitations.

This workflow is as important as choosing the right test.

2. Data Cleaning and Reshaping

Common exam expectation:

Required recipes:

3. Probability, Bayes, and Conditional Reasoning

Observed tasks:

Skill requirements:

4. Power and Sample Size

Observed tasks:

Mathematical points:

5. t-Tests and Wilcoxon Tests

Observed tasks:

Decision rules:

6. ANOVA and Factorial Designs

Observed tasks:

Decision rules:

7. Categorical Data and Chi-Square/Fisher Tests

Observed tasks:

Decision rules:

8. Correlation and Regression

Observed tasks:

Decision rules:

9. Clustering and Feature Extraction

Observed tasks:

Skill scope:

10. Bootstrapping and Simulation

Observed tasks:

Skill requirements:

11. ICA/Open Analysis

Observed tasks:

Skill requirements:

Review Findings: Corrections Needed in the User Template

The user-provided notes are valuable and should be used, but the skill should correct these points before turning them into reusable exam recipes:

Constraints

Proposed Output

Create one self-contained skill bundle:

Optional, only if useful after extraction:

The skill should trigger on phrases such as:

Skill Design

SKILL.md

Purpose:

Core workflow:

  1. Start from the exam report workflow, not the statistical test.
  2. Import, inspect, clean, and reshape data only as needed.
  3. Classify the question type.
  4. Identify variables, response/explanatory roles, measurement scale, sample structure, and independence assumptions.
  5. Select the method using ads2-method-map.md.
  6. Generate or inspect R code using r-exam-recipes.md.
  7. Check assumptions and edge cases.
  8. Interpret results in exam language using interpretation-templates.md.
  9. Cross-check against common-traps.md.
  10. Produce a concise final answer with method, code/output summary, conclusion, assumptions, limitations, and next-step recommendation.

ads2-method-map.md

Expected content:

r-exam-recipes.md

Expected content:

interpretation-templates.md

Expected content:

common-traps.md

Expected content:

source-index.md

Expected content:

This is important so the skill remains auditable and can be updated later.

Implementation Plan

Phase 1: Source Inventory

Actions:

  1. Build a complete file inventory for both source folders. Completed in second-pass review.
  2. Filter to ADS-relevant files. Completed in second-pass review.
  3. Categorize files into exam papers, practicals, slides, datasets, and unrelated material. Completed in second-pass review.
  4. Create an initial source-index.md during implementation from the reviewed source list.

Acceptance criteria:

Phase 2: Text Extraction

Actions:

  1. Extract PDF text with pdftotext into a temporary workspace under /private/tmp. Completed for review under /private/tmp/ads2_skill_review.
  2. Extract R and RMarkdown source directly. Completed for review under /private/tmp/ads2_skill_review/r_text.
  3. Extract PPTX text only for slides that fill gaps not covered by PDFs/Rmd. Completed for readable slides under /private/tmp/ads2_skill_review/pptx_text.
  4. Summarize dataset columns using local commands or R, without copying full datasets. Partially completed for key exam datasets; finish during source-index creation.

Acceptance criteria:

Phase 3: Topic Distillation

Actions:

  1. Identify repeated exam question patterns from mock/past papers and Rmd solutions. Completed in this SPEC’s review findings.
  2. Map each pattern to:
  3. Compare practical/lecture content with exam files to fill missing theory and code details. Completed at the planning level; implementation should convert this into reference files.
  4. Convert the user’s provided notes into polished, corrected exam recipes.

Acceptance criteria:

Phase 4: Skill Bundle Draft

Actions:

  1. Create .codex/skills/ads2-online-exam-solver/SKILL.md.
  2. Create reference files listed in the proposed output.
  3. Keep SKILL.md compact and move detailed reference material into references/.
  4. Add optional scripts only if repeated data-audit or model-checking code is clearly useful.
  5. Include bilingual Chinese/English explanation where it preserves the user’s exam-prep notes, while keeping R code and final exam report phrasing in clear English unless asked otherwise.

Acceptance criteria:

Phase 5: Validation

Actions:

  1. Test the skill manually against at least six representative prompts:
  2. Use past/mock exam questions where feasible.
  3. Check that final answers are concise, contextual, and do not overclaim.
  4. Check that the skill corrects the known risky template errors listed above.

Acceptance criteria:

Phase 6: Optional Installation/Portability

Actions:

  1. Keep the repo-local skill under .codex/skills/.
  2. If the user wants global reuse, copy or mirror it into a user-level Codex skill location after approval.
  3. If the user wants a shareable archive, create a portable folder or zip after approval.

Acceptance criteria:

Verification Commands

Planned local checks:

find .codex/skills/ads2-online-exam-solver -maxdepth 3 -type f | sort
sed -n '1,220p' .codex/skills/ads2-online-exam-solver/SKILL.md
rg -n "TODO|TBD|lorem|placeholder" .codex/skills/ads2-online-exam-solver

If R scripts are added:

Rscript --vanilla .codex/skills/ads2-online-exam-solver/scripts/quick_data_audit.R --help

Risks and Mitigations

Open Questions

  1. Should the final skill be installed only repo-locally under .codex/skills/, or also copied to a global Codex skill directory for use outside this website repo?
  2. Should the skill answer in English only, Chinese only, or bilingual exam-support style?
  3. Should the skill include compact R helper scripts, or should it remain pure Markdown references?
  4. Do you want the skill to include an “exam mode” final-answer format, for example: Method, R code, Key output, Conclusion, Assumptions?

Next Step After Approval

After this SPEC is accepted, start implementation from the reviewed extraction notes. The first concrete implementation action will be to create .codex/skills/ads2-online-exam-solver/, then write SKILL.md, source-index.md, and the reference files from the second-pass findings above.