AI Essay Scoring : Piqosity Support Center

Piqosity now integrates with ChatGPT to AI-grade student essays. Essay prompts are found throughout Piqosity including in standardized test prep courses like TSI, ACT, and ISEE plus our English Language Arts (ELA) courses for grades 5-11.

Note that the Essay on the ISEE is not scored. Simply use Piqosity's new AI-scoring as a tool for helping students to improve their writing. Click here to read more about the ISEE Essay.

Upon writing their essay and scoring the quiz, students will now see an AI-generated score and specific feedback. As deployed on January 30, 2025, we are currently using ChatGPT 4o-mini.

The AI score is viewable in two places:

Test Results Page by clicking on the expansion carat
Essay View page by clicking to the page showing the essay prompt

Screenshot of Test Results Page

Screenshot of Essay View page

Category Scores

Piqosity will give each essay a score across six categories:

Purpose and Focus - primarily responsiveness to the prompt and overall coherence
Organization and Structure - paragraph usage and transitions
Development and Support - related, specific examples
Sentence Variety and Style - different types of sentence structures
Mechanical Conventions - grammar and spelling
Critical Thinking - following through on a thesis

Each of the six categories receives an AI-generated score of 1 to 9 where 1 is the lowest score and 9 is the highest:

Scores of 1, 2, and 3 are below average
Scores of 4, 5, and 6 are average
Scores of 7, 8, and 9 are above average

Piqosity will justify its score in each category including citations from the student's response. For example, if students receive a low score in the category "Mechanical Conventions," Piqosity may specifically point out mistakes in grammar and spelling.

Overall Score

Additionally, students will receive an Overall Score, which is simply the average of the six category scores. This Overall Score is accompanied by a summary justification of the score including:

what the student did well,
where they fell short, and
specific steps to improve their essay.

Mapping Piqosity Essay Scores to Standardized Test Scores

There are only two standardized courses on Piqosity with graded essays:

TSIA2
ACT Writing

Given the subjective nature of essay scoring, particularly the human-scored ACT essays, it's difficult to provide exact score concordance. Instead, it's better to think of Piqosity's scores as below average, average, and average and use the scores plus explanations to improve your writing.

However, we provide numerical mapping guidance below.

TSIA2 Score Mapping

The actual TSIA2 Essay receives just one holistic score from 0 to 8 versus Piqosity's 1 to 9 scale. Therefore, to technically match Piqosity's score with the TSIA2, you would take Piqosity's Overall Score and subtract 1. However, our testing reveals a little bit more nuance:

Piqosity scores of 1, 2, and 3 are probably too high, so subtract 1
Piqosity scores of 4, 5, and 6 probably don't need adjustment
Piqosity scores of 6 and 7 are too low and would probably be a 7 or 8 on the TSIA

ACT Writing Score Mapping

The actual ACT writing test receives both an Overall score and a Domain score. Here's how to map ACT's domains to Piqosity's categories:

ACT "Ideas and Analysis" = Piqosity Purpose & Focus + Critical Thinking
ACT "Development and Support" = Piqosity Development and Support
ACT "Organization" = Piqosity Organization and Logic
ACT "Language Use and Conventions" = Piqosity Mechanical Conventions + Sentence Variety and Style

Here's how to Match Piqosity's 9-point scale to ACT's 6-point scale:

Piqosity scores of 1, 2, and 3 are equivalent to ACT's scores of 1 and 2
Piqosity's scores of 4, 5, and 6 are equivalent to ACT's scores of 3 and 4
Piqosity's scores of 7, 8, and 9 are equivalent to ACT's scores of 5 and 6

Comparing AI Scoring to Human Scoring

Fourth generation large language models (LLMs) like ChatGPT 4o-mini are powerful tools but they do have limitations. We spent dozens of hours engineering our AI prompts and comparing the results to human-scored essays.

Piqosity is a "fair grader" of bad to good essays
But a "hard grader" of great essays

The AI is good at providing detailed feedback and providing "ballpark scores" for about 85% of essays that most educators would score as decent to good. However, it's less capable at identifying truly exceptional work according to a student's grade level or recognizing when the student simply didn't try.

Our current implementation does a good job of mitigating for the truly bad essays by feeding specific data from the app to the AI like word-count expectations and examples of what an automatic failure looks like. However, we have been admittedly less successful at getting it to score remarkable essays as 8 and 9 on our grading scale.

As a result of these known limitations, students could probably describe our essay scoring as a "fair grader" for poor to good essays but a "hard grader" of exceptional essays. We will continue to refine our scoring methodology and look forward to even better results in the future by updating to newer LLM versions and further prompt refinement according to specific courses.

Usage Limitations

The number of essays a student can submit for AI scoring is limited by their subscription plan:

On-Track - 4 submissions
Honors - 30
Advanced - 60

AI Essay Scoring Print

Related Articles