
Conversation


@xxdydx xxdydx commented Apr 6, 2025

Description

Created a new feature that uses LLMs to automate feedback generation for code submissions in Source Academy. This will help save TAs some time when grading student submissions!

  • Added AI comment generation functionality in the backend.
  • Integrated OpenAI API for generating AI comments.
  • Implemented the generate_ai_comments endpoint to fetch question details and generate AI comments for submissions (see the route sketch below).
  • Added save_chosen_comments endpoint to save multiple chosen comments for a submission and question for logging purposes.
  • Added save_final_comment endpoint to save the final comment chosen for a submission for logging purposes.
  • Added a new ai_comment_logs table to log the inputs, the student's original code, the outputs generated by the LLM, the comments chosen, and the final comment.
  • Added AIComments module to handle creation, retrieval, and updates for AI comments, including saving final and chosen comments.
  • Added AICodeAnalysisController to handle AI comment generation, saving final comments, and saving chosen comments.
  • Added test cases for generate_ai_comments, save_final_comment, and save_chosen_comments endpoints in AICodeAnalysisControllerTest.
  • Updated Swagger documentation for the new endpoints.
  • Added necessary migrations to update the database schema.
  • Added encryption and decryption logic for LLM API keys using AES-GCM.

Note: This may require changes to the DB diagram in README.md.
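For illustration, here is a minimal sketch of how the three endpoints might be registered in lib/cadet_web/router.ex. The paths, scope, and pipeline names below are assumptions for this sketch, not the actual routes in this PR:

```elixir
# Hypothetical route registration; the real paths and pipelines in the PR may differ.
scope "/v2/courses/:course_id/admin", CadetWeb do
  pipe_through([:api, :auth, :ensure_staff])

  post("/generate_ai_comments/:submission_id/:question_id", AICodeAnalysisController, :generate_ai_comments)
  post("/save_chosen_comments/:submission_id/:question_id", AICodeAnalysisController, :save_chosen_comments)
  post("/save_final_comment/:submission_id/:question_id", AICodeAnalysisController, :save_final_comment)
end
```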

Type of Change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update
  • Code quality improvements

Checklist

  • I have tested this code
  • I have updated the documentation

@xxdydx xxdydx changed the title from "Feat/add ai generated comments grading" to "AI-powered marking" on Apr 6, 2025
@xxdydx xxdydx self-assigned this Apr 6, 2025

coveralls commented Apr 9, 2025

Coverage Status

coverage: 88.72% (-0.9%) from 89.636%
when pulling 8cf09b9 on feat/add-AI-generated-comments-grading
into 40cd12a on master.

@xxdydx xxdydx marked this pull request as ready for review April 9, 2025 05:06
@xxdydx xxdydx requested a review from GabrielCWT April 9, 2025 05:06

@GabrielCWT GabrielCWT left a comment


Thanks for this feature! Quite a few comments; please look through and resolve them. I also have some clarification questions, so please answer those as well.

One question I have: when are these comments used? I couldn't find any point at which the comments are returned to or retrieved by the FE.

Retrieves an AI comment for a specific submission and question.
Returns `nil` if no comment exists.
"""
def get_ai_comments_for_submission(submission_id, question_id) do

The naming implies you are getting all AI comments. Also, what is the use case for getting only one of the comments?


It appears this function is not used either


I think we can remove this function then.


Will keep this for eventual AI comments retrieval when loading a submission

@RichDom2185 RichDom2185 requested a review from Copilot October 26, 2025 14:13
@RichDom2185

@sentry review


Copilot AI left a comment


Pull Request Overview

This PR introduces an AI-powered marking feature that leverages LLMs to automate feedback generation for code submissions in Source Academy, reducing grading workload for teaching assistants.

Key Changes:

  • Added AI comment generation infrastructure with OpenAI API integration and AES-GCM encrypted API key storage at course level
  • Created database schema and logging system for AI-generated comments with new endpoints for generating, saving, and managing AI feedback
  • Extended existing models with LLM-related fields including course-level, assessment-level, and question-level prompts

Reviewed Changes

Copilot reviewed 30 out of 31 changed files in this pull request and generated 4 comments.

Summary per file:

  • lib/cadet_web/controllers/generate_ai_comments.ex: New controller implementing AI comment generation, LLM API interaction, and comment persistence
  • lib/cadet/ai_comments.ex: New module handling CRUD operations for AI comments
  • lib/cadet/ai_comments/ai_comment.ex: Schema definition for the ai_comment_logs table
  • lib/cadet/courses/course.ex: Added LLM configuration fields and AES-GCM encryption logic for API keys
  • lib/cadet/assessments/assessments.ex: Extended get_answers_in_submission to support AI comments and added assessment prompt retrieval
  • priv/repo/migrations/*: Database migrations for LLM features and the ai_comment_logs table
  • test/cadet_web/controllers/ai_code_analysis_controller_test.exs: Test coverage for AI comment generation endpoints
  • config/test.exs: Added encryption key configuration for testing
  • lib/cadet_web/router.ex: Registered new AI comment endpoints
  • lib/cadet_web/admin_views/admin_grading_view.ex: Updated grading view to include AI comments
  • test/support/seeds.ex: Updated test fixtures with new fields


Tkaixiang and others added 4 commits October 26, 2025 22:16
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
…ource-academy/backend into feat/add-AI-generated-comments-grading
@RichDom2185

@sentry review


@RichDom2185 RichDom2185 left a comment


Some questions about the schema

prepend: "prepend",
solutionTemplate: "template",
postpend: "postpend",
llm_prompt: "llm_prompt",

Why do we need the LLM prompt returned to the FE?


This is to allow the "Composed prompt" view in the grading page

enableSourcecast: :enable_sourcecast,
enableStories: :enable_stories,
enableLlmGrading: :enable_llm_grading,
llmApiKey: :llm_api_key,

Why do we need these returned? Also, this is returning the encrypted value?

Comment on lines +17 to +18
create(index(:ai_comment_logs, [:submission_id]))
create(index(:ai_comment_logs, [:question_id]))

Are these indices strictly necessary? Since AI comments (and these are logs, no less) are written very often and in large volumes, adding an index will significantly degrade write performance. FK lookups are already well-optimized on Postgres; I don't see a need for the index, imo.

def change do
create table(:ai_comment_logs) do
add(:submission_id, references(:submissions, on_delete: :delete_all), null: false)
add(:question_id, references(:questions, on_delete: :delete_all), null: false)

Why is it by question instead of by answer? (which has the submission + question)

Comment on lines +395 to +424
  defp decrypt_llm_api_key(encrypted_key) do
    case Application.get_env(:openai, :encryption_key) do
      secret when is_binary(secret) and byte_size(secret) >= 16 ->
        key = binary_part(secret, 0, min(32, byte_size(secret)))

        case Base.decode64(encrypted_key) do
          {:ok, decoded} ->
            iv = binary_part(decoded, 0, 16)
            tag = binary_part(decoded, 16, 16)
            ciphertext = binary_part(decoded, 32, byte_size(decoded) - 32)

            case :crypto.crypto_one_time_aead(:aes_gcm, key, iv, ciphertext, "", tag, false) do
              plain_text when is_binary(plain_text) -> {:ok, plain_text}
              _ -> {:decrypt_error, :decryption_failed}
            end

          _ ->
            Logger.error(
              "Failed to decode encrypted key, is it a valid AES-256 key of 16, 24 or 32 bytes?"
            )

            {:decrypt_error, :decryption_failed}
        end

      _ ->
        Logger.error("Encryption key not configured")
        {:decrypt_error, :invalid_encryption_key}
    end
  end
end

Should be abstracted away together with the encrypt function in a separate module for better abstraction/encapsulation
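A minimal sketch of what such a module could look like; the module name is hypothetical, and it mirrors the IV/tag/ciphertext layout and the :openai encryption_key config from the code under review:

```elixir
defmodule Cadet.Crypto.SecretBox do
  @moduledoc "Hypothetical helper that encapsulates AES-GCM encryption of short secrets such as LLM API keys."

  @iv_bytes 16
  @tag_bytes 16

  def encrypt(plaintext) when is_binary(plaintext) do
    key = fetch_key!()
    iv = :crypto.strong_rand_bytes(@iv_bytes)

    {ciphertext, tag} =
      :crypto.crypto_one_time_aead(:aes_gcm, key, iv, plaintext, "", @tag_bytes, true)

    Base.encode64(iv <> tag <> ciphertext)
  end

  def decrypt(encoded) when is_binary(encoded) do
    key = fetch_key!()

    with {:ok, decoded} <- Base.decode64(encoded),
         <<iv::binary-size(@iv_bytes), tag::binary-size(@tag_bytes), ciphertext::binary>> <- decoded,
         plaintext when is_binary(plaintext) <-
           :crypto.crypto_one_time_aead(:aes_gcm, key, iv, ciphertext, "", tag, false) do
      {:ok, plaintext}
    else
      _ -> {:error, :decryption_failed}
    end
  end

  # Same key-derivation convention as the code under review: truncate the
  # configured secret to at most 32 bytes.
  defp fetch_key!() do
    secret = Application.fetch_env!(:openai, :encryption_key)
    binary_part(secret, 0, min(32, byte_size(secret)))
  end
end
```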

)

# Store the IV, tag and ciphertext together
encrypted = Base.encode64(iv <> tag <> ciphertext)

Nit: Please add a non-b64 character delimiter between the parts instead of just concatenating. That way we don't need to rely on offsets, and can just split using the delimiter (safer, more robust against variable-length payloads)
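For instance, a sketch of the delimiter approach; the "." delimiter is just one possible choice, and it can never appear in Base64 output:

```elixir
# Encode each part separately and join with a delimiter that cannot occur in Base64.
encrypted =
  [iv, tag, ciphertext]
  |> Enum.map(&Base.encode64/1)
  |> Enum.join(".")

# Decoding then no longer depends on fixed byte offsets.
[iv, tag, ciphertext] =
  encrypted
  |> String.split(".")
  |> Enum.map(&Base.decode64!/1)
```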

Comment on lines +98 to +105
# Get head of answers (should only be one answer for given submission
# and question since we filter to only 1 question)
case answers do
[] ->
conn
|> put_status(:not_found)
|> text("No answer found for the given submission and question_id")

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We can just pull the answer directly if we store the answer as an FK, right? Is there a reason we store separate FKs instead?
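For example, the migration could reference the answers table directly (the column name below is hypothetical):

```elixir
# Hypothetical alternative: a single FK to the answer, which already ties
# together the submission and the question.
add(:answer_id, references(:answers, on_delete: :delete_all), null: false)
```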

{:decrypt_error, err} ->
conn
|> put_status(:internal_server_error)
|> text("Failed to decrypt LLM API key: #{inspect(err)}")

We really shouldn't be returning internal error details in HTTP responses, especially for something security-critical like decryption.
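One possible shape, keeping the detail in the server logs only (a clause sketch mirroring the excerpt above, not the PR's actual code):

```elixir
{:decrypt_error, err} ->
  # Log the specific reason server-side...
  Logger.error("Failed to decrypt LLM API key: #{inspect(err)}")

  # ...but return only a generic message to the client.
  conn
  |> put_status(:internal_server_error)
  |> text("Failed to decrypt LLM API key")
```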

Comment on lines +174 to +190
**Additional Instructions for this Question:**
#{answer.question.question["llm_prompt"] || "N/A"}
**Question:**
```
#{answer.question.question["content"] || "N/A"}
```
**Model Solution:**
```
#{answer.question.question["solution"] || "N/A"}
```
**Autograding Status:** #{answer.autograding_status || "N/A"}
**Autograding Results:** #{format_autograding_results(answer.autograding_results)}
The student answer will be given below as part of the User Prompt.

Beware of the unstripped leading indent
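In Elixir, a `"""` heredoc strips leading whitespace up to the column of the closing delimiter, so one fix is to keep each prompt section in a heredoc whose closing `"""` is aligned with the content. A sketch (the helper name and simplified field access are assumptions):

```elixir
defp question_section(question) do
  # The closing """ sets the indentation stripped from every line, so the
  # prompt text reaches the LLM without the surrounding code's indent.
  """
  **Question:**
  #{question["content"] || "N/A"}
  """
end
```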

]

case call_llm_endpoint(llm_api_url, input, headers) do
{:ok, %HTTPoison.Response{status_code: 200, body: body}} ->

Can we alias it for readability?
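For example (a sketch; `handle_success/1` is a hypothetical handler, not a function in this PR):

```elixir
alias HTTPoison.Response

case call_llm_endpoint(llm_api_url, input, headers) do
  {:ok, %Response{status_code: 200, body: body}} ->
    handle_success(body)

  {:ok, %Response{status_code: status}} ->
    {:error, {:unexpected_status, status}}

  {:error, %HTTPoison.Error{reason: reason}} ->
    {:error, reason}
end
```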

Comment on lines +205 to +209
HTTPoison.post(llm_api_url, input, headers,
timeout: 60_000,
recv_timeout: 60_000
)
end

Why are we using a manual POST instead of the OpenAI client library, like chat_controller.ex does? Is it to make it provider-agnostic? If so, it doesn't really make sense, since the downstream response parsing expects an OpenAI-compatible provider anyway.

But if it's already an OpenAI-compatible provider, then why not just set the base URL in the OpenAI client library?
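If the client library in use does support an overridable base URL, the configuration might look roughly like this; the :api_url key is an assumption here, so check the library's docs:

```elixir
# config/runtime.exs (sketch): point the OpenAI client at any
# OpenAI-compatible provider instead of issuing manual HTTP calls.
config :openai,
  api_key: System.get_env("LLM_API_KEY"),
  api_url: System.get_env("LLM_API_BASE_URL", "https://api.openai.com")
```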

