A Locally Deployed RAG-Based Academic Advising System for Course Selection

This paper proposes a locally deployed, privacy-preserving RAG-based system that leverages large language models and structured syllabus data to assist students in overcoming information overload and institutions in addressing resource limitations by generating optimal, prerequisite-aware course sequences for personalized academic planning.

Original authors: Feng Li, Yoritaka Iwata

Published 2026-06-03
📖 5 min read🧠 Deep dive

Original authors: Feng Li, Yoritaka Iwata

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine you are a student trying to build the perfect schedule for your degree. You have a massive library of course descriptions (syllabi) in front of you. The problem? There are so many books, and they are written in a confusing way. You don't know which class you need to take before another, or if a specific class fits your goals. Trying to find the answers yourself is like looking for a needle in a haystack while wearing blindfolds. Meanwhile, the human advisors who usually help are overwhelmed, tired, and can't talk to everyone at once.

This paper introduces a solution called Syllabot. Think of Syllabot as a super-smart, privacy-focused librarian who lives entirely on your own computer (not on the internet). Its job is to read all those course syllabi and answer your questions about which classes to take, without ever sending your personal data to a cloud server.

Here is how the paper breaks down the system and what it found, using simple analogies:

1. The Problem: The "Overwhelmed Student" vs. The "Busy Advisor"

Students often get stuck because they have too much information and don't know how to connect the dots. They might pick a class just because the title sounds cool, missing the fact that they need a prerequisite class first. Human advisors want to help, but they are stretched too thin.

2. The Solution: Syllabot (The Local Librarian)

The researchers built a system that combines two things:

  • A Search Engine: It scans the syllabus documents to find the exact pages you need.
  • A Local Brain (LLM): It reads those pages and writes a clear answer for you.

The "Local" Part: Usually, AI systems send your questions to big servers in the cloud. Syllabot runs entirely on the university's own computers. This is like having a private tutor in your own study room rather than calling a stranger on a public phone line. It keeps your data safe and private.

3. How the Search Works (The Three Strategies)

The researchers tested three different ways for the system to find the right information, similar to how you might search for a book:

  • The Keyword Hunter (BM25): This looks for exact word matches. If you ask, "What is the textbook for Math 101?", it finds the page that literally says "Math 101" and "textbook." It's great for specific terms but bad if you ask the same question in a different way.
  • The Meaning Matcher (Semantic/Embedding): This understands the idea behind your words. If you ask, "What book do I need for the intro math class?", it knows that "intro math class" means "Math 101" even if those exact words aren't there. It's like a librarian who understands your intent, not just your vocabulary.
  • The Hybrid Approach: This tries to use both the Keyword Hunter and the Meaning Matcher at the same time, hoping to get the best of both worlds.

4. The Experiments: Testing the Librarian

To see which search method worked best, the researchers created a "test drive" with 100 questions. They categorized the questions into four types:

  • The Easy One: "What is the textbook?" (Exact words match).
  • The Paraphrase: "What book do I need?" (Same meaning, different words).
  • The Puzzle: "What do I need to know before taking Class A and Class B?" (Requires finding info in multiple places).
  • The Trap: "Can I take a class that doesn't exist?" (The system should say "I don't know").

5. What They Found

  • The "Meaning Matcher" was the MVP: Surprisingly, the system that understood the meaning of the questions (Semantic) worked better than the one that just looked for exact words (Keyword), even for simple questions. The "Hybrid" approach didn't always beat the "Meaning Matcher" on its own; sometimes, mixing them just added noise.
  • More Context isn't Always Better: When the researchers gave the AI too many pages to read at once (increasing the "Top-k" limit), the quality of the answers actually dropped. It's like giving a student a whole encyclopedia to answer a simple question; they get confused and start making things up (hallucinations).
  • The "Refusal" Skill: When asked about things the syllabus didn't cover, the system was good at saying "I don't know." However, if the researchers gave it too much irrelevant information to read, it started guessing answers instead of refusing. It's like a student who, when overwhelmed with too much extra reading, starts making up an answer just to finish the test.

6. The Bottom Line

The paper concludes that for academic advising, a system that understands the meaning of a student's question is crucial. However, you have to be careful not to feed the AI too much extra information, or it might get confused and give you a confident but wrong answer.

The researchers suggest that future versions should be smarter about which pieces of information to pick, rather than just picking everything that sounds similar. They also note that while their system works well for this specific set of Japanese course syllabi, it's a proof-of-concept for how universities can build private, helpful AI tools without risking student privacy.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →