The Big Picture: A Smart Travel Agent for a Small Business
Imagine you run a small, local travel agency (a "Small and Medium Enterprise" or SME). You have a massive catalog of thousands of events—concerts, theater shows, pool parties, and food festivals.
Traditionally, your customers have to use a boring filter menu: "Show me concerts under $50 happening next week." It's rigid, and if they don't know exactly what they want, they get overwhelmed.
Enter EventChat. This is a new kind of digital assistant powered by Large Language Models (LLMs)—the same "brain" technology behind tools like ChatGPT. Instead of clicking filters, a customer can just chat with the app: "I'm in the mood for something spooky but cheap, maybe this weekend." The AI understands the vibe, checks your database, and suggests the perfect haunted house tour.
The paper is a report card on how a real startup tried to build this system, how it performed in the real world, and what they learned about the costs and headaches of using AI for small businesses.
1. The Goal: Why Build This?
The startup wanted two things:
- Better Customer Experience: Make finding fun things to do easier and more fun.
- Fix Bad Data: Their event list was built by "web scrapers" (robots that copy-paste info from other websites). Sometimes these robots miss details (like the price or the exact date). A human-like chatbot can read the messy text descriptions and figure out the details better than a rigid search bar.
2. The Design: Building the "Robot" on a Budget
The researchers didn't build a super-complex, expensive robot. They built a pragmatic, budget-friendly version suitable for a small company.
- The Brain: They used ChatGPT (via Microsoft Azure) because it was the smartest tool available at the time and they got it for free through a startup grant.
- The Strategy (Prompt-Based Learning): Instead of teaching the AI from scratch (which requires huge amounts of data and money, like sending a student to a 4-year university), they just gave it a set of instructions (a "prompt"). Think of it like hiring a very smart intern and giving them a detailed cheat sheet on how to do the job, rather than training them for years.
- The Workflow (Stage-Based): They didn't let the AI run wild. They built a "conveyor belt" system with five fixed steps:
- Chat: Just talk.
- Refusal: Say "no" if the question is weird.
- Search: Look for specific things.
- Recommend: Suggest things based on what you liked before.
- Inquiry: Answer specific questions about an event.
- Analogy: Imagine a restaurant kitchen. Instead of letting the chef (the AI) wander around the store to find ingredients (which takes too long and costs too much), the chef stays at the station and follows a strict recipe card. It's faster and cheaper.
3. The Results: The Good, The Bad, and The Expensive
They tested this with real people in a German city. Here is what happened:
✅ The Good News
- It Works: Most people (85.5%) felt the recommendations were accurate.
- Low Effort: Users felt it was easy to find what they wanted.
- The "ResQue" Model: The researchers updated a famous survey model (called ResQue) to measure how people feel about AI chatbots. They added new questions about how "consistent" and "coherent" the chat felt. This new survey tool is now a blueprint for other researchers.
❌ The Bad News (The "Gotchas")
- The "Latency" Problem (Waiting Time): The system was slow. On average, it took 5.7 seconds just to reply to a single message.
- Analogy: Imagine ordering a coffee, and the barista takes 6 seconds just to say "One coffee coming up!" before actually making it. It feels sluggish.
- The Cost Problem: Every time the AI thought, it cost money. The median cost was $0.04 per interaction.
- Analogy: If you have 1,000 customers a day, that's $40 a day just for the AI to think. For a small business with thin profit margins, this adds up fast.
- The "Hallucination" Problem: Sometimes the AI made things up. It might suggest a concert that didn't exist or ignore the price limit you set.
- Analogy: It's like a tour guide who confidently tells you a museum is open on Sundays, but they just made that up because they forgot to check the schedule.
- The "Context" Problem: The AI sometimes forgot details. If you said "I want something under $20," it might later suggest a $50 event because it got distracted by the long list of event descriptions.
4. Key Lessons for Small Businesses
The paper concludes with five "Golden Rules" for small companies trying to use AI:
- Keep it Simple (Stage-Based): Don't let the AI be a free-roaming "agent" that tries to do everything. Give it a fixed list of tasks. It's cheaper and more stable.
- Instructions Aren't Everything: Just giving the AI a prompt isn't enough if the task is too complex. Sometimes you need to "train" it (fine-tune) or give it better data, but that costs more.
- Guardrails are Essential: You need safety nets to stop the AI from lying (hallucinating) or getting hacked.
- Listen to How People Talk: Users ignored the "Time Button" on the screen and just typed their dates in the chat. Design your app to listen to the chat, not just the buttons.
- The "Re-Ranking" Tax: The most expensive part of the system was the AI trying to pick the best 10 events out of 100. Using a super-smart AI to do this is like hiring a Nobel Prize-winning chef to chop onions. It works, but it's too expensive for a small business.
The Bottom Line
EventChat proves that small businesses can build cool, AI-powered chatbots to help customers find events. It's a powerful tool that feels like magic.
However, it's not a "set it and forget it" solution. It's like driving a high-performance sports car: it's fast and fun, but it burns a lot of gas (money), needs a skilled driver (technical know-how), and if you aren't careful, it might stall (latency) or take a wrong turn (hallucinations).
For a small business to succeed with this, they have to carefully balance the wow factor for the customer against the wallet factor for the business.
Drowning in papers in your field?
Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.