Safety and optimisation
NSWEduChat incorporates several layers of security and optimisation features. These layers work together to make sure our AI chatbot is not just intelligent but also safe, ethical, and a higher quality experience than free-to-use Generative AI.
Below, we explain how the different layers contributing to NSWEduChat work.
Video Transcript
NSWEduChat is the department's AI chatbot, designed for both staff and students.
In this video, we will explain how NSWEduChat works to make your experience smooth and safe.
When you enter a prompt into NSWEduChat, it goes through a series of filters.
This happens in a blink of a second so you won't even notice it's happening.
The first step is called system prompt. Think of it as setting the stage.
System prompt tells NSWEduChat that is here to support New South Wales classrooms.
Focusing on education and safety, jailbreak prevention checks the prompt making sure no one tries to make NSWEduChat respond outside its guidelines.
Profanity filtering ensures all our conversations within NSWEduChat are respectful, both ways.
The semantic content filter checks the meaning behind your words,
ensuring everything stays appropriate and beneficial.
The orchestrator picks the best knowledge sources from different AI models to answer your prompt before it presents its response, it applies one more round of semantic and profanity filters.
NSWEduChat smart, safe and ready to support your educational journey.
For more information, please visit our website.
End of transcript
System prompt
The system prompt is the first layer of providing the unique NSWEduChat experience. The system prompt provides the context and boundaries of the platform. The system prompt allows the underlying AI model to know it is operating in NSW Classrooms, and it is this layer that ensures students are accessing an education-focused assistant.
Jailbreak prevention
Jailbreak prevention scans user messages to ensure they are not trying to bypass established protections. This ensures that users cannot trick the system into giving answers it shouldn’t.
Profanity filtering
This layer operates twice in every chat interaction. It ensures that all language going into the bot and coming out is free from profanity and harmful language.
Semantic content filter
This level checks that the content the user provides is free from anything inappropriate. This checks the broader meaning of the message for a wider scope of harmful content beyond just profanity. If harmful content is detected, this layer will step in and stop the interaction from going any further.
Orchestrator
The Orchestrator layer allows NSWEduChat to utilise different knowledge sources and generative AI technologies. This ensures that the most effective and efficient model and knowledge source is being used for each request. The orchestrator streamlines this process without requiring any user input. NSWEduChat transparently advises the user which model was used to answer their question.
This page has recently been modified to reflect the latest information.