SlideShare a Scribd company logo
AI-Enhanced RAG System for Automated
University Course Content Generation
Contents
2
Project Overview and Scope
Business Problem
Business Understanding
CRISP-ML(Q) Methodology
Technical Stacks
Project Architecture
Model Building
Prompt Templates (Techniques)
Model Deployment - Strategy
Deployment
Challenges
Future Scopes
Prompt Templates Response Estimation
Model Response Accuracy
Project Overview and Scope
3
Overview
Project Purpose:
Create an AI system to automatically generate and improve university
course content using domain-specific data.
3 Major Phases:
• Data Collect and clean domain-specific PDFs and data.
• Prompt Testing Analyse how different prompts affect the model.
• Response Generation Create and improve responses using the model.
Project Scope:
Develop an AI system to generate and evaluate university course
content based on domain-specific data.
Business Problem
4
• The university requires a wide range of on-demand content.
• Manual content creation is time-consuming and resource-intensive.
Business Understanding
5
• Business Objective:
Improve content generation efficiency and quality while minimizing the time required.
• Business Constraint:
Ensure content originality to avoid plagiarism.
Success Criteria:
• Business Success Criteria:
Reduce content generation time by up to 80%.
• Machine Learning Success Criteria:
Keep question duplication rate below 5%.
• Economic Success Criteria:
Increase revenue from online courses by 15% in the first semester.
Business Understanding
5
AI-Driven Course Content Generation
• A Retrieval-Augmented Generation (RAG) system leveraging advanced AI models.
• Key models: Gemini-1.5-Pro , Gemini-1.5-Flash .
• Over various prompt techniques used for optimized content accuracy.
CRISP-DM(Q) Methodology
This project involve 6 phases of CRISP-ML(Q) methodology:
Business and Data Understanding:
• The goal is to develop a Retrieval-Augmented Generation (RAG) system using advanced AI latest models.
• The system integrates over various prompt techniques to enhance content accuracy and relevance.
Data Preparation:
• Gather domain-specific academic data.
• Clean and label the data for AI training.
• Ensure datasets are diverse and representative, covering all course topics.
• Provide an option for students to upload their course PDFs and retrieve relevant information from them.
Model Building and Tuning:
• Utilize the latest Gemini API models (Gemini-1.5 Pro and Flash) for content generation.
• Fine-tune model parameters by optimizing various prompt techniques to improve system performance.
Evaluation:
• Segment course content based on topic and difficulty.
• Apply instructional design principles to guide AI model training.
• Incorporate strategies like spaced repetition and interactive quizzes for effective learning outcomes.
CRISP-ML(Q) Methodology
Model Deployment:
• Deploy the model using a user-friendly interface, like Streamlit, for seamless interaction.
• Establish a robust feedback loop that continuously collects user inputs for model refinement.
• Implement automated retraining processes that leverage real-time user interaction data.
• Ensure the system is adaptable to curriculum updates and evolving educational needs, maintaining content quality.
• Implement automated retraining using user interaction data.
Monitoring and Maintenance:
• Continuously monitor system performance and user engagement metrics to identify areas for improvement.
• Regularly update the model to incorporate new content, optimizing the relevance of generated material.
• Maintain alignment with institutional goals by integrating feedback from both instructors and students.
Technical Stacks
Programming Languages:
• Python
Api Models:
• Gemin-1.5-pro
• Gemini-1.5-flash
Optimization:
• Prompt Techniques
Database (for Data Storage and Retrieval):
• Vector DB
• Local directory
Notebooks and Development Environment:
• Google Collab
• Visual Studio Code
Deployment
• Streamlit for user interface and interaction
Project Architecture
AI-Enhanced RAG System for Automated University Course Content Generation
Model Building
Lang Chain Framework
1. Select Models:
• 2 types of model being utilized in this project which is the Gemin-1.5-pro / Gemini-1.5-flash Models and
embedding-004 Model.
2. Retrieval & Augmented with Models:
• Gather domain-specific academic data.
• Generate responses from domain-specific PDFs.
• Estimate response quality.
3. Integration:
• Utilize the Lang-Chain framework for the RAG system.
• Access prompt techniques via prompt_techniques.py file using Gemini API models.
Model Building
3. Generate Response:
• The models generate responses based on prompt instructions outlined in the prompt techniques.py' file.
• For each query, the generated content is aligned with predefined guidelines and optimized for accuracy using
the fine-tuned prompt techniques.
• For each generated output, the relevance and precision of the content are assessed against a validation set to
ensure high-quality results.
Prompt Templates (Techniques)
Prompt Techniques Response Estimation
Model Response Accuracy
We evaluated the effectiveness of different prompt techniques by observing whether they followed the given instructions
correctly. To compare these techniques, we performed an estimation based on their performance.
To estimate the effectiveness of different prompt techniques, you can consider the following methods:
• Accuracy of Instruction Following: Check how closely each response adheres to the specific instructions given in the
prompt.
• Consistency: Compare how consistently each technique generates relevant and accurate responses across different
prompts.
• Quality of Responses: Assess the quality of the answers based on criteria such as relevance, coherence, and
informativeness.
• User Feedback: Gather feedback from users or evaluators on the usefulness and clarity of responses generated by each
technique.
• Quantitative Metrics: If applicable, using metrics like response length, relevancy scores, or error rates to objectively
measure the performance of each technique.
Model Deployment - Strategy
1. Test on New Data:
• Evaluate prompt techniques with domain-specific documents and PDFs.
• Validate performance with new datasets.
2. Continuous Monitoring:
• Monitor effectiveness with incoming data.
• Refine techniques for accuracy and relevance.
3. Feedback & Iteration:
• Collect user feedback.
• Iterate and improve prompt strategies.
4. Streamlit Deployment:
• Deploy prompts in a Streamlit interface for real-time testing.
• Allow interactive user input and feedback.
5. Documentation:
• Provide guidelines for interpreting results and troubleshooting.
• Document the prompt responses for each technique and estimated the Quality of Responses.
• Document Model farmwork's and deployment and usage in Streamlit.
Deployment Showcase
Challenges
• Predictive Difficulty:
Challenges in accurately predicting responses when models are focused solely on specific data types or prompts.
• Privacy and Data Restrictions:
Handling sensitive data while complying with privacy regulations can limit access to relevant information and
impact model performance.
• Integration Complexities:
Integrating prompt techniques and models with existing systems can be complex and time-consuming.
• Limited Data Availability:
Insufficient historical data or specific domain information can restrict the effectiveness and accuracy of prompt-
based models.
• External Factors:
Unexpected changes or external influences (e.g., shifts in domain-specific data, new research developments) could
affect model accuracy and prompt responses.
Future Scopes
Enhanced Prompt Techniques:
• Continuously improve prompt techniques by integrating advanced methods.
• Incorporate factors like prompt diversity, context relevance, and user interaction.
Model Optimization:
• Refine model performance by incorporating feedback and adjusting parameters.
• Consider additional data sources and domain-specific contexts for enhanced accuracy.
System Integration:
• Collaborate with developers to optimize the integration of prompt techniques within the Lang Chain framework.
• Streamline processes for real-time updates and efficient prompt handling.
User Experience Enhancement:
• Utilize feedback to predict and address user needs.
• Optimize interaction strategies and prompt effectiveness based on user insights.
Ad

More Related Content

Similar to AI-Enhanced RAG System for Automated University Course Content Generation (20)

Best Practices for Implementing Automated Functional Testing
Best Practices for Implementing Automated Functional TestingBest Practices for Implementing Automated Functional Testing
Best Practices for Implementing Automated Functional Testing
Jason Roy
 
chapter11-120827115420-phpapp01.pdf
chapter11-120827115420-phpapp01.pdfchapter11-120827115420-phpapp01.pdf
chapter11-120827115420-phpapp01.pdf
AxmedMaxamuud6
 
Performance Management to Program Evaluation: Creating a Complementary Connec...
Performance Management to Program Evaluation: Creating a Complementary Connec...Performance Management to Program Evaluation: Creating a Complementary Connec...
Performance Management to Program Evaluation: Creating a Complementary Connec...
nicholes21
 
1) Question Add Targets to Balanced score Card
1) Question  Add Targets to Balanced score Card1) Question  Add Targets to Balanced score Card
1) Question Add Targets to Balanced score Card
MartineMccracken314
 
1) Question Add Targets to Balanced score Card
1) Question  Add Targets to Balanced score Card1) Question  Add Targets to Balanced score Card
1) Question Add Targets to Balanced score Card
AbbyWhyte974
 
1) question add targets to balanced score card
1) question  add targets to balanced score card1) question  add targets to balanced score card
1) question add targets to balanced score card
smile790243
 
Building successful and secure products with AI and ML
Building successful and secure products with AI and MLBuilding successful and secure products with AI and ML
Building successful and secure products with AI and ML
Simon Lia-Jonassen
 
Research on the Application of Deep Learning Algorithms in Image Classificati...
Research on the Application of Deep Learning Algorithms in Image Classificati...Research on the Application of Deep Learning Algorithms in Image Classificati...
Research on the Application of Deep Learning Algorithms in Image Classificati...
ContentPedia
 
Unlocking DataDriven Talent Intelligence Transforming TALENTX with Industry P...
Unlocking DataDriven Talent Intelligence Transforming TALENTX with Industry P...Unlocking DataDriven Talent Intelligence Transforming TALENTX with Industry P...
Unlocking DataDriven Talent Intelligence Transforming TALENTX with Industry P...
Prasanna Hegde
 
Testing of Object-Oriented Software
Testing of Object-Oriented SoftwareTesting of Object-Oriented Software
Testing of Object-Oriented Software
Praveen Penumathsa
 
Final ec2 kt
Final ec2 ktFinal ec2 kt
Final ec2 kt
Katie Tran
 
Ml ops intro session
Ml ops   intro sessionMl ops   intro session
Ml ops intro session
Avinash Patil
 
How Machine learning Integration supports testing automation in software
How Machine learning Integration supports testing automation in softwareHow Machine learning Integration supports testing automation in software
How Machine learning Integration supports testing automation in software
hamzaaftab25
 
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02
PMI_IREP_TP
 
Day 1 1620 - 1705 - maple - pranabendu bhattacharyya
Day 1   1620 - 1705 - maple - pranabendu bhattacharyyaDay 1   1620 - 1705 - maple - pranabendu bhattacharyya
Day 1 1620 - 1705 - maple - pranabendu bhattacharyya
PMI2011
 
career guidance using ml and python for college students projects
career guidance using ml and python for college students projectscareer guidance using ml and python for college students projects
career guidance using ml and python for college students projects
Hamed Raza
 
Chapter 15 software product metrics
Chapter 15 software product metricsChapter 15 software product metrics
Chapter 15 software product metrics
SHREEHARI WADAWADAGI
 
ppt2.pptx
ppt2.pptxppt2.pptx
ppt2.pptx
JOHNNYGALLA2
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
Kamal Acharya
 
Unit 8 software quality and matrices
Unit 8 software quality and matricesUnit 8 software quality and matrices
Unit 8 software quality and matrices
Preeti Mishra
 
Best Practices for Implementing Automated Functional Testing
Best Practices for Implementing Automated Functional TestingBest Practices for Implementing Automated Functional Testing
Best Practices for Implementing Automated Functional Testing
Jason Roy
 
chapter11-120827115420-phpapp01.pdf
chapter11-120827115420-phpapp01.pdfchapter11-120827115420-phpapp01.pdf
chapter11-120827115420-phpapp01.pdf
AxmedMaxamuud6
 
Performance Management to Program Evaluation: Creating a Complementary Connec...
Performance Management to Program Evaluation: Creating a Complementary Connec...Performance Management to Program Evaluation: Creating a Complementary Connec...
Performance Management to Program Evaluation: Creating a Complementary Connec...
nicholes21
 
1) Question Add Targets to Balanced score Card
1) Question  Add Targets to Balanced score Card1) Question  Add Targets to Balanced score Card
1) Question Add Targets to Balanced score Card
MartineMccracken314
 
1) Question Add Targets to Balanced score Card
1) Question  Add Targets to Balanced score Card1) Question  Add Targets to Balanced score Card
1) Question Add Targets to Balanced score Card
AbbyWhyte974
 
1) question add targets to balanced score card
1) question  add targets to balanced score card1) question  add targets to balanced score card
1) question add targets to balanced score card
smile790243
 
Building successful and secure products with AI and ML
Building successful and secure products with AI and MLBuilding successful and secure products with AI and ML
Building successful and secure products with AI and ML
Simon Lia-Jonassen
 
Research on the Application of Deep Learning Algorithms in Image Classificati...
Research on the Application of Deep Learning Algorithms in Image Classificati...Research on the Application of Deep Learning Algorithms in Image Classificati...
Research on the Application of Deep Learning Algorithms in Image Classificati...
ContentPedia
 
Unlocking DataDriven Talent Intelligence Transforming TALENTX with Industry P...
Unlocking DataDriven Talent Intelligence Transforming TALENTX with Industry P...Unlocking DataDriven Talent Intelligence Transforming TALENTX with Industry P...
Unlocking DataDriven Talent Intelligence Transforming TALENTX with Industry P...
Prasanna Hegde
 
Testing of Object-Oriented Software
Testing of Object-Oriented SoftwareTesting of Object-Oriented Software
Testing of Object-Oriented Software
Praveen Penumathsa
 
Ml ops intro session
Ml ops   intro sessionMl ops   intro session
Ml ops intro session
Avinash Patil
 
How Machine learning Integration supports testing automation in software
How Machine learning Integration supports testing automation in softwareHow Machine learning Integration supports testing automation in software
How Machine learning Integration supports testing automation in software
hamzaaftab25
 
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02
Day1 1620-1705-maple-pranabendubhattacharyya-131008043643-phpapp02
PMI_IREP_TP
 
Day 1 1620 - 1705 - maple - pranabendu bhattacharyya
Day 1   1620 - 1705 - maple - pranabendu bhattacharyyaDay 1   1620 - 1705 - maple - pranabendu bhattacharyya
Day 1 1620 - 1705 - maple - pranabendu bhattacharyya
PMI2011
 
career guidance using ml and python for college students projects
career guidance using ml and python for college students projectscareer guidance using ml and python for college students projects
career guidance using ml and python for college students projects
Hamed Raza
 
Chapter 15 software product metrics
Chapter 15 software product metricsChapter 15 software product metrics
Chapter 15 software product metrics
SHREEHARI WADAWADAGI
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
Kamal Acharya
 
Unit 8 software quality and matrices
Unit 8 software quality and matricesUnit 8 software quality and matrices
Unit 8 software quality and matrices
Preeti Mishra
 

Recently uploaded (20)

Lagos School of Programming Final Project Updated.pdf
Lagos School of Programming Final Project Updated.pdfLagos School of Programming Final Project Updated.pdf
Lagos School of Programming Final Project Updated.pdf
benuju2016
 
Time series for yotube_1_data anlysis.pdf
Time series for yotube_1_data anlysis.pdfTime series for yotube_1_data anlysis.pdf
Time series for yotube_1_data anlysis.pdf
asmaamahmoudsaeed
 
Controlling Financial Processes at a Municipality
Controlling Financial Processes at a MunicipalityControlling Financial Processes at a Municipality
Controlling Financial Processes at a Municipality
Process mining Evangelist
 
CS-404 COA COURSE FILE JAN JUN 2025.docx
CS-404 COA COURSE FILE JAN JUN 2025.docxCS-404 COA COURSE FILE JAN JUN 2025.docx
CS-404 COA COURSE FILE JAN JUN 2025.docx
nidarizvitit
 
AI ------------------------------ W1L2.pptx
AI ------------------------------ W1L2.pptxAI ------------------------------ W1L2.pptx
AI ------------------------------ W1L2.pptx
AyeshaJalil6
 
How to Set Up Process Mining in a Decentralized Organization?
How to Set Up Process Mining in a Decentralized Organization?How to Set Up Process Mining in a Decentralized Organization?
How to Set Up Process Mining in a Decentralized Organization?
Process mining Evangelist
 
Dynamics 365 Business Rules Dynamics Dynamics
Dynamics 365 Business Rules Dynamics DynamicsDynamics 365 Business Rules Dynamics Dynamics
Dynamics 365 Business Rules Dynamics Dynamics
heyoubro69
 
report (maam dona subject).pptxhsgwiswhs
report (maam dona subject).pptxhsgwiswhsreport (maam dona subject).pptxhsgwiswhs
report (maam dona subject).pptxhsgwiswhs
AngelPinedaTaguinod
 
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
Taqyea
 
Analysis of Billboards hot 100 toop five hit makers on the chart.docx
Analysis of Billboards hot 100 toop five hit makers on the chart.docxAnalysis of Billboards hot 100 toop five hit makers on the chart.docx
Analysis of Billboards hot 100 toop five hit makers on the chart.docx
hershtara1
 
2-Raction quotient_١٠٠١٤٦.ppt of physical chemisstry
2-Raction quotient_١٠٠١٤٦.ppt of physical chemisstry2-Raction quotient_١٠٠١٤٦.ppt of physical chemisstry
2-Raction quotient_١٠٠١٤٦.ppt of physical chemisstry
bastakwyry
 
CERTIFIED BUSINESS ANALYSIS PROFESSIONAL™
CERTIFIED BUSINESS ANALYSIS PROFESSIONAL™CERTIFIED BUSINESS ANALYSIS PROFESSIONAL™
CERTIFIED BUSINESS ANALYSIS PROFESSIONAL™
muhammed84essa
 
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
disnakertransjabarda
 
AWS RDS Presentation to make concepts easy.pptx
AWS RDS Presentation to make concepts easy.pptxAWS RDS Presentation to make concepts easy.pptx
AWS RDS Presentation to make concepts easy.pptx
bharatkumarbhojwani
 
Fundamentals of Data Analysis, its types, tools, algorithms
Fundamentals of Data Analysis, its types, tools, algorithmsFundamentals of Data Analysis, its types, tools, algorithms
Fundamentals of Data Analysis, its types, tools, algorithms
priyaiyerkbcsc
 
Automation Platforms and Process Mining - success story
Automation Platforms and Process Mining - success storyAutomation Platforms and Process Mining - success story
Automation Platforms and Process Mining - success story
Process mining Evangelist
 
Feature Engineering for Electronic Health Record Systems
Feature Engineering for Electronic Health Record SystemsFeature Engineering for Electronic Health Record Systems
Feature Engineering for Electronic Health Record Systems
Process mining Evangelist
 
lecture_13 tree in mmmmmmmm mmmmmfftro.pptx
lecture_13 tree in mmmmmmmm     mmmmmfftro.pptxlecture_13 tree in mmmmmmmm     mmmmmfftro.pptx
lecture_13 tree in mmmmmmmm mmmmmfftro.pptx
sarajafffri058
 
Language Learning App Data Research by Globibo [2025]
Language Learning App Data Research by Globibo [2025]Language Learning App Data Research by Globibo [2025]
Language Learning App Data Research by Globibo [2025]
globibo
 
Understanding Complex Development Processes
Understanding Complex Development ProcessesUnderstanding Complex Development Processes
Understanding Complex Development Processes
Process mining Evangelist
 
Lagos School of Programming Final Project Updated.pdf
Lagos School of Programming Final Project Updated.pdfLagos School of Programming Final Project Updated.pdf
Lagos School of Programming Final Project Updated.pdf
benuju2016
 
Time series for yotube_1_data anlysis.pdf
Time series for yotube_1_data anlysis.pdfTime series for yotube_1_data anlysis.pdf
Time series for yotube_1_data anlysis.pdf
asmaamahmoudsaeed
 
Controlling Financial Processes at a Municipality
Controlling Financial Processes at a MunicipalityControlling Financial Processes at a Municipality
Controlling Financial Processes at a Municipality
Process mining Evangelist
 
CS-404 COA COURSE FILE JAN JUN 2025.docx
CS-404 COA COURSE FILE JAN JUN 2025.docxCS-404 COA COURSE FILE JAN JUN 2025.docx
CS-404 COA COURSE FILE JAN JUN 2025.docx
nidarizvitit
 
AI ------------------------------ W1L2.pptx
AI ------------------------------ W1L2.pptxAI ------------------------------ W1L2.pptx
AI ------------------------------ W1L2.pptx
AyeshaJalil6
 
How to Set Up Process Mining in a Decentralized Organization?
How to Set Up Process Mining in a Decentralized Organization?How to Set Up Process Mining in a Decentralized Organization?
How to Set Up Process Mining in a Decentralized Organization?
Process mining Evangelist
 
Dynamics 365 Business Rules Dynamics Dynamics
Dynamics 365 Business Rules Dynamics DynamicsDynamics 365 Business Rules Dynamics Dynamics
Dynamics 365 Business Rules Dynamics Dynamics
heyoubro69
 
report (maam dona subject).pptxhsgwiswhs
report (maam dona subject).pptxhsgwiswhsreport (maam dona subject).pptxhsgwiswhs
report (maam dona subject).pptxhsgwiswhs
AngelPinedaTaguinod
 
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
文凭证书美国SDSU文凭圣地亚哥州立大学学生证学历认证查询
Taqyea
 
Analysis of Billboards hot 100 toop five hit makers on the chart.docx
Analysis of Billboards hot 100 toop five hit makers on the chart.docxAnalysis of Billboards hot 100 toop five hit makers on the chart.docx
Analysis of Billboards hot 100 toop five hit makers on the chart.docx
hershtara1
 
2-Raction quotient_١٠٠١٤٦.ppt of physical chemisstry
2-Raction quotient_١٠٠١٤٦.ppt of physical chemisstry2-Raction quotient_١٠٠١٤٦.ppt of physical chemisstry
2-Raction quotient_١٠٠١٤٦.ppt of physical chemisstry
bastakwyry
 
CERTIFIED BUSINESS ANALYSIS PROFESSIONAL™
CERTIFIED BUSINESS ANALYSIS PROFESSIONAL™CERTIFIED BUSINESS ANALYSIS PROFESSIONAL™
CERTIFIED BUSINESS ANALYSIS PROFESSIONAL™
muhammed84essa
 
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
disnakertransjabarda
 
AWS RDS Presentation to make concepts easy.pptx
AWS RDS Presentation to make concepts easy.pptxAWS RDS Presentation to make concepts easy.pptx
AWS RDS Presentation to make concepts easy.pptx
bharatkumarbhojwani
 
Fundamentals of Data Analysis, its types, tools, algorithms
Fundamentals of Data Analysis, its types, tools, algorithmsFundamentals of Data Analysis, its types, tools, algorithms
Fundamentals of Data Analysis, its types, tools, algorithms
priyaiyerkbcsc
 
Automation Platforms and Process Mining - success story
Automation Platforms and Process Mining - success storyAutomation Platforms and Process Mining - success story
Automation Platforms and Process Mining - success story
Process mining Evangelist
 
Feature Engineering for Electronic Health Record Systems
Feature Engineering for Electronic Health Record SystemsFeature Engineering for Electronic Health Record Systems
Feature Engineering for Electronic Health Record Systems
Process mining Evangelist
 
lecture_13 tree in mmmmmmmm mmmmmfftro.pptx
lecture_13 tree in mmmmmmmm     mmmmmfftro.pptxlecture_13 tree in mmmmmmmm     mmmmmfftro.pptx
lecture_13 tree in mmmmmmmm mmmmmfftro.pptx
sarajafffri058
 
Language Learning App Data Research by Globibo [2025]
Language Learning App Data Research by Globibo [2025]Language Learning App Data Research by Globibo [2025]
Language Learning App Data Research by Globibo [2025]
globibo
 
Ad

AI-Enhanced RAG System for Automated University Course Content Generation

  • 1. AI-Enhanced RAG System for Automated University Course Content Generation
  • 2. Contents 2 Project Overview and Scope Business Problem Business Understanding CRISP-ML(Q) Methodology Technical Stacks Project Architecture Model Building Prompt Templates (Techniques) Model Deployment - Strategy Deployment Challenges Future Scopes Prompt Templates Response Estimation Model Response Accuracy
  • 3. Project Overview and Scope 3 Overview Project Purpose: Create an AI system to automatically generate and improve university course content using domain-specific data. 3 Major Phases: • Data Collect and clean domain-specific PDFs and data. • Prompt Testing Analyse how different prompts affect the model. • Response Generation Create and improve responses using the model. Project Scope: Develop an AI system to generate and evaluate university course content based on domain-specific data.
  • 4. Business Problem 4 • The university requires a wide range of on-demand content. • Manual content creation is time-consuming and resource-intensive.
  • 5. Business Understanding 5 • Business Objective: Improve content generation efficiency and quality while minimizing the time required. • Business Constraint: Ensure content originality to avoid plagiarism. Success Criteria: • Business Success Criteria: Reduce content generation time by up to 80%. • Machine Learning Success Criteria: Keep question duplication rate below 5%. • Economic Success Criteria: Increase revenue from online courses by 15% in the first semester.
  • 6. Business Understanding 5 AI-Driven Course Content Generation • A Retrieval-Augmented Generation (RAG) system leveraging advanced AI models. • Key models: Gemini-1.5-Pro , Gemini-1.5-Flash . • Over various prompt techniques used for optimized content accuracy.
  • 7. CRISP-DM(Q) Methodology This project involve 6 phases of CRISP-ML(Q) methodology: Business and Data Understanding: • The goal is to develop a Retrieval-Augmented Generation (RAG) system using advanced AI latest models. • The system integrates over various prompt techniques to enhance content accuracy and relevance. Data Preparation: • Gather domain-specific academic data. • Clean and label the data for AI training. • Ensure datasets are diverse and representative, covering all course topics. • Provide an option for students to upload their course PDFs and retrieve relevant information from them. Model Building and Tuning: • Utilize the latest Gemini API models (Gemini-1.5 Pro and Flash) for content generation. • Fine-tune model parameters by optimizing various prompt techniques to improve system performance. Evaluation: • Segment course content based on topic and difficulty. • Apply instructional design principles to guide AI model training. • Incorporate strategies like spaced repetition and interactive quizzes for effective learning outcomes.
  • 8. CRISP-ML(Q) Methodology Model Deployment: • Deploy the model using a user-friendly interface, like Streamlit, for seamless interaction. • Establish a robust feedback loop that continuously collects user inputs for model refinement. • Implement automated retraining processes that leverage real-time user interaction data. • Ensure the system is adaptable to curriculum updates and evolving educational needs, maintaining content quality. • Implement automated retraining using user interaction data. Monitoring and Maintenance: • Continuously monitor system performance and user engagement metrics to identify areas for improvement. • Regularly update the model to incorporate new content, optimizing the relevance of generated material. • Maintain alignment with institutional goals by integrating feedback from both instructors and students.
  • 9. Technical Stacks Programming Languages: • Python Api Models: • Gemin-1.5-pro • Gemini-1.5-flash Optimization: • Prompt Techniques Database (for Data Storage and Retrieval): • Vector DB • Local directory Notebooks and Development Environment: • Google Collab • Visual Studio Code Deployment • Streamlit for user interface and interaction
  • 12. Model Building Lang Chain Framework 1. Select Models: • 2 types of model being utilized in this project which is the Gemin-1.5-pro / Gemini-1.5-flash Models and embedding-004 Model. 2. Retrieval & Augmented with Models: • Gather domain-specific academic data. • Generate responses from domain-specific PDFs. • Estimate response quality. 3. Integration: • Utilize the Lang-Chain framework for the RAG system. • Access prompt techniques via prompt_techniques.py file using Gemini API models.
  • 13. Model Building 3. Generate Response: • The models generate responses based on prompt instructions outlined in the prompt techniques.py' file. • For each query, the generated content is aligned with predefined guidelines and optimized for accuracy using the fine-tuned prompt techniques. • For each generated output, the relevance and precision of the content are assessed against a validation set to ensure high-quality results.
  • 16. Model Response Accuracy We evaluated the effectiveness of different prompt techniques by observing whether they followed the given instructions correctly. To compare these techniques, we performed an estimation based on their performance. To estimate the effectiveness of different prompt techniques, you can consider the following methods: • Accuracy of Instruction Following: Check how closely each response adheres to the specific instructions given in the prompt. • Consistency: Compare how consistently each technique generates relevant and accurate responses across different prompts. • Quality of Responses: Assess the quality of the answers based on criteria such as relevance, coherence, and informativeness. • User Feedback: Gather feedback from users or evaluators on the usefulness and clarity of responses generated by each technique. • Quantitative Metrics: If applicable, using metrics like response length, relevancy scores, or error rates to objectively measure the performance of each technique.
  • 17. Model Deployment - Strategy 1. Test on New Data: • Evaluate prompt techniques with domain-specific documents and PDFs. • Validate performance with new datasets. 2. Continuous Monitoring: • Monitor effectiveness with incoming data. • Refine techniques for accuracy and relevance. 3. Feedback & Iteration: • Collect user feedback. • Iterate and improve prompt strategies. 4. Streamlit Deployment: • Deploy prompts in a Streamlit interface for real-time testing. • Allow interactive user input and feedback. 5. Documentation: • Provide guidelines for interpreting results and troubleshooting. • Document the prompt responses for each technique and estimated the Quality of Responses. • Document Model farmwork's and deployment and usage in Streamlit.
  • 19. Challenges • Predictive Difficulty: Challenges in accurately predicting responses when models are focused solely on specific data types or prompts. • Privacy and Data Restrictions: Handling sensitive data while complying with privacy regulations can limit access to relevant information and impact model performance. • Integration Complexities: Integrating prompt techniques and models with existing systems can be complex and time-consuming. • Limited Data Availability: Insufficient historical data or specific domain information can restrict the effectiveness and accuracy of prompt- based models. • External Factors: Unexpected changes or external influences (e.g., shifts in domain-specific data, new research developments) could affect model accuracy and prompt responses.
  • 20. Future Scopes Enhanced Prompt Techniques: • Continuously improve prompt techniques by integrating advanced methods. • Incorporate factors like prompt diversity, context relevance, and user interaction. Model Optimization: • Refine model performance by incorporating feedback and adjusting parameters. • Consider additional data sources and domain-specific contexts for enhanced accuracy. System Integration: • Collaborate with developers to optimize the integration of prompt techniques within the Lang Chain framework. • Streamline processes for real-time updates and efficient prompt handling. User Experience Enhancement: • Utilize feedback to predict and address user needs. • Optimize interaction strategies and prompt effectiveness based on user insights.
  翻译: