# Design Considerations and Constraints

* A Streamlit app
* A free LLM API that allows us to test without worrying about running out of credits - gemini-2.5-flash-preview-04-17-thinking
* A way to send relevant information/grounding context to the LLM
  * We tried sending the full dataset as a dataframe containing approx. 4.1M tokens. This failed because Gemini accepts a maximum of 1M input tokens
  * We tried splitting the full dataset into 5 equal chunks (worked) and uploading each chunk to the chat interface (worked), but we were unable to use all 5 chunks (i.e. the full dataset) in the chat (failed)
  * Since we were unable to send the full dataset to Gemini, we explored sending a summary of the dataset instead
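The token-limit workaround above can be sketched as follows. This is an illustrative sketch only: the ~4-characters-per-token heuristic and the helper names are our assumptions, not Gemini's actual tokeniser or our exact code.

```python
import math


def estimate_tokens(text: str, chars_per_token: float = 4.0) -> int:
    """Rough token estimate for English text (~4 characters per token).

    This is a heuristic, not Gemini's tokeniser; it is only used to check
    whether a payload is likely to exceed the 1M-input-token limit.
    """
    return math.ceil(len(text) / chars_per_token)


def split_into_chunks(rows: list[str], n_chunks: int = 5) -> list[list[str]]:
    """Split serialised dataset rows into n roughly equal chunks."""
    size = math.ceil(len(rows) / n_chunks)
    return [rows[i:i + size] for i in range(0, len(rows), size)]
```

Even with chunks that individually fit under the limit, the chat could not use all five together, which is what pushed us towards sending a summary instead.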
# Architecture

* With the above design considerations and constraints, we created the chatbot with the following architecture
* Preparation steps
  * For the dataset and model
    * Download the dataset
    * Exploratory data analysis
    * Prepare the grounding information
    * Prepare the system prompt
    * Configure the Gemini API chat
    * Set up the tool call - GoogleSearch
  * For the Streamlit app
    * Configure the chat interface
* During normal usage of the chatbot
  * The user loads the chat interface and picks a question from the 'Example questions' section
  * The user types their question in the 'Ask a question' box and submits it to Gemini (our LLM)
  * The system prompt (which contains the dataframe summary, i.e. the grounding information) is also sent to Gemini
  * Gemini can
    * Analyse the user's question and respond directly, sending the response back to our Streamlit app, which processes it and shows it in the chat box
    * Alternatively, assess whether a tool call is required
      * If Gemini decides a tool call is needed to answer an 'out of domain' question, it makes one or more calls to the GoogleSearch tool
      * Gemini then formulates a response and sends it to our Streamlit app to process and show in the chat box
      * We also extract the search links and show them below the LLM's answer, so users can verify the LLM's response if they wish
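A minimal sketch of how the grounding summary and chat configuration above could be wired together. The `google-genai` SDK calls in `create_chat` are an assumption based on that SDK's documented shape, not our exact code, and the prompt wording is illustrative.

```python
import pandas as pd


def build_system_prompt(df: pd.DataFrame) -> str:
    """Build the system prompt containing the dataframe summary
    (the grounding information sent to Gemini with every question)."""
    summary = df.describe(include="all").to_string()
    return (
        "You are a chatbot that answers questions about a dataset.\n"
        "Ground your answers in this dataframe summary:\n"
        f"{summary}"
    )


def create_chat(system_prompt: str):
    """Create a Gemini chat session with the GoogleSearch tool enabled.

    Assumed wiring for the google-genai SDK; requires a GEMINI_API_KEY
    in the environment, so it is not called at import time.
    """
    from google import genai
    from google.genai import types

    client = genai.Client()  # reads GEMINI_API_KEY from the environment
    return client.chats.create(
        model="gemini-2.5-flash-preview-04-17-thinking",
        config=types.GenerateContentConfig(
            system_instruction=system_prompt,
            tools=[types.Tool(google_search=types.GoogleSearch())],
        ),
    )
```

In the Streamlit app, each submitted question would be sent with `chat.send_message(...)`, and the response (plus any extracted search links) rendered in the chat box.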