Kuwa OS v0.2.0-beta Released

April 15, 2024 · 4 min read

Yung-Hsiang Hu

Contributor

info

This version does not include the TAIDE model itself, and a version pre-loaded with the TAIDE model is expected to be released after the TAIDE model is publicly available.

Hello to our community friends,

After collecting everyone's feedback, we plan to roll out the long-awaited RAG feature in v0.2.0. The RAG part has been internally tested, so we are releasing v0.2.0-beta to invite everyone to test it out and see if it meets your expectations.
In addition, this update also provides a way to connect with TAIDE API and TAIDE models.
At the same time, we have also adjusted the system installation script and fixed some known bugs, making the entire system more stable, easier to extend, and easier to use.
If you have any suggestions or if you think there is room for improvement, please let us know!

The details of this update are as follows:

Windows Portable Edition

Adjust the model hosting method:
- Enable Gemini Pro and ChatGPT APIs by default
- Use Gemini Pro by default to launch RAG applications for WebQA and Document QA
Deprecate env.bat and use run.bat to launch the executor instead
Enhance executor functionality:
- Allow direct configuration of execution instructions, parameters, and other information
- Adjust init.bat to be a simple tool to help create run.bat. Users can also directly write run.bat to launch the required model
Fix the error of a non-existent PHP download link in v0.1.0 (archived due to version update)
Integrate RAG into the simplified launch framework of the Windows version
Specify file path improvements:
- In the executors folder of the Windows version, files will be specified using relative paths by default
Fix the executor error of the Custom category
Permission adjustment:
- Only groups with Manage Tab permission can be directly assigned permission to use the model when the model is added
Fix the issue in the Windows version where Redis uses localhost as the IP, causing DNS queries to be delayed by 2 seconds each time

Docker Edition

Integrate RAG (Document QA / Web QA / DB QA) into the executor's Docker image and compose the system
Provide a compose example of Gemini Pro using a global API key
Complete the missing words in the document

Executor

Provide a TAIDE API executor, which can be directly connected to the TAIDE API of TECO
Port RAG executor (Document QA / Web QA / DB QA / Search QA) to the new framework
Let the RAG executor support automatic model search, i18n, and interrupt generation

Kernel

Provide an API to list currently available executors

Multi-chat

Adjust the timeout waiting time for the no-message state:
- Extend from 1 minute to 2 minutes to accommodate the waiting time for the RAG processing speed gap
Fix the bug of AdminSeeder:
- Fix the issue of granting duplicate model usage permissions
Add the function of sending Kuwa tokens from the website to the executor:
- To fix the past issue of unowned API tokens for RAG
Add a method to adjust the default model image path:
- Can be configured via LLM_DEFAULT_IMG in the .env file
Fix the bug that the API in v0.1.0 could not be used normally

Known Issues and Limitations

At present, the Windows version of Document QA can process files in .doc and .docx formats. However, due to library dependency issues, it may not be able to read certain .pdf files. If such a need arises, please consider utilizing the Linux version of Kuwa for PDF processing.
RAG applications tend to generate long input. If only using the CPU-based version of the on-premises model, timeout errors can occur more easily. In this case, we recommend either using the cloud-based model, or using the GPU-based version of the on-premises model, and then using the RAG application.

Windows Portable Edition​

Docker Edition​

Executor​

Kernel​

Multi-chat​