Intelligent Caching

Tags:

Assistant

Ditana Assistant implements a sophisticated request caching system for both Wolfram|Alpha and LLM queries. This feature optimizes API usage and improves response times, especially for repeated or similar queries.

Key aspects of the caching system:

Caches all outgoing requests to APIs.
Implements an intelligent lifetime management for cache entries.
Adapts cache lifetime based on the stability of responses.
Initial cache lifetime depends on API: 1 week for LLMs, ≈ 11 minutes for Wolfram|Alpha.
Particularly beneficial for internal queries made by Ditana Assistant, which often lack dialog history.
Maximum cache size can be configured.

This caching mechanism significantly reduces API calls, especially for short, repetitive queries, leading to improved performance and reduced API usage.

Feedback

Was this page helpful?

Glad to hear it! Please tell us how we can improve.

Sorry to hear that. Please tell us how we can improve.

Last modified October 27, 2024: docs: enhance documentation and performance for Ditana Assistant (080e8b4)