Commit Graph

1 Commits

Author SHA1 Message Date
Ariel Frischer 686ab42f88
Add Deepseek Provider Support and Custom Deepseek Reasoner Chat Model (#50)
* chore: Add DeepSeek provider environment variable support in env.py

* feat: Add DeepSeek provider validation strategy in provider_strategy.py

* feat: Add support for DEEPSEEK provider in initialize_llm function

* feat: Create ChatDeepseekReasoner for custom handling of R1 models

* feat: Configure custom OpenAI client for DeepSeek API integration

* chore: Remove unused json import from deepseek_chat.py

* refactor: Simplify invocation_params and update acompletion_with_retry method

* feat: Override _generate to ensure message alternation in DeepseekReasoner

* feat: Add support for ChatDeepseekReasoner in LLM initialization

* feat: Use custom ChatDeepseekReasoner for DeepSeek models in OpenRouter

* fix: Remove redundant condition for DeepSeek model initialization

* feat: Add DeepSeek support for expert model initialization in llm.py

* feat: Add DeepSeek model handling for OpenRouter in expert LLM initialization

* fix: Update model name checks for DeepSeek and OpenRouter providers

* refactor: Extract common logic for LLM initialization into reusable methods

* test: Add unit tests for DeepSeek and OpenRouter functionality

* test: Refactor tests to match updated LLM initialization and helpers

* fix: Import missing helper functions to resolve NameError in tests

* fix: Resolve NameError and improve environment variable fallback logic

* feat(readme): add DeepSeek API key requirements to documentation for better clarity on environment variables
feat(main.py): include DeepSeek as a supported provider in argument parsing for enhanced functionality
feat(deepseek_chat.py): implement ChatDeepseekReasoner class for handling DeepSeek reasoning models
feat(llm.py): add DeepSeek client creation logic to support DeepSeek models in the application
feat(models_tokens.py): define token limits for DeepSeek models to manage resource allocation
fix(provider_strategy.py): correct validation logic for DeepSeek environment variables to ensure proper configuration
chore(memory.py): refactor global memory structure for better readability and maintainability in the codebase
2025-01-22 07:21:10 -05:00