Kalo Chin
|
c8dcde6cd0
|
fix: Gemini 2.0 Flash 001 model yaml file naming (#13372)
|
2025-02-08 09:12:42 +08:00 |
Riddhimaan-Senapati
|
8f9db61688
|
feat: added new silicon flow models (#13369)
|
2025-02-08 09:12:22 +08:00 |
Steven sun
|
38c31e64db
|
add enable_search parameter to qwen_max, plus, turbo (#13335)
Co-authored-by: steven <sunzwj@digitalchina.com>
|
2025-02-07 22:16:26 +08:00 |
-LAN-
|
413dfd5628
|
feat: add completion mode and context size options for LLM configuration (#13325)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2025-02-07 15:08:53 +08:00 |
-LAN-
|
f9515901cc
|
fix: Azure AI Foundry model cannot be used in the workflow (#13323)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2025-02-07 14:52:57 +08:00 |
呆萌闷油瓶
|
3f42fabff8
|
chore:improve thinking display for llm from xinference and ollama pro… (#13318)
|
2025-02-07 14:29:29 +08:00 |
-LAN-
|
1caa578771
|
chore(*): Update style of thinking (#13319)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2025-02-07 14:06:35 +08:00 |
非法操作
|
3eb3db0663
|
chore: refactor the OpenAICompatible and improve thinking display (#13299)
|
2025-02-07 13:28:46 +08:00 |
sino
|
6e5c915f96
|
feat(model): add deepseek-r1 for openrouter (#13312)
|
2025-02-07 12:39:13 +08:00 |
Riddhimaan-Senapati
|
2348abe4bf
|
feat: added a couple of models not defined in vertex ai, that were already … (#13296)
|
2025-02-07 09:11:25 +08:00 |
呆萌闷油瓶
|
f7e7a399d9
|
feat:add think tag display for xinference deepseek r1 (#13291)
|
2025-02-06 22:04:58 +08:00 |
zhu-an
|
16865d43a8
|
feat: add deepseek models for volcengine provider (#13283)
Co-authored-by: zhaoqingyu.1075 <zhaoqingyu.1075@bytedance.com>
|
2025-02-06 18:20:03 +08:00 |
呆萌闷油瓶
|
0d13aee15c
|
feat:add deepseek r1 think display for ollama provider (#13272)
|
2025-02-06 15:32:10 +08:00 |
engchina
|
40dd63ecef
|
Upgrade oracle models (#13174)
Co-authored-by: engchina <atjapan2015@gmail.com>
|
2025-02-06 13:24:27 +08:00 |
-LAN-
|
6d66d6da15
|
feat(model_providers): Support deepseek-r1 for Nvidia Catalog (#13269)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2025-02-06 13:03:19 +08:00 |
-LAN-
|
87763fc234
|
feat(model_providers): Support deepseek for Azure AI Foundry (#13267)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2025-02-06 12:45:48 +08:00 |
JasonVV
|
f6c44cae2e
|
feat(model): add gemini-2.0 model (#13266)
|
2025-02-06 12:28:59 +08:00 |
xhe
|
da2ee04fce
|
fix: correct linewrap think display in generic openai api (#13260)
Signed-off-by: xhe <xw897002528@gmail.com>
|
2025-02-06 10:53:08 +08:00 |
JasonVV
|
7673c36af3
|
feat(model): add gemini-2.0-flash-thinking-exp-01-21 (#13230)
|
2025-02-06 10:01:00 +08:00 |
Riddhimaan-Senapati
|
9457b2af2f
|
feat: added models: gemini 2.0 flash 001 and gemini 2.0 pro exp 02-05 (#13247)
|
2025-02-06 09:58:39 +08:00 |
k-zaku
|
7203991032
|
feat: add parameter "reasoning_effort" and Openai o3-mini (#13243)
|
2025-02-06 09:29:48 +08:00 |
xhe
|
5a685f7156
|
feat: add think display for volcengine and generic openapi (#13234)
Signed-off-by: xhe <xw897002528@gmail.com>
|
2025-02-06 09:24:40 +08:00 |
Riddhimaan-Senapati
|
a6a25030ad
|
fix: updated _position.yaml to include the latest model already integ… (#13245)
|
2025-02-06 09:21:51 +08:00 |
Riddhimaan-Senapati
|
00458a31d5
|
feat: added deepseek r1 and v3 to siliconflow (#13238)
|
2025-02-05 21:59:18 +08:00 |
-LAN-
|
c6ddf6d6cc
|
feat(model_providers): Add Groq DeepSeek-R1-Distill-Llama-70b (#13229)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2025-02-05 19:15:29 +08:00 |
Joshbly
|
34b21b3065
|
feat: Add o3-mini and o3-mini-2025-01-31 model variants (#13129)
Co-authored-by: crazywoola <427733928@qq.com>
|
2025-02-05 17:04:45 +08:00 |
-LAN-
|
59ca44f493
|
chore(model_runtime): Move deepseek ahead in the providers list. (#13197)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2025-02-05 16:08:28 +08:00 |
MaFee921
|
1a2523fd15
|
feat: bedrock_endpoint_url (#12838)
|
2025-02-05 12:24:24 +08:00 |
Kei YAMAZAKI
|
7452032d81
|
add azure openai api version 2024-12-01-preview (#13135)
|
2025-02-03 11:04:20 +08:00 |
非法操作
|
840729afa5
|
feat: the think tag display of siliconflow's deepseek r1 (#13153)
|
2025-02-02 21:55:13 +08:00 |
Yingchun Lai
|
b09c39c8dc
|
refactor: avoid using extra space when finding model by name (#13043)
|
2025-01-30 15:08:29 +08:00 |
heyszt
|
b4b09ddc3c
|
add tongyi qwen2.5-14b/7b-instruct-1m model (#13089)
|
2025-01-29 11:58:01 +08:00 |
Yingchun Lai
|
d44882c1b5
|
refactor: reduce duplicate code by inheritance (#13073)
|
2025-01-28 10:52:01 +08:00 |
Jason
|
560c5de1b7
|
Fixed Novita AI color and added DeepSeek R1 model (#13074)
|
2025-01-28 10:38:54 +08:00 |
heyszt
|
6c31ee36cd
|
fix qwen-vl blocking mode (#13052)
|
2025-01-27 11:35:23 +08:00 |
Jason
|
d4be5ef9de
|
Update Novita AI predefined models (#13045)
|
2025-01-26 09:25:29 +08:00 |
非法操作
|
59b3e672aa
|
feat: add agent thinking content display of deepseek R1 (#12949)
|
2025-01-24 20:13:42 +08:00 |
IWAI, Masaharu
|
a2f8bce8f5
|
chore: add Japanese translation: model_providers/bedrock (#13016)
|
2025-01-24 18:43:33 +08:00 |
IWAI, Masaharu
|
28067640b5
|
fix: wrong zh_Hans translation: Ohio (#13006)
|
2025-01-24 13:41:20 +08:00 |
lowell
|
da67916843
|
feat: add glm-4-air-0111 (#12997)
Co-authored-by: lowell <lowell.hu@zkteco.in>
|
2025-01-24 10:04:46 +08:00 |
sino
|
d167d5b1be
|
feat(ark): support doubao 1.5 series of models (#12935)
|
2025-01-22 15:25:57 +08:00 |
jiandanfeng
|
e23f4b0265
|
feat: add gemini-2.0-flash-thinking-exp-01-21 (#12924)
|
2025-01-22 10:14:37 +08:00 |
luckylhb90
|
3d1ce4c53f
|
bug: fixed bedrock rerank bug (#12774)
Co-authored-by: hobo.l <hobo.l@binance.com>
|
2025-01-21 19:09:36 +08:00 |
k-zaku
|
46e95e8309
|
fix: OpenAI o1 Bad Request Error (#12839)
|
2025-01-21 15:29:13 +08:00 |
JasonVV
|
a7b9375877
|
Update deepseek model configuration (#12899)
|
2025-01-21 15:28:11 +08:00 |
JasonVV
|
9903f1e703
|
add deepseek-reasoner (#12898)
|
2025-01-21 12:40:58 +08:00 |
Bowen Liang
|
166221d784
|
chore(lint): fix quotes for f-string formatting by bumping ruff to 0.9.x (#12702)
|
2025-01-21 10:12:29 +08:00 |
Ding Jiatong
|
925d69a2ee
|
feat:Support Minimax-Text-01 (#12763)
|
2025-01-21 10:08:53 +08:00 |
jiandanfeng
|
9d86147d20
|
fix: SparkLite API Auth error (#12781) (#12790)
|
2025-01-20 22:21:21 +08:00 |
jiandanfeng
|
6ea77ab4cd
|
fix: DeepSeek API Error with response format active (text and json_object) (#12747)
|
2025-01-20 22:04:18 +08:00 |
yihong
|
4e101604c3
|
fix: ruff check for True if ... else (#12576)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
|
2025-01-13 09:38:48 +08:00 |
Gen Sato
|
dbe7a7c4fd
|
Fix: Add a INFO-level log when fallback to gpt2tokenizer (#12508)
|
2025-01-09 14:37:46 +08:00 |
-LAN-
|
0a49d3dd52
|
fix: tiktoken cannot be loaded without internet (#12478)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2025-01-08 14:49:44 +08:00 |
crazywoola
|
6222179a57
|
Revert "fix:deepseek tool call not working correctly" (#12463)
|
2025-01-08 10:50:34 +08:00 |
Infinitnet
|
4e6c86341d
|
Add 'document' feature to Sonnet 3.5 through OpenRouter (#12444)
|
2025-01-07 19:51:38 +08:00 |
呆萌闷油瓶
|
9677144015
|
fix:deepseek tool call not working correctly (#12437)
|
2025-01-07 17:25:38 +08:00 |
SiliconFlow, Inc
|
15797c556f
|
add fish-speech-1.5 from siliconflow (#12425)
|
2025-01-07 15:27:34 +08:00 |
-LAN-
|
d3f5b1cbb6
|
refactor: use tiktoken for token calculation (#12416)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2025-01-07 13:32:30 +08:00 |
SiliconFlow, Inc
|
dc650c5368
|
Fixes #12414: Add cheaper model and long context model for Qwen2.5-72B-Instruct from siliconflow (#12415)
|
2025-01-07 11:28:24 +08:00 |
Alex Chen
|
2bb521b135
|
Support TTS and Speech2Text for Model Provider GPUStack (#12381)
|
2025-01-07 09:42:11 +08:00 |
SiliconFlow, Inc
|
409cc7d9b0
|
mark deprecated models in siliconflow #12399 (#12405)
Co-authored-by: crazywoola <427733928@qq.com>
|
2025-01-07 09:08:58 +08:00 |
Warren Chen
|
147d578922
|
[Fix] revert sagemaker llm to support model hub (#12378)
|
2025-01-06 18:01:45 +08:00 |
方程
|
6df17a334c
|
fix: Update the API call address for the text_embedding model (#12342)
Co-authored-by: 方程 <fangcheng@oschina.cn>
|
2025-01-03 19:19:17 +08:00 |
jifei
|
3c2e30f348
|
fix: #12143 support streaming mode content start with "data:" (#12171)
|
2025-01-03 16:33:37 +08:00 |
丹枫染秋色
|
7c1961e618
|
feat: Add response format support to GLM-4 (#12252)
|
2025-01-03 09:38:50 +08:00 |
xander-art
|
baeddd4d15
|
feat:Add support for stop parameter in hunyuan model #12313 (#12315)
Co-authored-by: xander-art <xander-art@gmail.com>
|
2025-01-03 09:15:04 +08:00 |
-LAN-
|
6f5a8a33d9
|
refactor: replace gevent threadpool with ProcessPoolExecutor in GPT2Tokenizer (#12316)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2025-01-03 09:13:18 +08:00 |
Giovanny Gutiérrez
|
d7c0bc8c23
|
feat: Add response format support for openai compat models (#12240)
Co-authored-by: Gio Gutierrez <giovannygutierrez@gmail.com>
|
2025-01-02 09:59:34 +08:00 |
yihong
|
f30bf08580
|
fix: close #12215 for yi special case (#12222)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
|
2025-01-02 09:58:34 +08:00 |
Warren Chen
|
9954ddb780
|
[Fix] modify sagemaker llm (#12274)
|
2025-01-02 09:49:11 +08:00 |
-LAN-
|
6a85960605
|
feat: implement asynchronous token counting in GPT2Tokenizer (#12239)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2024-12-31 17:02:08 +08:00 |
Kepler
|
2a909e634b
|
feat: support Ernie-lite-pro-128k (#12161)
Co-authored-by: bigfish49 <bigfish49@126.com>
|
2024-12-27 20:23:46 +08:00 |
jiangbo721
|
c98d91e44d
|
fix: o1 model error, use max_completion_tokens instead of max_tokens. (#12037)
Co-authored-by: 刘江波 <jiangbo721@163.com>
|
2024-12-25 13:29:43 +08:00 |
yihong
|
56e15d09a9
|
feat: mypy for all type check (#10921)
|
2024-12-24 18:38:51 +08:00 |
yihong
|
6a0ff3686c
|
fix: fix typo (#12034)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
|
2024-12-24 15:23:27 +08:00 |
-LAN-
|
af2888d394
|
fix: remove json_schema if response format is disabled. (#12014)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2024-12-23 17:53:57 +08:00 |
-LAN-
|
10caab1729
|
fix: change CredentialsValidateFailedError to inherit from ValueError (#11950)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2024-12-22 10:43:31 +08:00 |
非法操作
|
366857cd26
|
fix: gemini system prompt with variable raise error (#11946)
|
2024-12-21 23:14:05 +08:00 |
-LAN-
|
455791b710
|
fix(model_runtime): make invoke as ValueError (#11929)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2024-12-21 21:22:14 +08:00 |
Kalo Chin
|
2681bafb76
|
fix: handle document fetching from URL in Anthropic LLM model, solving base64 decoding error (#11858)
|
2024-12-20 18:23:42 +08:00 |
yihong
|
7b03a0316d
|
fix: better memory usage from 800+ to 500+ (#11796)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
|
2024-12-20 14:51:43 +08:00 |
-LAN-
|
996a9135f6
|
feat(llm_node): support order in text and files (#11837)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2024-12-20 14:12:50 +08:00 |
Dr.MerdanBay
|
bb2f46d7cc
|
fix: add safe dictionary access for bedrock credentials (#11860)
|
2024-12-20 12:13:39 +09:00 |
yihong
|
463fbe2680
|
fix: better guard against NaN values from numpy for issue #11827 (#11864)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
|
2024-12-20 09:28:32 +08:00 |
非法操作
|
9d93ad1f16
|
feat: add gemini-2.0-flash-thinking-exp-1219 (#11863)
|
2024-12-20 09:26:31 +08:00 |
yihong
|
12d45e9114
|
fix: silicon change its model fix #11844 (#11847)
Signed-off-by: yihong0618 <zouzou0208@gmail.com>
|
2024-12-19 20:50:09 +08:00 |
barabicu
|
d057067543
|
fix: remove ruff ignore SIM300 (#11810)
|
2024-12-19 18:30:51 +08:00 |
sino
|
560d375e0f
|
feat(ark): add doubao-pro-256k and doubao-embedding-large (#11831)
|
2024-12-19 17:49:31 +08:00 |
Agung Besti
|
3388d6636c
|
add-model-azure-gpt-4o-2024-11-20 (#11803)
Co-authored-by: agungbesti <agung.besti@insignia.co.id>
|
2024-12-19 12:36:11 +08:00 |
xander-art
|
56434db4f5
|
feat:add hunyuan model(hunyuan-role, hunyuan-large, hunyuan-large-rol… (#11766)
Co-authored-by: xanderdong <xanderdong@tencent.com>
|
2024-12-18 15:25:53 +08:00 |
-LAN-
|
a5db7c9acb
|
feat: add openai o1 & update pricing and max_token of other models (#11780)
Signed-off-by: -LAN- <laipz8200@outlook.com>
|
2024-12-18 12:15:11 +08:00 |
非法操作
|
9048832a9a
|
chore: improve gemini models (#11745)
|
2024-12-17 17:42:21 +08:00 |
Shota Totsuka
|
7d5a385811
|
feat: use Gemini response metadata for token counting (#11743)
|
2024-12-17 17:42:05 +08:00 |
sino
|
99430a5931
|
feat(ark): support doubao vision series models (#11740)
|
2024-12-17 15:43:11 +08:00 |
非法操作
|
c9b4029ce7
|
chore: the consistency of MultiModalPromptMessageContent (#11721)
|
2024-12-17 15:01:38 +08:00 |
呆萌闷油瓶
|
cd4310df25
|
chore:update azure api version (#11711)
|
2024-12-17 13:39:56 +08:00 |
非法操作
|
74fdc16bd1
|
feat: enhance gemini models (#11497)
|
2024-12-17 12:05:13 +08:00 |
方程
|
fc8fdbacb4
|
feat: add gitee ai vl models (#11697)
Co-authored-by: 方程 <fangcheng@oschina.cn>
|
2024-12-16 18:45:26 +08:00 |
zhongliliu-butterfly
|
daccb10d8c
|
fix: volcengine_maas and baichuan message error (#11625)
Co-authored-by: zhongliliu <liuzlx@digitalchina.com>
|
2024-12-16 13:05:27 +08:00 |
zhaobingshuang
|
79801f5c30
|
fix: deepseek reports an error when using Response Format #11677 (#11678)
Co-authored-by: zhaobs <zhaobs@cailian.net>
|
2024-12-16 12:58:03 +08:00 |