Issues: InternLM/lmdeploy
[Benchmark] benchmarks on different cuda architecture with mo...
#815
opened Dec 11, 2023 by
lvhan028
Open
9
Issues list
[Bug] Can't Quantize llava-v1.6-34b (AssertionError)
#2079
opened Jul 18, 2024 by
rtadewald
2 tasks done
[Bug] Sending image from gradio interface to 4bit model
#2075
opened Jul 18, 2024 by
zhuraromdev
2 tasks
[Bug] After lmdeploy Auto AWQ quantization, the 4-bit model's generated output is entirely "姿势姿势...."
#2073
opened Jul 18, 2024 by
MingyuZha
2 tasks done
[Bug] Internvl2-llama3-76B on 8x V100 reports an error that flash attention is not supported
awaiting response
v100
v100 related issue
#2067
opened Jul 18, 2024 by
thesby
2 tasks done
[Bug] The problem of very low multi-card inference efficiency
#2066
opened Jul 17, 2024 by
maxin9966
2 tasks
[Bug] Deploying cogvlm2 with the latest lmdeploy 0.5.1 on V100 and RTX 2080 Ti fails during inference: ERROR - Engine loop failed with error: map::at
v100
v100 related issue
#2055
opened Jul 17, 2024 by
kklots
2 tasks done
[Bug] Segmentation fault during run of quantized internlm/internlm-xcomposer2-4khd-7b model
#2051
opened Jul 16, 2024 by
zhuraromdev
2 tasks
[Bug] gradio reset button gets stuck after I cancel a response.
#2043
opened Jul 16, 2024 by
zhulinJulia24
2 tasks