Issues: microsoft/onnxruntime

Issues list

Mismatch in results for TensorRT session and cuda Session (labels: ep:CUDA, ep:TensorRT, model:transformer)
#20986 opened Jun 10, 2024 by akmalmasud96
[Build] moduleNotfoundError: no module named 'onnxruntime.training' & 'No matching distribution found for onnxruntime-training' (labels: training)
#20985 opened Jun 10, 2024 by rajkamal-007
After adding preprocessing steps to the model, it gives an error (labels: platform:windows)
#20984 opened Jun 10, 2024 by feff2
[Performance] Severe performance penalty with transformer model and DirectML (labels: ep:CUDA, ep:DML, platform:windows)
#20983 opened Jun 10, 2024 by andrea-cimatoribus-pix4d
[Feature Request] Extend quantization tool to support blocked quantization (labels: feature request, quantization)
#20981 opened Jun 8, 2024 by DaniAffCH
[Web] Latest version does nonstandard imports (labels: platform:web)
#20978 opened Jun 8, 2024 by KTibow
How QLinearConv layer absorb the Relu function (labels: quantization)
#20975 opened Jun 7, 2024 by zhongpanwu
[Build] Unable to build onnxruntime from source (with oneDNN EP) (labels: build, ep:oneDNN)
#20971 opened Jun 7, 2024 by shreya-um
[Web] LinkError when using custom built WASM artifacts (labels: platform:web)
#20970 opened Jun 7, 2024 by miguel-lorenzo
Request for Hidden States Access in Phi-3 with ONNX Runtime (labels: documentation)
#20969 opened Jun 7, 2024 by ajliouat
[JAVA] Ability to construct a Tensor from a GPU memory pointer (labels: api:Java, ep:CUDA, ep:TensorRT, platform:windows)
#20966 opened Jun 7, 2024 by balenamiaa
[Build] Unable to build ONNX Runtime against CUDA 12.5 (labels: build, ep:CUDA, ep:TensorRT, platform:windows)
#20953 opened Jun 6, 2024 by mc-nv
Mac m1 build android.The compiler doesn't support BFLOAT16!!! (labels: build, platform:mobile)
#20948 opened Jun 6, 2024 by yangy996
[Performance] Is my script set to get optimal performance of onnxruntime? (labels: performance)
#20945 opened Jun 6, 2024 by JackWeiw
Stateful/Memory models
#20943 opened Jun 5, 2024 by bhack
[Web] Using ceil() in shape computation is not yet supported for MaxPool (labels: ep:WebGPU, platform:web)
#20938 opened Jun 5, 2024 by marrrcin
[Documentation] How to run this model on android mobile platform (labels: documentation, platform:mobile)
#20937 opened Jun 5, 2024 by Vinaysukhesh98
[Web] There seems to be some issues with the comments on this document regarding GetCount function (labels: platform:web)
#20931 opened Jun 5, 2024 by wxxz975
[Build] (labels: build, ep:CUDA)
#20928 opened Jun 5, 2024 by nikolai3d
[Build] CUDA 12.5 Build ERROR in MOE/cutlass for sm=90 (labels: build, ep:CUDA, ep:TensorRT, platform:windows)
#20924 opened Jun 4, 2024 by tianleiwu
[Build] TensorRT 10 nvinfer1 APIs deprecated (labels: build, ep:CUDA, ep:TensorRT, platform:windows)
#20923 opened Jun 4, 2024 by tianleiwu
Microsoft.ML.OnnxRuntime.Gpu 1.18.0 not working with NVIDIA CUDA 11.6 (labels: documentation, ep:CUDA, platform:windows, release:1.18.0)
#20916 opened Jun 4, 2024 by jacobilsoe
[Feature Request] Move graph compilation behind higher transformers (graph optimization) (labels: ep:DML, feature request, platform:mobile)
#20915 opened Jun 4, 2024 by peishenyan