Bug Description
We successfully optimized a model with TRTorch v0.3.0, but it fails when deployed for online serving with multi-threaded inference: the process may core-dump or hang at MemoryD2H (device-to-host copy). We can currently work around the issue by locking around torch.jit inference, so we suspect the TRTorch runtime is not thread-safe.
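For reference, this is roughly what our workaround looks like: a minimal sketch assuming a TRTorch-compiled `torch::jit::script::Module` (the function and mutex names here are ours for illustration, not TRTorch API):

```cpp
#include <mutex>
#include <vector>
#include <torch/script.h>

// Global lock serializing all inference calls into the compiled module.
static std::mutex g_trt_mutex;

torch::Tensor LockedForward(torch::jit::script::Module& module,
                            std::vector<torch::jit::IValue> inputs) {
  // Only one thread at a time may drive the underlying TensorRT
  // execution context; this avoids the crash/hang but also removes
  // all inference parallelism.
  std::lock_guard<std::mutex> lock(g_trt_mutex);
  return module.forward(std::move(inputs)).toTensor();
}
```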
I have collected other issues and commits related to this problem:
- TRTorch community: ❓ [Question] Is the module compiled by TRTorch thread safe? #181
- TF-TRT fixed the same problem with this patch: tensorflow/tensorflow@e51fa30

From these issues and commits, I gather that nvinfer1::IExecutionContext is not thread-safe.
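If that is the root cause, one possible direction is the usage pattern TensorRT's own documentation describes: share the `ICudaEngine` across threads, but give each thread its own `IExecutionContext` (or hold a lock around enqueue). A sketch of that pattern, not TRTorch code, with binding setup omitted:

```cpp
#include <NvInfer.h>
#include <cuda_runtime_api.h>

// Sketch only: an ICudaEngine is safe to share across threads, but an
// IExecutionContext is not. Each serving thread creates its own
// context from the shared engine and runs inference on its own stream.
void InferOnThisThread(nvinfer1::ICudaEngine* engine, void** bindings,
                       cudaStream_t stream) {
  nvinfer1::IExecutionContext* ctx = engine->createExecutionContext();
  ctx->enqueueV2(bindings, stream, nullptr);  // async execution
  cudaStreamSynchronize(stream);
  ctx->destroy();  // TensorRT 7.x-style cleanup
}
```

In a real server the context would be created once per worker thread (e.g. thread-local) and reused, rather than created per call, since context creation is expensive.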
So, how should this bug be fixed in TRTorch?