Commit b29a026

Simon-Li, 继优, chus-chus, and xiezhq-hermann authored
KV‑Cache (MHA, MLA): add missing start_layer / end_layer fields to MHATokenToKVPoolHost and MLATokenToKVPoolHost (#6016)
Co-authored-by: 继优 <[email protected]>
Co-authored-by: chus-chus <[email protected]>
Co-authored-by: Zhiqiang Xie <[email protected]>
1 parent 678d8cc commit b29a026

1 file changed: 2 additions, 0 deletions


python/sglang/srt/mem_cache/memory_pool.py

Lines changed: 2 additions & 0 deletions
@@ -762,6 +762,8 @@ def __init__(
         self.size = int(device_pool.size * host_to_device_ratio)
         # Align the host memory pool size to the page size
         self.size = self.size - (self.size % self.page_size)
+        self.start_layer = device_pool.start_layer
+        self.end_layer = device_pool.end_layer
 
         assert (
             self.size > device_pool.size
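
For readers unfamiliar with the surrounding class, the following is a minimal, self-contained sketch of what the hunk does: the host pool sizes itself from the device pool, aligns to the page size, and now also copies the device pool's layer range. `ToyDevicePool`, `ToyHostPool`, and `local_layer_index` are hypothetical names invented for illustration and are not the sglang classes. Taking `page_size` from the device pool and treating `end_layer` as inclusive are both assumptions, not something this diff confirms.

```python
from dataclasses import dataclass


@dataclass
class ToyDevicePool:
    """Hypothetical stand-in for a device-side KV pool (not the sglang class)."""
    size: int
    page_size: int
    start_layer: int
    end_layer: int  # assumed inclusive for this sketch


class ToyHostPool:
    """Hypothetical host-side pool mirroring the sizing logic in the hunk."""

    def __init__(self, device_pool: ToyDevicePool, host_to_device_ratio: float):
        # Assumption: the host pool reuses the device pool's page size.
        self.page_size = device_pool.page_size
        self.size = int(device_pool.size * host_to_device_ratio)
        # Align the host memory pool size to the page size
        self.size = self.size - (self.size % self.page_size)
        # The two fields added by the commit: mirror the device layer range.
        self.start_layer = device_pool.start_layer
        self.end_layer = device_pool.end_layer
        assert self.size > device_pool.size, "host pool must exceed device pool"

    def local_layer_index(self, layer_id: int) -> int:
        """Map a global layer id to a pool-local index (illustrative use only)."""
        if not (self.start_layer <= layer_id <= self.end_layer):
            raise IndexError(f"layer {layer_id} is outside this pool's range")
        return layer_id - self.start_layer


device = ToyDevicePool(size=1000, page_size=16, start_layer=8, end_layer=15)
host = ToyHostPool(device, host_to_device_ratio=2.5)
print(host.size, host.start_layer, host.end_layer)
```

Without the two copied fields, any host-side code that reads `self.start_layer` or `self.end_layer` would fail with an `AttributeError`, which is presumably the gap the commit title calls "missing".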
