Skip to content

Conversation

@Bekaboo
Copy link

@Bekaboo Bekaboo commented Apr 27, 2025

Use multiple thread to load weights, cache and tokenizer, should slightly improve the initialization and TTFT time.

img_6
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

1 participant