Michael
  • Member for 5 months
  • Last seen more than a month ago
comment
Same Processing Time for Prompts of Different Size
Thanks for your reply. I understand that the GPU is underutilized. The question is, can I increase the throughput in any way? Right now there seems to be a lower bound on processing time for Gemma 3 on an A100 GPU, and it is quite high.