Performance depends on available RAM, GPU acceleration, model size, context size, and what else is using your system resources.