I don't know what GPU you've got, but off the top of my head, Lexi V2 is the best "small model" I've seen when it comes to emotions.
It tends to skew male and can be a little dark at times, but it's more complex than expected for the size (8B feels like 48-70B).
Do Q8_0 if you've got the VRAM, Q5_KL for speed, IQ4_XS if you've got a potato.
If you are using CPU only, you need to look at very small models or the 2-bit quants.
Everything will be extremely slow otherwise:
GPUs are at least 3 times faster for the same power draw.
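
If you want a rough way to sanity-check which of those quants fits your card, here's a back-of-the-envelope sketch in Python. Everything in it is an assumption on my part, not something measured: the bits-per-weight figures are approximate values for each quant's block format (I'm using plain Q5_K as a stand-in for the Q5 option), the 8B parameter count is rounded, and the 1.5 GB overhead for context and buffers is just a guess that changes with context length. The function names are made up for illustration.

```python
# Approximate bits per weight for each quant format (assumed, rounded).
# Real GGUF files mix tensor types, so actual sizes will differ somewhat.
BITS_PER_WEIGHT = {"Q8_0": 8.5, "Q5_K": 5.5, "IQ4_XS": 4.25}


def approx_weight_gb(n_params_billion: float, quant: str) -> float:
    """Rough size of the weights alone, in GB (10^9 bytes)."""
    return n_params_billion * BITS_PER_WEIGHT[quant] / 8


def pick_quant(vram_gb: float, n_params_billion: float = 8.0,
               overhead_gb: float = 1.5):
    """Return the largest listed quant that fits in VRAM, or None.

    overhead_gb is a guessed allowance for KV cache and buffers.
    """
    for quant in ("Q8_0", "Q5_K", "IQ4_XS"):  # largest to smallest
        if approx_weight_gb(n_params_billion, quant) + overhead_gb <= vram_gb:
            return quant
    return None  # nothing fits: look at smaller models or lower-bit quants


if __name__ == "__main__":
    for vram in (12, 8, 6, 4):
        print(f"{vram} GB VRAM -> {pick_quant(vram)}")
```

Treat the result as a starting point only. If it comes back with nothing, that's the "very small models or 2-bit quants" situation.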