bazsalanszky

joined 1 year ago
MODERATOR OF
[–] [email protected] 4 points 3 months ago

From what I've seen, it's definitely worth quantizing. I've used llama 3 8B (fp16) and llama 3 70B (q2_XS). The 70B version was way better, even with this quantization and it fits perfectly in 24 GB of VRAM. There's also this comparison showing the quantization option and their benchmark scores:

1000029570

Source

To run this particular model though, you would need about 45GB of RAM just for the q2_K quant according to Ollama. I think I could run this with my GPU and offload the rest of the layers to the CPU, but the performance wouldn't be that great(e.g. less than 1 t/s).

[–] [email protected] 2 points 3 months ago

Yes, you can find it here.

[–] [email protected] 1 points 3 months ago (2 children)

Are you using mistral 7B?

I also really like that model and their fine-tunes. If licensing is a concern, it's definitely a great choice.

Mistral also has a new model, Mistral Nemo. I haven't tried it myself, but I heard it's quite good. It's also licensed under Apache 2.0 as far as I know.

[–] [email protected] 5 points 3 months ago* (last edited 3 months ago) (1 children)

I haven't tested it extensively, but open webui also has RAG functionality (chat with documents).

The UI it self is also kinda cool and it has other useful features like commands (for common prompts) and searching for stuff online (e.g. with searx). It works quite well with Ollama.

[–] [email protected] 4 points 3 months ago

I have resolved this and other issues in the latest nightly build. I have also uploaded the fix to Google Play (it's currently under review). If we do not encounter any further problems, I could release it soon.

[–] [email protected] 7 points 3 months ago

I want to add kbin/mbin support in the future, but for now, Lemmy is my primary focus.

[–] [email protected] 9 points 3 months ago

Thank you! I missed that. I will try to fix that too

[–] [email protected] 16 points 3 months ago

Ahhh, your insurance is updated to Lemmy 0.19.5. The update will fix this issue.

[–] [email protected] 9 points 3 months ago (2 children)

Glad you're sticking with Eternity! Could you let me know what’s broken on your side? I’d love to help you get it sorted.

[–] [email protected] 41 points 3 months ago* (last edited 3 months ago) (3 children)

Just wanted to add here that you can test the release early with the nightly builds or with the Google Play Beta program (it's still under review, but hopefully it will be out there soon).

If you find any issues with it, please let me know so I can fix it before this release.

UPDATE: It should be available on the Google Play Beta now. Unfortunately, I've found some issues with posting comments/posts so I will need to fix that before the release.

[–] [email protected] 18 points 3 months ago

Not yet, but I think I know what the problem is. I can probably fix it before this release.

[–] [email protected] 23 points 4 months ago (3 children)

Currently, there is no page to show you the list of communities (other than your subscriptions/blocks). However, I plan to implement something similar to this one soon.

I also want to add an instance view page that will show the list of communities for that specific instance as well.

view more: ‹ prev next ›