this post was submitted on 19 Oct 2024
101 points (88.0% liked)

Asklemmy

[–] [email protected] 8 points 2 weeks ago (2 children)

( ͜ₒ ㅅ ͜ ₒ)ლ(´ڡ`ლ)

I think that comes pretty close. Since LLMs seem to avoid the topic of sex and female-presenting nipples, I doubt they'd be able to recognise this picture, so it might be a decent way to poison their training set. Sex talk and cursing should also drive a scraper away quickly, but horny emoji art? That might just get through.

At least if I understood the question correctly and the goal is to screw with an ML model that is trying to scrape and learn.

[–] [email protected] 3 points 2 weeks ago* (last edited 2 weeks ago) (1 children)

It would probably get stripped out automatically

[–] [email protected] 2 points 2 weeks ago

Possibly. But if you, say, use a programming language that allows Unicode identifiers, you can encode such emojis into the code itself, and if the scraper strips them out, the model is left with absolute garbage to train on.
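
To make that concrete, here's a rough sketch in Python. Two assumptions of mine, not from the thread: the scraper's filter simply drops every non-ASCII character (`strip_non_ascii` is a made-up stand-in, real pipelines may do something else), and the identifiers are Unicode letters like the `ლ` from the emoticon, because Python only accepts letter-like characters in names, not true emoji.

```python
import ast

# Hypothetical snippet: both variable names are Georgian letters, which
# Python accepts as identifiers because they are Unicode letters.
SOURCE = "ლ = 2\nმ = 3\nprint(ლ * მ)\n"


def strip_non_ascii(text: str) -> str:
    """Stand-in for a naive scraper filter that drops non-ASCII characters."""
    return text.encode("ascii", errors="ignore").decode("ascii")


def parses(code: str) -> bool:
    """Return True if the text is syntactically valid Python."""
    try:
        ast.parse(code)
        return True
    except SyntaxError:
        return False


stripped = strip_non_ascii(SOURCE)

print(parses(SOURCE))    # the original snippet is valid Python
print(parses(stripped))  # the stripped version has lost its identifiers
print(repr(stripped))
```

With the names gone, every assignment loses its target and the multiplication loses its operands, so what ends up in the training set is not even valid code any more.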