Someone Made a Dataset of One Million Bluesky Posts for 'Machine Learning Research'

Stopthatgirl7@lemmy.world · 1 year ago

Someone Made a Dataset of One Million Bluesky Posts for 'Machine Learning Research'

foremanguy@lemmy.ml · 1 year ago

If you post something publicly, that thing will be used to train AI. Nevertheless the privacy speaks of the company.

Brumefey@sh.itjust.works · 1 year ago

I don’t know why social media are used for training. It’s like the worst quality of data ever and it results to answers like « go kill youself » when prompted about something sad…

foremanguy@lemmy.ml · 1 year ago

They are used because they are “real life” (not really but you know) conversation example

Aeri@lemmy.world · 1 year ago

Be super fucking foul and un advertiser friendly to make it less useful, OUTLAW COUNTRY

ApatheticCactus@lemmy.world · 1 year ago

Yes, it absolutely will. That’s why I fragrance the pandas. Just a little here and there so that some Howard will need to sort through it. The lime really comes through clearly.