Inclusion of Other Datasets #7

Open
opened 2024-12-22 02:00:15 +00:00 by james · 2 comments
Owner
  • Images from our chat (already scraped) (unclear how useful it would be, or if it's worth the time)
    • Fembooru pics (already tagged) could be even more useful and quicker to train on
  • Brainrot vocabulary (need to find this)
  • Greentexts maybe
  • Fembooru inside jokes and references
  • Time-consuming but would pay off: include detailed biography of each member of chat in system prompt. May be worth going back to deanonymized role labels
  • Besides ToxicQA, some kind of normal instruction-following dataset like Alpaca may also make her not so stupid
james added the enhancement label 2024-12-22 02:00:38 +00:00
Author
Owner

hatsune miku's funny tweets https://youtu.be/i8GiGyP39vY?feature=shared

Author
Owner

get the schizodev guy on youtube to let me see his cunnyposting dataset lol

Reference: james/MikuAI#7