Updated readme, minor configuration stuff
This commit is contained in:
parent
6a475d584d
commit
2795fba194
61
README.md
61
README.md
@ -1,3 +1,62 @@
|
|||||||
# miku
|
# miku
|
||||||
|
|
||||||
Discord bot/companion for the group chatte, powered by the GPT-J language model and modified with a soft prompt to understand all of our esoteric, elaborate inside jokes.
|
Discord bot/companion for the group chatte, powered by the GPT-~~J~~ Neo language model and modified with a soft prompt to understand all of our esoteric, elaborate inside jokes.
|
||||||
|
|
||||||
|
## Setup
|
||||||
|
|
||||||
|
Python 3.8+ and PyTorch required. CUDA strongly recommended.
|
||||||
|
|
||||||
|
The `c1-1.3B` model used in development should work without needing beefy hardware. It has been tested on a 1050 Ti 4 GB.
|
||||||
|
|
||||||
|
Set up a virtual environment:
|
||||||
|
|
||||||
|
Linux/MacOS
|
||||||
|
|
||||||
|
```shell
|
||||||
|
python3 -m venv venv
|
||||||
|
source venv/bin/activate
|
||||||
|
```
|
||||||
|
|
||||||
|
Windows
|
||||||
|
|
||||||
|
```shell
|
||||||
|
py -3 -m venv venv
|
||||||
|
.\venv\Scripts\activate
|
||||||
|
```
|
||||||
|
|
||||||
|
Install required packages
|
||||||
|
|
||||||
|
```shell
|
||||||
|
pip install -r requirements.txt
|
||||||
|
```
|
||||||
|
|
||||||
|
Copy `.env.example` to `.env` and fill in the bot's `TOKEN`.
|
||||||
|
|
||||||
|
For chat scraping, you will also need to get your own `USER_TOKEN`.
|
||||||
|
|
||||||
|
* In Discord, hit Ctrl+Shift+I to open up developer tools
|
||||||
|
* Go to the Network tab and filter by XHR requests
|
||||||
|
* Open a new channel, or scroll up, or do something else that will trigger an authenticated request
|
||||||
|
* Click on one that looks suitable (e.g. `messages?limit=50`)
|
||||||
|
* Under the "Request" tab, copy the contents of the `Authorization` request header.
|
||||||
|
|
||||||
|
## Usage
|
||||||
|
|
||||||
|
Scrape the messages from the chat channel you wish to use for a soft prompt. You will be prompted for the channel ID, which you can get by having developer mode on in Discord and right-clicking, or copying the last part of the URL in the browser.
|
||||||
|
|
||||||
|
```shell
|
||||||
|
cd src
|
||||||
|
python -m scraper
|
||||||
|
```
|
||||||
|
|
||||||
|
Train the soft prompt (TODO)
|
||||||
|
|
||||||
|
Run the Hatsune Miku bot. The first time you do this, it will download the model, which is ~5 GB.
|
||||||
|
|
||||||
|
```shell
|
||||||
|
python -m miku
|
||||||
|
```
|
||||||
|
|
||||||
|
## Final Remarks
|
||||||
|
|
||||||
|
sukima nuts
|
||||||
|
BIN
requirements.txt
BIN
requirements.txt
Binary file not shown.
@ -113,7 +113,7 @@ def boot(token: str):
|
|||||||
if not token:
|
if not token:
|
||||||
token = input('Enter your Discord user token (Authorization request header): ')
|
token = input('Enter your Discord user token (Authorization request header): ')
|
||||||
channel = input('Enter channel ID: ')
|
channel = input('Enter channel ID: ')
|
||||||
default_export = Path.cwd().parent / 'chats'
|
default_export = Path.cwd().parent.parent / 'chats'
|
||||||
export = input('Enter path to export transcripts (default "chats"): ')
|
export = input('Enter path to export transcripts (default "chats"): ')
|
||||||
scraper = Scraper(token, channel, Path(export) if export else default_export)
|
scraper = Scraper(token, channel, Path(export) if export else default_export)
|
||||||
scraper.scrape()
|
scraper.scrape()
|
||||||
|
Loading…
x
Reference in New Issue
Block a user