Skip to content

Commit 9b2fc75

Browse files
committed
Update
1 parent 9542ca5 commit 9b2fc75

File tree

1 file changed

+34
-9
lines changed

1 file changed

+34
-9
lines changed

README.md

Lines changed: 34 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -1,28 +1,53 @@
11
# NotebookLM Detector
22

3-
This is a simple tool to detect if an audio file is generated by NotebookLM.
3+
A simple tool to detect whether an audio file was generated by [NotebookLM](https://notebooklm.google/).
44

5-
## Detect
5+
At [Listen Notes](https://www.listennotes.com/), we've encountered a growing number of spammers submitting fake,
6+
NotebookLM-generated podcasts to our platform.
67

7-
Install dependencies first:
8+
We hoped the NotebookLM team would provide a tool to help detect NotebookLM-generated audio.
9+
However, after a week of back-and-forth emails, we lost patience.
10+
11+
It's now Friday (Oct 4, 2024), and since we won't hear back from the NotebookLM team until next week,
12+
we decided to put together this simple script. Luckily, it seems to work!
13+
14+
15+
## Detection
16+
17+
### Install Dependencies
818

919
```shell
1020
$ pip install -r requirements.txt
1121
```
1222

13-
Run the script to detect:
23+
### Run the Detection Script
24+
25+
To detect whether an audio file is AI-generated or human-produced, run the following command:
1426
```shell
1527
$ python notebooklm_detector.py --action predict --file_path [filename].mp3
1628
```
1729

18-
## Train
30+
You’ll see output like this:
31+
```shell
32+
$ The audio is: AI Generated
33+
```
34+
or
35+
```shell
36+
$ The audio is: Human
37+
```
38+
39+
## Training the Model
40+
41+
You can train the model and regenerate `model.pkl` by following these steps:
42+
43+
### Step 1: Organize the Dataset
1944

20-
You can train and regenerate model.pkl:
45+
* Place NotebookLM-generated audio files (mp3, wav, or mp4) in the datasets/ai/ folder.
46+
* Place human-produced audio files in the datasets/human/ folder.
2147

22-
Step 1: Put NotebookLM-generated audio files (mp3, wav, or mp4) in datasets/ai/ folder.
23-
And put human-produced audio files in datasets/human/ folder.
48+
### Step 2: Run the Training Script
2449

25-
Step 2: Run the script to train:
50+
To train the model, run:
2651
```shell
2752
$ python notebooklm_detector.py --action train --dataset_path datasets
2853
```

0 commit comments

Comments
 (0)