Skip to content

Commit 55cbbff

Browse files
authored
Update README.md
1 parent 4f1f640 commit 55cbbff

File tree

1 file changed

+8
-0
lines changed

1 file changed

+8
-0
lines changed

README.md

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,8 @@ However, the data can be accessed by contacting Dr. Koufakou at FGCU
1212
or myself. It is also available on the public internet, but I am not
1313
sure where.
1414

15+
The dataset used is a small subset of the Wikimedia personal attacks dataset.
16+
1517
### How to use these experiments?
1618

1719
Ensure that the data directory contains the needed data in the proper
@@ -23,6 +25,12 @@ script you run. They are organized into folders depending on which
2325
type of F1 score they report. The results are output in CSV format
2426
for easier analysis.
2527

28+
The data prep script in the data directory is the script supplied by
29+
Jason Scott, another FGCU student working with Dr. Koufakou on research
30+
on this dataset. This is not to be confused with the data prep file in
31+
the root directory of the project, which is a Python file containing
32+
functions that I have written myself to assist in data prep.
33+
2634
The Fasttext classifier should be very fast, and produce most results
2735
within less than a minute on a mid-range system.
2836

0 commit comments

Comments
 (0)