File tree Expand file tree Collapse file tree 1 file changed +8
-0
lines changed Expand file tree Collapse file tree 1 file changed +8
-0
lines changed Original file line number Diff line number Diff line change @@ -12,6 +12,8 @@ However, the data can be accessed by contacting Dr. Koufakou at FGCU
1212or myself. It is also available on the public internet, but I am not
1313sure where.
1414
15+ The dataset used is a small subset of the Wikimedia personal attacks dataset.
16+
1517### How to use these experiments?
1618
1719Ensure that the data directory contains the needed data in the proper
@@ -23,6 +25,12 @@ script you run. They are organized into folders depending on which
2325type of F1 score they report. The results are output in CSV format
2426for easier analysis.
2527
28+ The data prep script in the data directory is the script supplied by
29+ Jason Scott, another FGCU student working with Dr. Koufakou on research
30+ on this dataset. This is not to be confused with the data prep file in
31+ the root directory of the project, which is a Python file containing
32+ functions that I have written myself to assist in data prep.
33+
2634The Fasttext classifier should be very fast, and produce most results
2735within less than a minute on a mid-range system.
2836
You can’t perform that action at this time.
0 commit comments