Commit 45c4304

Update README.md
1 parent 55cbbff commit 45c4304

1 file changed: +6 -1 lines changed

README.md

Lines changed: 6 additions & 1 deletion
@@ -13,6 +13,8 @@ or myself. It is also available on the public internet, but I am not
 sure where.
 
 The dataset used is a small subset of the Wikimedia personal attacks dataset.
+Given more time, it is probably worth using the full dataset, but this could be
+a very time consuming process.
 
 ### How to use these experiments?
 
@@ -27,7 +29,10 @@ for easier analysis.
 
 The data prep script in the data directory is the script supplied by
 Jason Scott, another FGCU student working with Dr. Koufakou on research
-on this dataset. This is not to be confused with the data prep file in
+on this dataset. The file is not used by any of my code but was left in
+because it is needed if you preprocess the data yourself from scratch,
+rather than using the already processed data that I obtained from Jason.
+That file is not to be confused with the data prep file in
 the root directory of the project, which is a Python file containing
 functions that I have written myself to assist in data prep.
 