Name	Name	Last commit message	Last commit date
Latest commit History 119 Commits
dialog_test	dialog_test
examples	examples
resources	resources
tests	tests
utils	utils
.gitignore	.gitignore
CHANGELOG.md	CHANGELOG.md
LICENSE	LICENSE
README.md	README.md
config.ini.sample	config.ini.sample
requirements.txt	requirements.txt
run.py	run.py

Name

Last commit message

Last commit date

119 Commits

WA-Testing-Tool

Scripts that run against Watson Assistant for K fold validation on training set, testing on blind test, and draw precision curves for comparison.

Features

Easy to setup in one configuration file.
Save the state when Assistant service is down in the middle of processing.
Able to resume from where it stops using modularized scripts.

Prerequisite

*nix OS (Recommend)
Python 3.6.4 +

Quick Start

Install dependencies pip install -r requirements.txt
Set up parameters properly in config.ini.
Run the process. python run.py -c <path to config.ini>

Input File

config.ini - Configuration file for run.py. Below is the template.

[DEFAULT] ; KFOLD, BLIND or TEST mode = <one of the three options above> ; workspace_id or workspace JSON of target testing instance workspace_id = 01234567-9ABC-DEF0-1234-56789ABCDEF0 ; Test input file for blind and standard test test_input_file = ./data/test.csv ; Previous blind test out previous_blind_out = ./data/previous_blind_out.csv ; Test output path for blind and standard test test_output_path = ./data/test-out.csv ; All temporary files/states will be stored here temporary_file_directory = ./data ; Figure path for kfold and blind test out_figure_path= ./data/figure.png ; number of folds fold_num = 5 ; Keep or delete the workspaces after testing. Use 'yes' or 'no' keep_workspace_after_test = no ; POPULATION, EQUAL or WEIGHT_FILE weight_mode = population ; Test request rate max_test_rate = 100 ; Threshold of confidence conf_thres = 0.2 ; Partial Credit Table partial_credit_table = ./data/partial-credit-table.csv [ASSISTANT CREDENTIALS] username = <wa username> password = <wa password>

previous_blind_out.csv (optional) - Test output from the previous classifier, which uses the same blind set as these in test_input_file.

confidence	does intent match
0.01	yes
0.90	no
0.09	yes

test_input_file.csv - Test set for blind testing and standard test.

For blind test with golden intent used for comparison:

utterance	golden intent
utterance 0	intent 0
utterance 1	intent 0
utterance 2	intent 1

For standard test, the input must only have one column or error will be thrown:

utterance (implicit)
utterance 0
utterance 1
utterance 2

Examples

Run k fold validation

Run blind test

Run standard test without ground truth

Generate description for intents

Caveats

Due to different coverage among service plans, user may need to adjust max_test_rate accordingly to avoid network connection error.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

WA-Testing-Tool

Features

Prerequisite

Quick Start

Input File

Examples

Caveats

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 19

Uh oh!

Languages

License

cognitive-catalyst/WA-Testing-Tool

Folders and files

Latest commit

History

Repository files navigation

WA-Testing-Tool

Features

Prerequisite

Quick Start

Input File

Examples

Caveats

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 19

Uh oh!

Languages

Packages