Skip to content

Commit 5b240b4

Browse files
authored
1. set initmode to Random (#655)
2. add initmode as an adjustable parameter to kmeans.conf and run.sh Signed-off-by: minmingz <minming.zhu@inte.com>
1 parent f55862a commit 5b240b4

File tree

4 files changed

+4
-2
lines changed

4 files changed

+4
-2
lines changed

bin/functions/hibench_prop_env_mapping.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -94,6 +94,7 @@
9494
MAX_ITERATION="hibench.kmeans.max_iteration",
9595
K="hibench.kmeans.k",
9696
K_STORAGE_LEVEL="hibench.kmeans.storage.level",
97+
K_INIT_MODE="hibench.kmeans.initializationmode",
9798
# For gmm
9899
INPUT_SAMPLE_GMM="hibench.gmm.input.sample",
99100
INPUT_CLUSTER_GMM="hibench.gmm.input.cluster",

bin/workloads/ml/kmeans/spark/run.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -28,7 +28,7 @@ rmr_hdfs $OUTPUT_HDFS || true
2828
SIZE=`dir_size $INPUT_HDFS`
2929
START_TIME=`timestamp`
3030

31-
run_spark_job com.intel.hibench.sparkbench.ml.DenseKMeans -k $K --numIterations $MAX_ITERATION --storageLevel $K_STORAGE_LEVEL $INPUT_HDFS/samples
31+
run_spark_job com.intel.hibench.sparkbench.ml.DenseKMeans -k $K --numIterations $MAX_ITERATION --storageLevel $K_STORAGE_LEVEL --initMode $K_INIT_MODE $INPUT_HDFS/samples
3232
END_TIME=`timestamp`
3333

3434
gen_report ${START_TIME} ${END_TIME} ${SIZE}

conf/workloads/ml/kmeans.conf

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -49,3 +49,4 @@ hibench.workload.input ${hibench.hdfs.data.dir}/Kmeans/Input
4949
hibench.workload.output ${hibench.hdfs.data.dir}/Kmeans/Output
5050

5151
hibench.kmeans.storage.level MEMORY_ONLY
52+
hibench.kmeans.initializationmode Random

sparkbench/ml/src/main/scala/com/intel/sparkbench/ml/DenseKMeans.scala

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -39,7 +39,7 @@ object DenseKMeans {
3939
k: Int = -1,
4040
numIterations: Int = 10,
4141
storageLevel: String= "MEMORY_ONLY",
42-
initializationMode: InitializationMode = Parallel)
42+
initializationMode: InitializationMode = Random)
4343

4444
def main(args: Array[String]) {
4545
val defaultParams = Params()

0 commit comments

Comments
 (0)