docs/modules/ROOT/pages/usage.adoc (46 additions & 1 deletion)
@@ -192,7 +192,52 @@ dataNodes:
 In the above example, all data nodes in the default group will store data (the location of `dfs.datanode.data.dir`) on a `128Gi` volume.
 
-By default, in case nothing is configured in the custom resource for a certain role group, each Pod will have a `1Gi` large local volume mount for the data location.
+By default, if nothing is configured in the custom resource for a role group, each Pod gets a `5Gi` volume mounted for the data location.
+
+==== Multiple storage volumes
+
+Datanodes can have multiple disks attached to increase storage capacity as well as throughput.
+The disks can be of different types, e.g. HDDs or SSDs.
+
+You can configure multiple https://kubernetes.io/docs/concepts/storage/persistent-volumes/#persistentvolumeclaims[PersistentVolumeClaims] (PVCs) for the datanodes as follows:
+
+[source,yaml]
+----
+dataNodes:
+  roleGroups:
+    default:
+      config:
+        resources:
+          storage:
+            data: # We need to overwrite the data pvcs coming from the default value
+              count: 0
+            my-disks:
+              count: 3
+              capacity: 12Ti
+              hdfsStorageType: Disk
+            my-ssds:
+              count: 2
+              capacity: 5Ti
+              storageClass: premium-ssd
+              hdfsStorageType: SSD
+----
+
+This will create the following PVCs:
+
+1. `my-disks-hdfs-datanode-default-0` (12Ti)
+2. `my-disks-1-hdfs-datanode-default-0` (12Ti)
+3. `my-disks-2-hdfs-datanode-default-0` (12Ti)
+4. `my-ssds-hdfs-datanode-default-0` (5Ti)
+5. `my-ssds-1-hdfs-datanode-default-0` (5Ti)
+
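For illustration, the last PVC in the list above might look roughly like the following manifest. This is a sketch under assumptions, not output captured from the operator: the `accessModes` value is assumed, and the real object also carries labels and other metadata that are omitted here.

[source,yaml]
----
# Hypothetical sketch of one generated PVC
# (second `my-ssds` volume of the first datanode Pod).
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: my-ssds-1-hdfs-datanode-default-0
spec:
  accessModes:
    - ReadWriteOnce               # assumption, not stated in the docs
  storageClassName: premium-ssd   # taken from `storageClass`
  resources:
    requests:
      storage: 5Ti                # taken from `capacity`
----

The trailing `-0` is the ordinal of the StatefulSet Pod, so a second datanode replica would get an equivalent set of PVCs ending in `-1`.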
+By configuring and using a dedicated https://kubernetes.io/docs/concepts/storage/storage-classes/[StorageClass], you can configure HDFS to use local disks attached to Kubernetes nodes.
+
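A StorageClass for statically provisioned local disks could look like the sketch below. The name `local-disks` is a placeholder chosen for this example. The `kubernetes.io/no-provisioner` provisioner and the `WaitForFirstConsumer` binding mode follow the Kubernetes documentation for local volumes, and the matching PersistentVolumes still have to be created separately.

[source,yaml]
----
# Sketch only: `local-disks` is a hypothetical name.
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: local-disks
provisioner: kubernetes.io/no-provisioner   # no dynamic provisioning for local volumes
volumeBindingMode: WaitForFirstConsumer     # bind only once a Pod is scheduled
----

To use it, set `storageClass: local-disks` on the corresponding entry under `storage`, in the same way as `premium-ssd` in the example above.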
+[NOTE]
+====
+You might need to re-create the StatefulSet to apply the new PVC configuration because of https://github.com/kubernetes/kubernetes/issues/68737[this Kubernetes issue].
+You can delete the StatefulSet without deleting its Pods using `kubectl delete sts --cascade=orphan <statefulset>` (on kubectl versions before 1.20, use `--cascade=false`).
+The hdfs-operator will re-create the StatefulSet automatically.
+====

+// Will end up with something like `/stackable/data/<pvc-name>/datanode` e.g. `/stackable/data/data/datanode` and `/stackable/data/data-1/datanode` etc.
+// We need one additional level because we don't want users to call their pvc e.g. `hadoop`
+// ending up with a location of `/stackable/hadoop/data`