Questions tagged [cluster]
A computer cluster consists of a set of loosely connected computers that work together so that in many respects they can be viewed as a single system.
1,389 questions
3 votes
3 answers
366 views
Requirement of Active Directory in Windows Server Hyper-V Failover Cluster on Windows Server 2022
I am planning a Hyper-V Failover Cluster on Windows Server 2022 for a customer. They have HP ProLiant DL380 Gen11 servers and MSA2060 FC SAN with required licenses and cables. I requested Active ...
0 votes
0 answers
47 views
RabbitMQ Cluster weird as hell after a node reinstall
I had a 3 node cluster with nodes: rabbitmq-01 rabbitmq-02 rabbitmq-03. The hosts file contains the names and the IP address of those hosts so they are able to reach each other via those names. The ...
1 vote
1 answer
90 views
Best network interface assignment for Apache NiFi cluster with multiple networks (service, management, backup)
We have an 8-node Apache NiFi cluster with an external Zookeeper cluster, running on three separated networks: Service Management Backup Currently all NiFi and Zookeeper communication runs through ...
2 votes
1 answer
188 views
How Does my WIndows Cluster Computer Object have permission to Create other AD Computers
I have a SQL Server Availability group which sits atop of a WSFC on a Windows Domain. There are two nodes in the cluster, both of which are SQL Server availability group replicas My SQL Server ...
0 votes
0 answers
29 views
How can I pass a parameter to ldirectord's `checkcommand` of type `external-perl`?
Some years ago I wrote a checkcommand for ldirectord in Perl, hard-coding the patch of the required configuration file. The checktypeis set to external-perl. When porting the configuration to a new ...
0 votes
2 answers
109 views
Is it correct to call an HAProxy load balancer setup a "cluster" despite not using a virtual IP?
I have two web servers (WebSrv-1 and WebSrv-2) connected to an HAProxy load balancer. Unlike failover clusters or Windows load balancing setups, the HAProxy setup doesn’t use a virtual IP—it's ...
0 votes
0 answers
29 views
Can't map a drive letter to a WSFC client access point - but can browse via Windows Explorer. Why?
I have a WSFC file server cluster set up (WS 2025). There is a client access point configured: ClientsGoHere.mydomain.com When I browse to \\ClientsGoHere in Windows Explorer, I'm greeted with the ...
0 votes
0 answers
75 views
I am setting up some new compute nodes but they are performing poorer than older nodes
So as the question says, I am in the process of migrating to new compute nodes. The new servers are HPE Proliant DL360 Gen 10 and the operating system installed is Ubuntu. These are the specifications ...
0 votes
0 answers
47 views
How to disable Storage Pools/Disks to be available to other nodes in SQL Cluster?
I am creating a SQL Cluster in Azure. This SQL cluster is mostly based on the Microsoft documentation article called "Tutorial: Configure an availability group in multiple subnets (SQL Server on ...
0 votes
0 answers
137 views
mongodb 6 use mongod.conf file to configure a cluster replica set
I have 3 pc with almalinux9.5 and mongodb v6. The 3 ips are 10.17.60.102, 10.17.60.103, 10.17.60.104 . In all 3 pc I edited the mongod.conf file in this way: # mongod.conf, Percona Server for MongoDB #...
2 votes
2 answers
243 views
Certificates SAN values when using TCP Load Balancer
I need some advices or best practices advices about SAN values on certificates. Here's the context : We got 2 servers that host the same application (clustered app on active-active mode). On top, we ...
1 vote
0 answers
231 views
OCFS2 + ISCSI - Mounting error "Heartbeat mode is invalid while trying to join the group"
An OCFS2 cluster with two nodes is attempting to work with storage that has two block devices. These devices are shared via iSCSI and are visible on both nodes. Smaller disk (230 gb) utilized for ...
0 votes
0 answers
52 views
How to unlock ssh passphrase only if it was not already added to ssh-agent?
I have a script that collects information from multiple ssh hosts at once, so it needs the ssh passphrase to be unlocked and loaded into ssh-agent before spawning connections in parallel. All hosts ...
4 votes
1 answer
189 views
Can't configure systemd:postfix on pacemaker to work properly
Objective: I have a cluster managed by the pacemaker, I want my postfix resource to depend on clamav-daemon.service and mailscanner.service. If either one of these two resources fail, the IP jumps. I ...
1 vote
0 answers
528 views
High availability on Redis cluster in K8s
As we know in a Redis Cluster, data is divided into shards, with each shard being managed by a master node and one or more replica nodes. The problem I want to tackle is the case where master and ...
1 vote
1 answer
91 views
Virtializing single guest in multiple hosts
I'm not sure if this question is more adequate to be asked here, in Super User or Unix Exchange but this seemed like the best forum to me. I'm used to virtualizing several VM's in a single host. I ...
1 vote
1 answer
500 views
Ray Cluster setup with "proxyconnect tcp: dial tcp connection refused"
Where are the possible places to configure K8S using a proxy as it repeats "proxyconnect tcp: dial tcp 127.0.0.1:1082: connect: connection refused" during setting up Ray Cluster? Follow ...
0 votes
1 answer
153 views
Ceph dashboard show image usage only for static provisioned volumes
I have a simple question. Why Ceph Dashboard => Block => Images in column "Usage" does not show the size for dynamically created Persistent Volume? If I create a static volume then, ...
1 vote
0 answers
339 views
Apisix kubernetes Discovery not work
I am using official helm chart to install the APISIX in my Kubernetes. I only override the following values from the official helm chart: useDaemonSet-> true serviceAccount -> set create to ...
2 votes
2 answers
488 views
Rook Ceph storage and MySQL space very big difference
Can someone please advise me what could be the cause of this condition? I have a Rook Ceph cluster on which MySQL database with 3x replication is stored. This database is used by me for development ...
0 votes
0 answers
52 views
Expose local kubernetes function to my pc
So I have a function running in my kubernetes cluster. NAME BUILDSTATE SERVINGSTATE BUILDER SERVING ADDRESS AGE minio-event ...
0 votes
1 answer
195 views
Windows Server 2019 is no longer activated after switching over to passive node [duplicate]
I am running Windows Server 2019 in Virtual Machine Manager on a Synology High Availability cluster. Today I switched the server to the passive node and now Windows Server 2019 is no longer activated. ...
1 vote
0 answers
220 views
Enabling vSAN on existing cluster in vCenter
I currently have 3 clusters in vCenter, all with their own ESXi hosts and VM's running on those resources. I want to enable and utilize vSAN within 1 cluster for now and then expand it throughout the ...
1 vote
0 answers
250 views
ovs-vsctl stucks and seems there's no schema when using ovsdb in cluster mode
#!/bin/sh ovsdb-tool create-cluster /host/var/lib/openvswitch/conf.db /usr/share/openvswitch/vswitch.ovsschema tcp:0.0.0.0:6632 2> /dev/null ovsdb-server \ --unixctl=/host/var/run/openvswitch/...
1 vote
1 answer
130 views
How keep storage synchronized between clustered web server
I'm working on a cluster of web servers that use local filesystem as storage for media files. These clustered servers are balanced. The served webpages gets some images or video from this repository, ...
1 vote
0 answers
787 views
AWS application load balancer not registering targets for Kubernetes EKS node target group
I have an EKS cluster with public/private access on a VPC with public and private subnets. I've setup my ALB in the public subnets on port 80, internet-facing and ip and installed the AWS controller ...
4 votes
1 answer
353 views
Upgrade pacemaker corosync drbd Cluster from centos7 to almalinux
I have an active passive pacemaker Cluster for vms running with drbd 84 and corosync in centos7. AS centos7 eol is coming soon I am in the need of upgrading the cluster to a newer distribution. ...
2 votes
1 answer
2k views
Ceph cluster mixing nvme disks with different sizes
I am deploying a ceph cluster, the cluster will have three controller servers and 27 osd nodes. Each osd node has 3x3.8Tb nvme + 1x1.9Tb nvme disks, for a total of 4 nvme disks per node. The failure ...
1 vote
0 answers
286 views
Deploy a Redis cluster in Kubernetes with High Availability
I am looking at deploying a Redis cluster (3masters, 3workers/slaves) on Kubernetes. I found the Redis Operator as being the best option but the “capability level” doesn’t look that attractive to me. ...
0 votes
1 answer
1k views
What is the setting to use on other nodes after force starting cluster on a certain node?
I'm learning about force starting a cluster in the event of a DR. https://learn.microsoft.com/en-us/windows-server/failover-clustering/manage-cluster-quorum#prevent-quorum-on-remaining-cluster-nodes ...
4 votes
1 answer
216 views
Why is there random choice made in a 2 node + file share cluster when File share dies?
In a 2 node wsfc with file share witness, each node has got 1 vote, and the witness also has got 1 vote. Total 3 votes. Given that we have concepts of dynamic quorum, dynamic witness; when the file ...
3 votes
1 answer
298 views
sge: How to set hard limits of node?
The guy who administrates our cluster died suddenly and recently, so now we have to operate it ourselves until a new guy comes in. We want to change the hard runtime limit of a node on our cluster. ...
2 votes
3 answers
220 views
Setting up a cluster with workload distribution
I want to setup a server cluster which can keep by servers as busy as possible while still giving fair compute time to everyone. I have setup a basic Kubernetes setup but the issue is that if some ...
1 vote
0 answers
136 views
Does MariaDB-Galera cluster have error tolerance function on sst_user credential?
We have done the testing and confirmed the following system behavior of MariaDB-Galera clustering: The first-time bootstrap always needs to get everything correct, including the sst_user credentials ...
0 votes
0 answers
121 views
Receiving PDNS NOTIFY for domain I no longer host
On a couple of my servers in a DNS cluster running CloudLinux and PowerDNS, I've been receiving the following in my LFD Log Scanner Reports: pdns[2569215]: Received NOTIFY for example.com from 54.39....
0 votes
0 answers
66 views
Possible to deploy service to GKE Autopilot cluster without YAML?
Is it possible to deploy to GKE Autopilot cluster without YAML and just command line? I have inherited a project where deployment generates command line to deploy to CloudRun. I am in the process of ...
1 vote
2 answers
400 views
Access to Kubernetes service non-HA or HA cluster
My setup has previously been one control node (non-HA) and a few worker nodes on Kubernetes. New I'm building up a new cluster with three control nodes (HA) and 9 worker nodes. Connection from browser ...
3 votes
2 answers
323 views
Have a single running process in a 4 node Windows Server cluster
Is there a way to have a single running process in a 4 node setup on Windows Server? And if the node running the process goes off, to automatically start it on one of the remaining 3 nodes? Currently ...
1 vote
1 answer
1k views
Slurm nodes randomly dropping
I've set up a cluster using Slurm, consisting of a head node, 16 compute nodes, and an NAS with NFS-4 network shared storage. I've recently installed Slurm on Ubuntu v22 via apt (sinfo -V reveals ...
4 votes
1 answer
1k views
Failover Cluster Disk Replication
We have a Windows failover cluster containing a shared disk assigned to a Role and want to set up replication from it. We tried removing the disk from the cluster to use DFS Replication, however we ...
0 votes
0 answers
207 views
IIS Cluster with ARR+NLB - Multiple Ports
I have two servers with NLB and ARR. And I have two IIS servers behind ARR. For now, as I tested, it only routes 80 and 443 ports. Can I route all ports (or may be defined ones) from ARR to IIS ...
2 votes
1 answer
815 views
MaxScale doesn't connect to Galera Cluster
I am working on a distributed HA cloud system and I have a Galera Cluster with 3 separated servers in Docker Swarm. It works as expected. I have 3 separated services for the Galera members due to ...
1 vote
0 answers
357 views
MariaDB Galera cluster with failed node
A client of mine asked a question that I'm not really sure how to answer correctly. His question is: In case of one node falls out of the cluster, stays offline for a few hours or days and then comes ...
1 vote
0 answers
685 views
Availability of CHROOT in Docker containers and clusters
Is the Linux tool chroot generally usable in a Docker container running in clusters and cloud platforms? (background) I'm developing an application where I may need to use chroot inside a Docker ...
5 votes
2 answers
2k views
Corosync/Pacemaker/DRBD Resilience Tuning
I have a DRBD cluster and one node was switched off for a couple of days. The single node ran fine without a hiccup. When i switch it on I got into a situation where all resources got stopped and one ...
3 votes
1 answer
452 views
Howto: Block or File replication across 3+ nodes without a SAN
The setup I admin the backend for a website that currently exists on a single node using Nginx (webserver), Neo4J (database) and Wildfly (app server). The website is getting enough traffic that we ...
1 vote
1 answer
1k views
Failover Cluster Manager - Location of the stored clusters
I have about a dozen clusters that i monitor daily. I use the Failover Cluster Manager under MMC for a dashboard view of the clusters. Two things i am looking for. Where are the cluster names that ...
1 vote
1 answer
147 views
Running a cluster 24x7 at full load. Possible damages? [closed]
Let us assume that we have a pool of some 50 computers with 6 cores and 12 threads each. If someone plans to use it for intensive astrophysics simulation using all of its logical CPUs (50*12) for 24x7,...
4 votes
2 answers
181 views
There is no replication in a HA Cluster
I had the problem of unrelated data aborting in a HA cluster and my Mysql resource was in standalone on both nodes, I managed to solve the problem and both nodes are synchronized, the problem now is ...
1 vote
0 answers
140 views
MySql Service Not Starting in cluster with DRBD
I have a high availability cluster with two nodes configured, after a maintenance carried out by an external company when restarting the system my mysql resource stopped working. when executing the ...