Questions tagged [devops]
85 questions
0 votes
0 answers
104 views
Karpenter not consolidating nodes
I'm using Karpenter in my EKS cluster, and I noticed at least two underutilized t3.2xlarge nodes running only kube-system pods, which I expect Karpenter to consolidate. I've attached my ...
0 votes
0 answers
49 views
High Private Bandwidth & CPU Load at 3-5 Hour Intervals
I'm running a Laravel application with the following architecture: Backend server: Handles queues, jobs, emails, Horizon, cron jobs, and admin panel. Specs: 2 vCPU / 4 GB RAM. Frontend server: ...
0 votes
1 answer
49 views
How to unblock Linux agent pool for container registry stuck in the "Updating" state
I have two agent pools in two Azure Container Registries that have been stuck in the "Updating" state for a few weeks. I cannot delete them. I cannot reboot them. I followed the documentation ...
0 votes
0 answers
129 views
sshd stops accepting connections after a few minutes – requires manual restart
I'm facing a strange issue with sshd on my server. I have a custom SSH service running (custom-sshd.service) on a non-standard port. Every time I reboot or manually restart the service using: sudo ...
0 votes
0 answers
44 views
I am unable to SSH into my GCP VM, with an error code 4003
I have tried to SSH in both locally and from GCP's built-in SSH, and I have a firewall rule for port 22. When I run this command in the serial port, service ssh status, I see that sshd is running: * ...
3 votes
1 answer
3k views
Digital Ocean droplet 100% CPU usage - kauditd0
Does anyone know what the kauditd0 service is? I’m encountering an issue with my Ubuntu server on a DigitalOcean Droplet, where the kauditd0 process is consuming 100% CPU (which has never happened before). I’...
0 votes
1 answer
317 views
Azure CDN rule engine - work with redirect and path
I need to make sure that every URL starting with /file is redirected to / (the root) while keeping the rest of the URL the same, for example /file/item.txt -> /item.txt. How can I do this? (...
1 vote
1 answer
88 views
MongoDB data in EC2 instance is erased automatically after one day
I am facing an issue where data in MongoDB inside an EC2 instance is erased after approximately one day. I use the AWS free tier. In it, I launch a t3.large Ubuntu instance with an 8 GB gp3 volume. ...
4 votes
1 answer
625 views
Is it Possible to Install Print Server Role Inside a Windows Server Core 2019 Container?
I'm currently working with a Windows Server 2019 VM and considering setting up a containerized environment for a label printing service. My goal is to create a container with the print server role ...
0 votes
0 answers
63 views
How should I behave when giving an AWS Permission Set | Handling Policies to users
Context: I've been given the task to manage the AWS IAM Identity Center (user, groups, accounts, permission sets, etc.) of my company. We have multiple developer companies that implement solutions to ...
1 vote
1 answer
821 views
I am getting an error in the Ansible playbook file
I am trying to learn Ansible and I have written my first playbook, but it gives me this error: ERROR! We were unable to read either as JSON nor YAML, these are the errors we got from each: JSON: ...
0 votes
1 answer
1k views
In GCP, how to connect to CloudSQL from another project?
I have three projects in Google Cloud Platform (GCP) under one organization: Project A, Project B, and Project C (central). Project C is using a GCP Cloud SQL component (MySQL), which is connected via ...
1 vote
1 answer
1k views
When to use Thanos/Cortex over Vanilla Prometheus?
Generally, when I look around or try to understand the HA setup for Prometheus, the most common search results are Cortex & Thanos. I've also seen M3 and VictoriaMetrics on the list. What I ...
0 votes
0 answers
1k views
Kubernetes etcd failed to publish local member to cluster through raft
Environment background: Kubernetes is configured with 3 master and 3 worker nodes. Kubernetes is configured component by component following https://github.com/kelseyhightower/kubernetes-the-hard-way. ...
0 votes
0 answers
41 views
Navigating Hetzner Server Security: Looking for Answers [duplicate]
We are managing a dedicated server with Hetzner. Our setup includes a public server (Nginx reverse proxy) connected to around 10 upstream servers. Recently, our server was compromised through an ...
2 votes
1 answer
183 views
Azure DevOps VMSS Agent Pool without Administrator
We want to use VMSS instances as our Azure DevOps agent pool. The agent pool deploys the TeamServicesAgent virtual machine extension to the deployed instances. This extension calls the script: https://...
2 votes
1 answer
1k views
Ansible: compare two values from two dictionaries
I have a simple problem. I have two dictionaries, one static with some data, and another that is the result of the stat command stored in a variable (with register). The goal is simply to compare ...
0 votes
1 answer
1k views
Jenkins Multiple Agent and Git
I have been using Jenkins for many years, but I have never needed more than one agent for my tasks. Just recently, I found the need to add another agent, which is not Linux but Windows. Now, this node ...
0 votes
1 answer
188 views
How to create ops-agents policy that will install ops-agent on all Ubuntu 22.04 VMs?
I have created this ops-agent policy to install ops-agent on all my running Ubuntu 22.04 VMs, but the policy exists in the Cloud console when I navigate to Monitoring -> Dashboards -> VM ...
0 votes
2 answers
299 views
I can't seem to point the correct nameserver to my VPS
I'm hosting my domain on Namecheap and I'm trying to point my nameservers to my VPS using Webuzo (on Hostinger). Webuzo shows this error on my VPS: " This system does not control ...
0 votes
0 answers
120 views
Seeking Expert Advice on Terraform Deployment Cleanup
I've been successfully using Terraform for various deployments, managing to create multiple environments for different clients using a single Jenkins job. However, I've hit a roadblock when it comes ...
0 votes
1 answer
445 views
Missing rabbitmq.conf file in RabbitMQ installation
I recently installed "RabbitMQ 3.12.10:Erlang 26.2" on a Linux VM. Upon checking the server status using the command [systemctl status rabbitmq-server], the console output logs confirmed ...
0 votes
2 answers
2k views
Playbook does not use sudo command - Ansible Semaphore in Docker Container
So I'm learning DevOps tools. I installed Ansible Semaphore in a Docker container on my home server and created playbooks. I created a sudo user on a remote server and added an SSH key. However, my ...
0 votes
0 answers
32 views
Mapping versioning for config and infra corresponding to application code version
I have an application with a structure like the following: Service A - application and environment configuration, infrastructure dependencies (Queue, DB etc) etc. Service B - application and ...
1 vote
1 answer
1k views
Ansible lineinfile replace with system variable
I'm encountering an issue while using Ansible to dynamically modify a Zabbix agent configuration file. Specifically, I'm attempting to use the Ansible lineinfile module with a loop to update ...
0 votes
1 answer
1k views
kubectl get nodes --kubeconfig admin.kubeconfig returns "No resources found"
I am a newbie in Kubernetes. I am not even very familiar with basic Kubernetes commands, but the installation is a starting point for my learning journey. I wanted to avoid installing something like ...
1 vote
0 answers
3k views
How to get output of STDOUT to GCP cloud logging
I have a job running on a GCP VM where some imported libraries log to stdout with print or the Python logging library, and I have also written a lot of code where the output is logged to ...
1 vote
0 answers
479 views
Replicating foreign keys with AWS DMS Full load Ongoing replication - MySQL
I am using AWS DMS Full Load Ongoing Replication (CDC) to replicate data from a MySQL RDS to an on-prem MySQL server (both of them running MySQL 8). Before starting the replication process I'm dumping ...
0 votes
1 answer
266 views
Apache "Options Indexes" works on localhost but not with public DNS name
I have a Kubernetes cluster and a very basic container. I am trying to access it through the internet, and the task is very simple: I just need to list the index of the directory. Here is my basic ...
0 votes
1 answer
3k views
Forwarding Apache logs from k8s Pod to Syslog server
I am new to Kubernetes. I know there are multiple approaches to log collection, but I am specifically asking about my scenario. We have a Syslog server at xyz.com, port 8090. Now we want our pods to send logs to our ...
1 vote
1 answer
901 views
Proof that aws cloudwatch get-metric-statistics works
I am learning to use the aws cloudwatch get-metric-statistics subcommand in the AWS command line tool. To prove to myself that I am using the tool correctly, I would like a simple example command that ...
0 votes
0 answers
1k views
Error occurred while creating a cluster using Kubespray
I want to create a Kubernetes cluster using Kubespray. I have created three nodes. I am using the official documentation: https://kubernetes.io/docs/setup/production-environment/tools/kubespray/ I ...
1 vote
2 answers
9k views
What is the difference between Cloud-Init and Autoinstall?
I have been trying to install NixOS from the official site with a Cloud-Init for Ubuntu Server 22.04. I really want to understand how to install packages that I potentially might have ...
0 votes
1 answer
1k views
Ansible: print folder path from registered variable
I'm trying to print just the path from a registered variable that has multiple values, but I can only print a specific value. How can I print multiple values? --- - name: find directories find: ...
0 votes
1 answer
2k views
What does "COPY --from=build-env /app/build/web /usr/share/nginx/html" do?
I have the following Dockerfile: # Install Operating system and dependencies FROM ubuntu:22.04 AS build-env RUN apt-get update RUN apt-get install -y curl git wget unzip libgconf-2-4 gdb libstdc++6 ...
0 votes
1 answer
757 views
My website works at https://www.example.com but not at https://example.com
I purchased the domain from GoDaddy. My website works at: https://www.example.com, example.com, www.example.com. It does not work at: https://example.com. I want to know where I need to add ...
0 votes
1 answer
736 views
Delay a Pipeline for 24 Hours
Hello, I'm using Azure Pipelines for some tasks. Here is a simple example of what I'm trying to achieve: I want to trigger pipeline2 after pipeline1 completes (this is done) and once ...
4 votes
1 answer
9k views
Why does ArgoCD keep re-syncing my Job?
I'm deploying an application using ArgoCD. The deployment manifests include a Job that performs some one-time initialization for the application. The Job resource looks like this: apiVersion: batch/v1 ...
1 vote
2 answers
3k views
Serving ArgoCD behind Traefik in Kubernetes
I would like to run ArgoCD behind Traefik such that I can both access the ArgoCD web UI and interact with the API using the argocd command line tool. I have deployed ArgoCD into a Kubernetes cluster ...
0 votes
0 answers
36 views
How do VPS instances like AWS, Google Cloud, ... scale with containers?
As I understand it, K8s uses the blueprint to create multiple containers on the same machine. But how does this help scalability? For example, my laptop has 2 pods of backend servers and they ...
0 votes
0 answers
144 views
Incoming apiserver requests
Is there any way to see which requests are reaching the apiserver? I'm struggling with the error authentication.go:104] Unable to authenticate the request due to an error: x509: certificate has expired or is ...
0 votes
1 answer
337 views
Deploying monorepo without SSH
Full Disclosure: I'm a DevOps noob. I am running a MERN monorepo on an EC2 instance. Originally, I would always SSH into my machine, pull from the repository, and build again. But there are problems ...
1 vote
0 answers
174 views
How to run Amazon OpenSearch Service stably or with less downtime?
Can anyone point out the best way to run Elasticsearch stably? We frequently get messages saying that it requires updates and that an update requires some downtime. Could we have 2 ...
0 votes
1 answer
1k views
AWS Policy to Read/write RDS
In my scenario, I want a policy that will allow reading and writing of abc-database-backups/rds/postgresql-backup on S3. We'll want my servers to have that access added. Is creating a role and ...
1 vote
2 answers
2k views
HashiCorp Vault: How Do I Log In Headless From STDIN Using Bash Shell?
Given a Bash shell, say in a Docker container running on GitLab, how would I pass the password in? When I log in with this: $ vault login -method=ldap username=myusername It ...
0 votes
1 answer
1k views
Cannot redirect HTTPS with upstream directive
With this config I get redirected to google.com: server { listen 80; proxy_ssl_server_name on; rewrite_log on; location / { proxy_pass https://google.com; } } but if I ...
0 votes
1 answer
7k views
Jenkins Shell script not working. Yarn: command not found
I am running Jenkins to build a React app and executing shell commands to build the app automatically. The following are the bash commands: cd /var/www/FarmGroWeb sudo git reset --hard sudo git pull ...
0 votes
0 answers
237 views
Individual Local Accounts on AWS
In my scenario, all developers currently connect to EC2 instances using the ec2-user account. Is there a better way to do this so we can see which actions developers take on the machines? I'...
1 vote
1 answer
442 views
Puppet manifest configuration for Docker image verification
I’m quite new to DevOps/CI/CD, so bear with me. Currently, I’m setting up a Puppet manifest configuration for Docker image verification. Let me try to lay it out: I have a Puppet master server ...
1 vote
2 answers
2k views
Terraform: How to Detect and Branch on Operating System?
Given this block of pseudo code from Terraform: resource "null_resource" { provisioner "local-exec" { command = "echo hello" interpreter = local.os == ...