Nutanix Cluster Companies Down – Troubleshooting | Digital Noch

Nutanix Cluster Companies Down – Troubleshooting | Digital Noch

Nutanix cluster / CVM runs tons of of companies to run and keep the cluster, however what is going to occur when Nutanix cluster / CVMs companies goes down. after all have to troubleshoot the Nutanix cluster / CVM companies to get again them up and working to show the cluster standing inexperienced.

It is vitally tough for any administrator to troubleshoot the Nutanix cluster / Nutanix CVM down companies. Subsequently is have talked about the easy steps to troubleshoot the Nutanix cluster / CVM companies.

Nutanix Cluster companies Troubleshooting

Let’s troubleshoot the frequent Nutanix cluster / CVM companies down points to resolve it and switch Nutanix cluster / CVM well being again to inexperienced.

Learn additionally: Nutanix Cluster Most Essential Companies

Problem 1: Improve caught as a result of Genesis not in a position to begin companies after Cassandra service

Decision : To resolve the Improve caught as a result of Genesis not in a position to begin companies after Cassandra service – Run following command from any Nutanix CVM within the cluster

nutanix@cvm$ allssh 'genesis restart'

Problem 2: Unreachable DNS server can forestall 2 node clusters from beginning companies after failure

Decision : To resolve the Unreachable DNS server can forestall 2 node clusters from beginning companies after failure – examine the DNS / Title server entry in CVM configuration file and examine connectivity.

Command 1: Verify DNS / Title server entry on cluster configuration

nutanix@cvm:~$ zeus_config_printer | grep name_server

Command 2: then examine the DNS / Title server entry in all CVMs configuration file.

nutanix@cvm:~$ allssh "cat /and so on/resolv.conf"

If DNS entry is just not discovered then add DNS server IP handle / host title from Prism as exhibiting following screenshot.

Make sure that DNS server is reachable earlier than placing DNS IP handle / host title.

Problem 3: SSP: Enabling Self-Service Portal Companies

Decision: To resolve SSP: Enabling Self-Service Portal Companies – Must allow the SSP service on all Nutanix CVM

Companies for the Self-Service Portal (SSP) function are disabled by default on AHV hosts on which the Controller VM has lower than 24 GB of reminiscence.

SSP is supported on AHV hosts solely.

Step 1: examine the Nutanix CVM Reminiscence allocation that should be at the least 24 GB or better can be effective.

nutanix@cvm$ free -m

Step 2: If Nutanix CVM Reminiscence allocation is much less then 24 GB then have to scale-up the reminiscence to at the least 24 GB or better.

Choice 1: Enhance / scale-up Nutlanix CVM reminiscence from Prism console

Choice 2: Enhance / Scale-up Nutanix CVM reminiscence from command-line

Step 3: Restart Genesis service on all Nutanix CVMs

nutanix@cvm$ allssh genesis restart
nutanix@cvm$ allssh genesis cease prism
nutanix@cvm$ cluster begin

Problem 4: Nutanix CVM / Cluster Companies are down

Let’s troubleshoot the Nutanix Cluster / CVM companies down challenge. to begin with attempt to perceive the Nutanix cluster essential companies right here:

Few Nutanix Essential companies record is right here:

  • acropolis
  • andruil
  • aplos
  • aplos_engine
  • catalog
  • cluster_config
  • cluster_sync
  • delphi
  • ergon
  • circulate
  • lazan
  • minerva_cvm
  • snmp_manager
  • sys_stat_collector
  • uhura
  • xtrim

Learn extra: Nutanix Cluster Most Essential Companies

Decision: examine the Nutanix CVM / Cluster companies standing and restart them.

Step 1: Verify Nutanix CVM / Cluster companies standing

nutanix@CVM$ ncc health_checks run_all
nutanix@CVM$ ncc health_checks system_checks cluster_services_status
nutanix@CVM$ ncc health_checks system_checks cvm_services_status
nutanix@cvm$ ncc health_checks hypervisor_checks check_services
nutanix@cvm$ ncc health_checks system_checks cluster_services_down_check

The NCC well being examine cluster_services_status verifies if the Controller VM (CVM) companies have restarted not too long ago throughout the cluster.

The next companies are checked:

  • alert_manager
  • arithmos
  • cassandra_monitor
  • cerebro
  • chronos_node_main
  • cluster_manager_monitor
  • hyperint_monitor
  • pithos
  • prism_monitor
  • stargate
  • stargate_monitor_main
  • stats_aggregator_monitor
  • zookeeper_monitor
  • curator

Step 2: Shortlist the down companies on all Nutanix CVM

nutanix@pcvm$ cluster standing | grep -v UP

Step 3: Begin Nutanix CVM / Cluster companies

nutanix@pcvm$ cluster begin

Observe: Above command won’t affect your manufacturing working VMs.

Elective Step 4: If step 3 command doesn’t begin the down companies then you possibly can reboot your both Nutlanix Node or Nutanix CVM.

Step 4.1: Reboot Nutanix CVM

nutanix@cvm$ cvm_shutdown -r now

Step 4.1.1: OR Shutdown Nutanix CVM

nutanix@cvm$ cvm_shutdown -P now

Step 4.1.2: Energy-on the Shudown Nutanix CVM

SSH to Nuanix AHV host

root# virsh record --all | grep CVM

In output you will notice CVM Title, simply copy it and run following command to begin the Nutanix CVM

root# virsh begin <CVM_Name>

Wait for five Minutes to boot-up the Nutanix CVM and companies.

OR Step 4.2 : You’ll be able to put your host in upkeep mode after which reboot node

Learn extra: Allow Nutanix CVM, AHV Upkeep mode

Learn extra: Shutdown / Reboot Nutanix AHV Host and Nutanix CVM

Last Step : Now examine Nutanix cluster standing and working companies.

nutanix@pcvm$ cluster standing

Problem 5: Nutanix Gateway not reachable. Http request error

Decision: Must restart the Nutanix Console companies on the host, which is Prism chief.

Step 1: Discover the Nutanix Prism Chief – Confirm which cluster node is the Prism chief, that’s, the CVM working the Prism container companies.

nutanix@cvm$ curl  && echo

Output ought to look related as following

"chief":"xx.xx.xx.10:9080", "is_local":false

It means xx.xx.xx.10 CVM is the Prism Chief.

Step 2: SSH to Prism Chief and run the next command to restart Prism service.

nutanix@cvm$ genesis cease prism 
nutanix@cvm$ cluster begin

Observe: There is no such thing as a affect on working manufacturing of above instructions.

Learn additionally: Nutanix Prism internet console is sluggish, not working, hanging points troubleshooting

Problem 6: Essential : Cluster Service: Aplos is down on the Controller VM

Problem 7: LCM improve fails with error “Companies not up” on a 2-node cluster

Decision: Above each companies run in LCM framework.

That is identified challenge. due to this fact it is strongly recommended to improve Nutanix NCC and LCM framework model to newest obtainable model.

Learn additionally: How Nutanix LCM Life Cycle Administration Framework Works ?

Hopefully, in the present day you may have discovered one thing new and attention-grabbing subject.

Due to being with HyperHCI Tech Weblog to remain tuned and continue learning until final breath.

#Nutanix #Cluster #Companies #Troubleshooting

Related articles

spot_img

Leave a reply

Please enter your comment!
Please enter your name here