Quantcast
Channel: High Availability (Clustering) forum
Viewing all 4519 articles
Browse latest View live

how to remove a node from nlb at runtime?

$
0
0

hello,

i need to temporally exclude a node from an nlb.

May happen that a server is up and working but the web application i'm balancing is out of sinch with the same application in the others nodes.

Eg. some static variables are not the same of the same static variables of other nodes, because of a timeout, a write error and so on but the server is still working.

in this case i need to stop the server from nlb because the information in the web application is not in sinch with other nodes.

I need to prevent users from being serverd from this out to date server, untill it will became updated, but i need to do this programmatically

how can i do it?



evict Node

$
0
0
what is proper way to evict a node on 2008r2 cluster? Stop cluster service on the node first or not? See conflicting articles

Trying to setup a redundant server for server 2012

$
0
0

have some questions  regarding creating a backup for an all in one server. The server has active directory, DNS, the file server and everything on it. I'm trying to create a redundancy setup in case of failure. So I was reading about failover clusters and this seem to be what I need to do. I'm just a little bit confused. What is the difference between using hyper V and using clusters?

When I create a cluster and add servers to it, what exactly happens? For instance simplest case I have the main server with all the data. Now I bring in another machine with nothing on it. I install windows server 2012 on it and add it and the other computer to a cluster. Does the data from the main server automatically get copied onto all the servers that are part of this cluster? At this point if one server fails will the other one take over?

Any help is greatly appreciated.

-FlipFlop

Repair Cluster Network Nam in Windows Server 2012

$
0
0

Hi,

I am having issues as described in mention below KB link.

http://support.microsoft.com/kb/2838043

after applying this hotfix I have restart Nodes now I need to perform post installation task as described in article that's is simulate failure on Cluster Network Name and repair it .

 I am confuse and need  responses on below queries

 

  1. What is the impact on services  running on this cluster like VM's , CSV  after pressing Simulate Failure.
  2. Is this activity required any downtime?
  3. Another question is if Cluster Network Name Resource fails how it effects cluster service?

 

 

 

 


Usama Arif


route add/delete broke my networking

$
0
0

I have run into a networking problem on my cluster, and I cannot figure out what changed.

I had a cluster configured and working.  Its access network is 192.168.10.0/24.  I have other networks, including a node management network on 10.29.130.0/24.  The access network is private to my lab; it's here I have my AD defined.  The management network is a 'lab' network that has a gateway available so we can access the lab from our corporate network with no issues.  In other words, the access network is only routed within my private lab, but the management network can be routed to corporate access.

But, as you know, Windows simply doesn't like creating two different networks with default gateways.  So before building the cluster, I removed the gateway from the management network, ensuring there was only a single gateway configured on each host.  Ran the validation and it came through fine (typical network warnings about non-routed networks not able to reach other networks, but that is expected and presents no problems).  Built the cluster.

I wanted to try to create an environment that would allow me to access the physical hosts through the management network, so I tried issuing a route add command specific to the management network. 

route add 10.29.130.0 mask 255.255.255.0 10.29.130.1 if 3

It didn't work as expected (I am by no means a networking expert, but I figured I would try it.)  Since it didn't work, I deleted it.

route delete 10.29.130.0 mask 255.255.255.0 10.29.130.1 if 3

Came back and ran another validation wizard on the cluster and now the validation fails with the following error (to each of the other nodes in the cluster):

Network interfaces FT4-Infra01.VSPEX.COM - Mgmt and FT4-Infra03.VSPEX.COM - Mgmt are on the same cluster network, yet address 10.29.130.37 is not reachable from 10.29.130.35 using UDP on port 3343.

I check my firewall, and those ports are open on all node for all firewall profiles.  From FT4-Infra01 (the machine I was messing with), I can ping the other nodes of the cluster.  From the other nodes in the cluster, I cannot ping FT4-Infra01.  Yes, I know ping is a different rule, but I always go back to basics.  And the fact that it is not responding to pings after playing with the route is strange.

Does anybody have any ideas about what playing with the route command could have changed?  I am assuming that is the cause because everything worked fine before issuing the command and now these errors are there.  No other changes were made to the cluster or the nodes.


.:|:.:|:. tim

Cannot create a SMB Application File Share in 2012R2

$
0
0
  1. I am trying to create a SOFS on Windows Server 2012R2 but cannot create the application file share via Failover Cluster Manager. 
  2. I have a created a Clustered Shared Volume using JBOD and storage spaces.  The volume is formatted on the JBOD disks using REFS (I understood this is supported for CSV in  2012R2).  The environment is being built using Data On Storage 9220 CiB which is certified for Storage Spaces.
  3. I open Fail Over Cluster Manager (tried both remotely and locally on file server). 
  4. Connect to the Cluster. 
  5. Expand the Cluster
  6. Choose Roles
  7. Right Click on "SecSofs01Cap" which is of type "Scale-Out File Server" and choose "Add File Share"
  8. Choose SMB Share - Applications
  9. Choose Next
  10. Select "SecSofs01Cap" which is of type "Scale Out File Server"
  11. Select by Volume is grayed out and I see no volumes presented. The browse button in "Type a custom path" is grayed out as well.  

Cluster file server resource 'FileServer-(NAME)(Database Disk)' failed a health check

$
0
0

Any ideas on why a Fileserver keeps going offline on only 1 node. Even tried swapping the server and its still failing. This is a windows 2008r2 cluster

Cluster file server resource 'FileServer-(NAME)(Database Disk)' failed a health check. This was because some of its shared folders were inaccessible. Verify that the folders are accessible from clients. Additionally, confirm the state of the Server service on this cluster node using Server Manager and look for other events related to the Server

Creating a cluster question

$
0
0

I am trying to create a High Availability Cluster on Serve 2012r2 that will run Remote Desktop Services i.e. RemoteApps.  The idea being that if one server goes down the second one will take over.  I have created a cluster with 2 nodes, the two nodes in the cluster are virtual servers on two separate physical servers running Server 2012r2 with Hyper-V. From what I can tell the cluster is setup properly (server manager>local server recognizes them as being apart of the cluster), but I'm unclear whether these two nodes are the High Availability cluster I require, or if I should have made the physical servers that host Hyper-v the nodes in the cluster.  I would think that once these 2 nodes are in a cluster if I make a change to one node i.e. install a feature that the other node would be automatically updated to match, and should one node shutdown the other node takes over.  Doesn't seem to be the case for me, which makes me think I shouldn't have used the two virtual servers as nodes. 

Should I have created the cluster between the two physical servers instead?  Then created a virtual machine from that cluster?

My only other experience with clustering is with Synology, so I'm basing a lot of my assumption on how Synology clusters their devices.

Any help is greatly appreciated.

Brendon


Cluster Disk stuck in "Online Pending"

$
0
0

I am currently seeing this under my Storage in the Failover Cluster Manager on Server 2012 r2....
"Cluster Disk 2" is stuck in an "Online Pending" state. The thing is I don't have any disk that would be claiming this role.

When I try and look at the properties of this disk I get a message: "operation has failed..... unable to cast object of type 'MS.Internal.ServerClusters.Cluster' to type 'MS.Internal.ServerClusters.ClusterResource'. " or sometime the MMC crashes with an unhandled exception. If I try and remove it in the Failover Cluster MMC, it fails with "The file cannot be opened because it is in the process of being deleted. Error Code: 0x8007012f"

If I try and remove it with cmdlets in PowerShell, ' Remove-ClusterResource "Cluster Disk 2", no result because the resource doesn't exist. 'Get-ClusterResource' shows no resource related to the ghost disk.

Any thoughts on what may be causing this ghost disk to be showing up as available storage, or any resolutions to removing it.

How to size a Scale-out File Server

$
0
0

Hi,

We are looking to implement a 2-node 2012 R2 Scale-out File Server cluster (using SAS JBOD enclosure) for the primary purpose of storing the VHD files that will be accessed by a 4-node 2012 R2 Hyper-V cluster using 10 gigabit Ethernet (no RDMA).  Our environment can be characterised as having a large number of mostly idle VMs that experience sporadic, low intensity use (this is *not* a VDI environment).  We have 2 questions.

1) To what extent is RAM a consideration for the SoFS servers?  We can't find any documentation to suggest that there are benefits to be gained by having more RAM in the SoFS servers but we don't know if we should go with 8/16/32/64+ GB RAM in each of the nodes.

2) With the need to keep costs down, we don't think RDMA / SMB-Direct NICs are going to be within our reach.  Should we however look to have 2 * dual-port 10 Gbps NICs in both the SoFS & Hyper-V boxes?

is it possible to use SQL Server Cluster Disk as a Shared Storage for Load Balanced IIS servers?

$
0
0

HA Configuration:

Servers: Windows Server 2012 R2

2 node SQL Server FCI Cluster

2 node Web Server Behind a load balancer

is it possible to use cluster disk as a shared storage for IIS Servers?


How to Find the Latest Version of Cluster Binary Files available at Microsoft website

$
0
0

How we can find the latest version of Cluster binary files available for download at Microsoft Website and which Cluster Binaries should be update to make the Cluster services stable healthy.


Cluster Aware Updating Scheduling

$
0
0

I am having an issue with Cluster Aware Updating (CAU) on Server 2012 and Server 2012 R2. If I schedule self-updating and specify a time other than 03:00 (e.g. 07:00) in the wizard, when I get to the end of the wizard the confirmation page shows the schedule to be 03:00. if I click apply then the schedule does appear to be set to 03:00.The same behaviour occurs whether I am setting up CAU for the first time or editing an existing configuration. Thus it is not possible to schedule cluster aware updating for any time other than 03:00.

I am assuming that this is a bug, although I am open to suggestions if anyone else can think of a possible cause. I haven't found this mentioned anywhere online and I have been to MS Connect and Server 2012 is not listed as open for bugs. Has anybody else been able to reproduce this? Any idea how to report a bug if connect is closed?

I have two clusters, one on Server 2012 and one on 2012 R2 and I can reproduce on both:

OS Name    Microsoft Windows Server 2012 Datacenter
Version    6.2.9200 Build 9200

OS Name    Microsoft Windows Server 2012 R2 Datacenter
Version    6.3.9600 Build 9600

I am happy to supply further details if anyone is willing to help.

Thanks


Cluster log and etl files

$
0
0

We have a windows 2008 R2 cluster.

A few days ago, we had issues with this cluster.

I openend the cluster log, hoping to find the cause of the problem. But the last entry in the cluster log was dated 14 december 2013. Also the modified date of the failoverclustering.etl files is 24 december 2013. This is also the last time the cluster was restarted.

(Yes, we are a little behind with the MS security patches)

If i have read some articles correct, then the Windows Event Tracing (ETW) process on each node should constantly writes to etl files. Correct?

If that's the case, what could cause this problem?

In windows 2003, the cluster.log was automatically generated. Do i have to create the cluster.log manually in W2kR2?

Thanx.

Changing Node Name, Cluster Name and SQL Virtual Network Name

$
0
0

I plan to rename 2 Cluster Node names (2-Node Active/Passive Cluster in SQL Server 2008 and SQL Server 2008), Cluster Name and SQL Server Virtual Network Name.

I see few articles here and there with pieces of information, but is it possible to have actual steps and pre-req on how to do it

1. Will I need New IP Address for new names or I can MAP 2 names to 1 IP Address?

2. What should be the chronological order to complete this name change process?

3. Since the SQL Is already installed, will this in anyway affect its functionality?

Thanks in advance!!


Inquiry about update KB2878635

$
0
0

Hello, i am wondering if the compound update KB2878635 is strictly concerning Windows 2012 or if a Windows 2012 R2 package will be available, because we are confronted to deadlocks in the Resource Hosting Subsystem (RHS) process, and CSV failures" (the only difference is that in our testing environment we do not have backups)

So far we have installed all updates indicated in http://support.microsoft.com/kb/2920151/en-us

and specifically the update KB 2913695

and followed the procedure to disable ODX.

Because it addresses exactly our issue, but fails to resolve the problem (CSV stuck in pause state during SAN failover)

Bottom story, we are investigating other possible updates. If anyone is knowledgeable on future updates that may go public in the next weeks, i am eager to know...

I am too following this thread but didn't finish yet to process all info...

http://social.technet.microsoft.com/Forums/en-US/604409df-ada1-47d1-bdfb-3f938cde0b59/dpm-2012-sp1-beta-causing-server-2012-hyperv-cluster-hang-iscsi-problems?forum=dpmhypervbackup

Thanks.





Server 2012 Failover cluster. Make two VMs stay on the same node

$
0
0
We have a unique situation where i need two machines to stay on the same node. Its a 4 node cluster with 30+ resources but i want to make sure two boxes are ALWAYS on the same now. If one migrates to another node the second needs to follow. Is there a way to do this? 

Access is denied messages in Win2012 R2 Failover Cluster validation report and CSV entering a paused state

$
0
0

Been having some issues with nodes basically dropping out of clusters config.
Error showing was

"Cluster Shared Volume 'Volume1' ('Data') has entered a paused state because of '(c000020c)'. All I/O will temporarily be queued until a path to the volume is reestablished."

All nodes (Poweredge 420) connected a Dell MD3200 shared SAS storage.

Nodes point to Virtual 2012 R2 DC's

Upon running validation with just two nodes, get the same errors over and over again.

Bemused!

----------------

List Software Updates
Description: List software updates that have been applied on each node.
An error occurred while executing the test.
An error occurred while getting information about the software updates installed on the nodes.

One or more errors occurred.

Creating an instance of the COM component with CLSID {4142DD5D-3472-4370-8641-DE7856431FB0} from the IClassFactory failed due to the following error: 80070005 Access is denied. (Exception from HRESULT: 0x80070005 (E_ACCESSDENIED)).


and

List Disks
Description: List all disks visible to one or more nodes. If a subset of disks is specified for validation, list only disks in the subset.
An error occurred while executing the test.
Storage cannot be validated at this time. Node 'zhyperv2.KISLNET.LOCAL' could not be initialized for validation testing. Possible causes for this are that another validation test is being run from another management client, or a previous validation test was unexpectedly terminated. If a previous validation test was unexpectedly terminated, the best corrective action is to restart the node and try again.

Access is denied

-----------

The event viewer on one of the hosts shows
-------------
Cluster node 'zhyperv2' lost communication with cluster node 'zhyperv1'.  Network communication was reestablished. This could be due to communication temporarily being blocked by a firewall or connection security policy update. If the problem persists and network communication are not reestablished, the cluster service on one or more nodes will stop.  If that happens, run the Validate a Configuration wizard to check your network configuration. Additionally, check for hardware or software errors related to the network adapters on this node, and check for failures in any other network components to which the node is connected such as hubs, switches, or bridges.

The Cluster service is shutting down because quorum was lost. This could be due to the loss of network connectivity between some or all nodes in the cluster, or a failover of the witness disk.
Run the Validate a Configuration wizard to check your network configuration. If the condition persists, check for hardware or software errors related to the network adapter. Also check for failures in any other network components to which the node is connected such as hubs, switches, or bridges.

Only other warning is because the 4 nic ports in each node server are teamed on one ip address split over two switches - I am not concernd about this and could if required split then pairs, I think this is a red herring????

Failed to online the cluster generic script resource

$
0
0

Environment

Cluster Nodes = two

Cluster Nodes OS = Windows 2012

Application = IIS

Query

I configured the cluster. I want to cluster IIS as failover cluster. For this I am using the following article. As per the direction of the below article I replaced both (SITE_NAME = "MyWebSite" andAPP_POOL_NAME = "DefaultAppPool") . Now when I try to online the generic script resource it failed.

http://support.microsoft.com/kb/970759/en-us

Please Note: When I used the same script mentioned in the above link with default web site, it works fine.


Any comment will be appreciated. Thanks. Zahid Haseeb.


Repairing a Cluster

$
0
0

Hi!

One of the two windows 2012 cluster nodes' hard disk crashed and we had to change the hard disk and reinstall everything. How can we repair the cluster. We also need to change the scsi targets location for quorum disk.

Thanks.

Viewing all 4519 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>