Oracle 11g RAC 添加节点故障之--CRS资源启动故障

2023-10-30 23:50

本文主要是介绍Oracle 11g RAC 添加节点故障之--CRS资源启动故障,希望对大家解决编程问题提供一定的参考价值,需要的开发者们随着小编来一起学习吧!

系统环境:

操作系统:RedHat EL5.5

集群软件: GI 11G

数据库软件:Oracle 11.2.0.1

故障原因:

   由于新节点(node3)是从原有的老节点(node2),克隆而来,在添加新节点时,忘了修改新节点的主机名,导致出现以下故障:

1、新节点CRS service启动正常

[root@node3 ~]# crsctl check crs

CRS-4638: Oracle High Availability Services is online

CRS-4537: Cluster Ready Services is online

CRS-4529: Cluster Synchronization Services is online

CRS-4533: Event Manager is online

2、listener 资源启动失败

[root@node3 ~]# crs_stat -t

Name           Type           Target    State     Host        

------------------------------------------------------------

ora.DG1.dg     ora....up.type ONLINE    ONLINE    node1      

ora....ER.lsnr ora....er.type ONLINE    ONLINE    node1      

ora....N1.lsnr ora....er.type ONLINE    ONLINE    node1      

ora....VOTE.dg ora....up.type ONLINE    ONLINE    node1      

ora.RCY.dg     ora....up.type ONLINE    ONLINE    node1      

ora.asm        ora.asm.type   ONLINE    ONLINE    node1      

ora.eons       ora.eons.type  ONLINE    ONLINE    node1      

ora.gsd        ora.gsd.type   ONLINE    ONLINE    node1      

ora....network ora....rk.type ONLINE    ONLINE    node1      

ora....SM1.asm application    ONLINE    ONLINE    node1      

ora....E1.lsnr application    ONLINE    ONLINE    node1      

ora.node1.gsd  application    ONLINE    ONLINE    node1      

ora.node1.ons  application    ONLINE    ONLINE    node1      

ora.node1.vip  ora....t1.type ONLINE    ONLINE    node1      

ora....SM2.asm application    ONLINE    ONLINE    node2      

ora....E2.lsnr application    ONLINE    ONLINE    node2      

ora.node2.gsd  application    ONLINE    ONLINE    node2      

ora.node2.ons  application    ONLINE    ONLINE    node2      

ora.node2.vip  ora....t1.type ONLINE    ONLINE    node2      

ora....SM3.asm application    ONLINE    ONLINE    node3      

ora....E3.lsnr application    OFFLINE   OFFLINE        

ora.node3.gsd  application    ONLINE    ONLINE    node3      

ora.node3.ons  application    ONLINE    ONLINE    node3      

ora.oc4j       ora.oc4j.type  ONLINE    ONLINE    node2      

ora.ons        ora.ons.type   ONLINE    ONLINE    node1      

ora.pmydb.db   ora....se.type OFFLINE   OFFLINE              

ora....taf.svc ora....ce.type OFFLINE   OFFLINE              

ora.prod.db    ora....se.type OFFLINE   OFFLINE              

ora....ry.acfs ora....fs.type ONLINE    ONLINE    node1      

ora.scan1.vip  ora....ip.type ONLINE    ONLINE    node1  

[root@node3 ~]# crs_stat |grep lsn

NAME=ora.LISTENER.lsnr

NAME=ora.LISTENER_SCAN1.lsnr

NAME=ora.node1.LISTENER_NODE1.lsnr

NAME=ora.node2.LISTENER_NODE2.lsnr

NAME=ora.node3.LISTENER_NODE3.lsnr

NAME=ora.node3.LISTENER_NODE3.lsnr 服务启动失败

3、listener 资源故障解决方法:

手工启动:

[root@node3 ~]# crs_start  ora.node3.LISTENER_NODE3.lsnr -f

CRS-2527: Unable to start 'ora.LISTENER.lsnr' because it has a 'hard' dependency on 'ora.cluster_vip_net1.type'

CRS-2525: All instances of the resource 'ora.node1.vip' are already running; relocate is not allowed because the force option was not specified

CRS-2525: All instances of the resource 'ora.node2.vip' are already running; relocate is not allowed because the force option was not specified

CRS-0222: Resource 'ora.node3.LISTENER_NODE3.lsnr' has dependency error.

从以上日志可以看出,listener 在启动时,缺少vip的支持(node3 vip service)

[root@node3 ~]# crs_stat |grep vip

NAME=ora.node1.vip

TYPE=ora.cluster_vip_net1.type

NAME=ora.node2.vip

TYPE=ora.cluster_vip_net1.type

NAME=ora.scan1.vip

TYPE=ora.scan_vip.type

缺少node vip service!

添加vip service:

[root@node3 ~]# srvctl add -h

The SRVCTL add command adds the configuration and the Oracle Clusterware application to the OCR for the cluster database, named instances, named services, or for the named nodes.

Usage: srvctl add vip -n <node_name> -k <network_number> -A <name|ip>/<netmask>/[if1[|if2...]] [-v]

[root@node1 ~]# cat /etc/hosts

# Do not remove the following line, or various programs

# that require network functionality will fail.

127.0.0.1      localhost

192.168.8.41   node1

192.168.8.43   node1-vip

10.1.1.1       node1-priv

192.168.8.42   node2

192.168.8.44   node2-vip

10.1.1.2       node2-priv

192.168.8.45   scan_ip

192.168.8.46   node3

192.168.8.47   node3-vip

10.1.1.3       node3-priv

[root@node3 ~]#  srvctl add vip -n node3 -A 192.168.8.47/255.255.255.0/eth0 -k 0

PRCN-2049 : The network attributes specified (network number: 0, subnet: 192.168.8.0, adapters: eth0) conflict with an already registered network (network number: 1, subnet: 192.168.8.0, adapters: eth0)

[root@node3 ~]#  srvctl add vip -n node3 -A 192.168.8.47/255.255.255.0/eth0 -k 1

[root@node3 ~]#

添加node3 vip service 成功!

[root@node3 ~]# crs_stat |grep vip

NAME=ora.node1.vip

TYPE=ora.cluster_vip_net1.type

NAME=ora.node2.vip

TYPE=ora.cluster_vip_net1.type

NAME=ora.node3.vip

TYPE=ora.cluster_vip_net1.type

NAME=ora.scan1.vip

TYPE=ora.scan_vip.type

[root@node3 ~]#

启动ora.node3.LISTENER_NODE3.lsnr service:

[root@node3 ~]# crs_stat |grep lsn

NAME=ora.LISTENER.lsnr

NAME=ora.LISTENER_SCAN1.lsnr

NAME=ora.node1.LISTENER_NODE1.lsnr

NAME=ora.node2.LISTENER_NODE2.lsnr

NAME=ora.node3.LISTENER_NODE3.lsnr


[root@node3 ~]# crs_start -f ora.node3.LISTENER_NODE3.lsnr

Attempting to start `ora.node3.vip` on member `node3`

Start of `ora.node3.vip` on member `node3` succeeded.

Attempting to start `ora.LISTENER.lsnr` on member `node3`

Start of `ora.LISTENER.lsnr` on member `node3` succeeded.

[root@node3 ~]#

node3 Grid 日志:

[root@node3 node3]# pwd

/u01/11.2.0/grid/log/node3

[root@node3 node3]# tail -f alertnode3.log

[gpnpd(7901)]CRS-2332:Error pushing GPnP profile to "mdns:service:gpnp._tcp.local.://node1:12802/agent=gpnpd,cname=node-cluster,host=node1,pid=3431/gpnpd h:node1 c:node-cluster".

2014-04-17 14:47:42.399

[ctssd(8044)]CRS-2408:The clock on host node3 has been updated by the Cluster Time Synchronization Service to be synchronous with the mean cluster time.

2014-04-17 14:48:14.444

[ctssd(8044)]CRS-2408:The clock on host node3 has been updated by the Cluster Time Synchronization Service to be synchronous with the mean cluster time.

2014-04-17 14:48:18.952

[cssd(7954)]CRS-1601:CSSD Reconfiguration complete. Active nodes are node1 node2 node3 .

[ctssd(8044)]CRS-2408:The clock on host node3 has been updated by the Cluster Time Synchronization Service to be synchronous with the mean cluster time.

2014-04-17 14:49:34.501

[ctssd(8044)]CRS-2408:The clock on host node3 has been updated by the Cluster Time Synchronization Service to be synchronous with the mean cluster time.

2014-04-17 14:49:58.515

[ctssd(8044)]CRS-2408:The clock on host node3 has been updated by the Cluster Time Synchronization Service to be synchronous with the mean cluster time.

2014-04-17 14:50:22.552

[ctssd(8044)]CRS-2408:The clock on host node3 has been updated by the Cluster Time Synchronization Service to be synchronous with the mean cluster time.

2014-04-17 14:50:46.595

[ctssd(8044)]CRS-2408:The clock on host node3 has been updated by the Cluster Time Synchronization Service to be synchronous with the mean cluster time.


4、添加Oracle 节点

CRS资源已经启动成功,但在node1 上添加 node3 的Oracle 节点时,报以下错误!

wKioL1NPlOKx6eb6AAKzT7Swcho980.jpg

    重新启动node的CRS service ,仍然不能解决,最后尝试在node2 上添加node3 的Oracle新节点,却成功(node1 不能添加,还是个不解之谜;可能是在添加CRS节点时是在node1上执行,添加出现了错误(在node3 运行root.sh时)导致node不能正常识别node3的新节点)


node3 记录以下日志:

[root@node3 node3]# tail -f alertnode3.log

[gpnpd(7901)]CRS-2332:Error pushing GPnP profile to "mdns:service:gpnp._tcp.local.://node1:12802/agent=gpnpd,cname=node-cluster,host=node1,pid=3431/gpnpd h:node1 c:node-cluster".

2014-04-17 14:47:42.399











本文转自 客居天涯 51CTO博客,原文链接:http://blog.51cto.com/tiany/1397335,如需转载请自行联系原作者

这篇关于Oracle 11g RAC 添加节点故障之--CRS资源启动故障的文章就介绍到这儿,希望我们推荐的文章对编程师们有所帮助!



http://www.chinasem.cn/article/311371

相关文章

Oracle数据库在windows系统上重启步骤

《Oracle数据库在windows系统上重启步骤》有时候在服务中重启了oracle之后,数据库并不能正常访问,下面:本文主要介绍Oracle数据库在windows系统上重启的相关资料,文中通过代... oracle数据库在Windows上重启的方法我这里是使用oracle自带的sqlplus工具实现的方

Oracle Scheduler任务故障诊断方法实战指南

《OracleScheduler任务故障诊断方法实战指南》Oracle数据库作为企业级应用中最常用的关系型数据库管理系统之一,偶尔会遇到各种故障和问题,:本文主要介绍OracleSchedul... 目录前言一、故障场景:当定时任务突然“消失”二、基础环境诊断:搭建“全局视角”1. 数据库实例与PDB状态2

oracle 11g导入\导出(expdp impdp)之导入过程

《oracle11g导入导出(expdpimpdp)之导入过程》导出需使用SEC.DMP格式,无分号;建立expdir目录(E:/exp)并确保存在;导入在cmd下执行,需sys用户权限;若需修... 目录准备文件导入(impdp)1、建立directory2、导入语句 3、更改密码总结上一个环节,我们讲了

SpringBoot通过main方法启动web项目实践

《SpringBoot通过main方法启动web项目实践》SpringBoot通过SpringApplication.run()启动Web项目,自动推断应用类型,加载初始化器与监听器,配置Spring... 目录1. 启动入口:SpringApplication.run()2. SpringApplicat

解决Nginx启动报错Job for nginx.service failed because the control process exited with error code问题

《解决Nginx启动报错Jobfornginx.servicefailedbecausethecontrolprocessexitedwitherrorcode问题》Nginx启... 目录一、报错如下二、解决原因三、解决方式总结一、报错如下Job for nginx.service failed bec

Spring Boot项目如何使用外部application.yml配置文件启动JAR包

《SpringBoot项目如何使用外部application.yml配置文件启动JAR包》文章介绍了SpringBoot项目通过指定外部application.yml配置文件启动JAR包的方法,包括... 目录Spring Boot项目中使用外部application.yml配置文件启动JAR包一、基本原理

通过配置nginx访问服务器静态资源的过程

《通过配置nginx访问服务器静态资源的过程》文章介绍了图片存储路径设置、Nginx服务器配置及通过http://192.168.206.170:8007/a.png访问图片的方法,涵盖图片管理与服务... 目录1.图片存储路径2.nginx配置3.访问图片方式总结1.图片存储路径2.nginx配置

解决若依微服务框架启动报错的问题

《解决若依微服务框架启动报错的问题》Invalidboundstatement错误通常由MyBatis映射文件未正确加载或Nacos配置未读取导致,需检查XML的namespace与方法ID是否匹配,... 目录ruoyi-system模块报错报错详情nacos文件目录总结ruoyi-systnGLNYpe

Go语言使用sync.Mutex实现资源加锁

《Go语言使用sync.Mutex实现资源加锁》数据共享是一把双刃剑,Go语言为我们提供了sync.Mutex,一种最基础也是最常用的加锁方式,用于保证在任意时刻只有一个goroutine能访问共享... 目录一、什么是 Mutex二、为什么需要加锁三、实战案例:并发安全的计数器1. 未加锁示例(存在竞态)

解决hive启动时java.net.ConnectException:拒绝连接的问题

《解决hive启动时java.net.ConnectException:拒绝连接的问题》Hadoop集群连接被拒,需检查集群是否启动、关闭防火墙/SELinux、确认安全模式退出,若问题仍存,查看日志... 目录错误发生原因解决方式1.关闭防火墙2.关闭selinux3.启动集群4.检查集群是否正常启动5.