heartbeat-vl.spec 61 KB

1234567891011121314151617181920212223242526272829303132333435363738394041424344454647484950515253545556575859606162636465666768697071727374757677787980818283848586878889909192939495969798991001011021031041051061071081091101111121131141151161171181191201211221231241251261271281291301311321331341351361371381391401411421431441451461471481491501511521531541551561571581591601611621631641651661671681691701711721731741751761771781791801811821831841851861871881891901911921931941951961971981992002012022032042052062072082092102112122132142152162172182192202212222232242252262272282292302312322332342352362372382392402412422432442452462472482492502512522532542552562572582592602612622632642652662672682692702712722732742752762772782792802812822832842852862872882892902912922932942952962972982993003013023033043053063073083093103113123133143153163173183193203213223233243253263273283293303313323333343353363373383393403413423433443453463473483493503513523533543553563573583593603613623633643653663673683693703713723733743753763773783793803813823833843853863873883893903913923933943953963973983994004014024034044054064074084094104114124134144154164174184194204214224234244254264274284294304314324334344354364374384394404414424434444454464474484494504514524534544554564574584594604614624634644654664674684694704714724734744754764774784794804814824834844854864874884894904914924934944954964974984995005015025035045055065075085095105115125135145155165175185195205215225235245255265275285295305315325335345355365375385395405415425435445455465475485495505515525535545555565575585595605615625635645655665675685695705715725735745755765775785795805815825835845855865875885895905915925935945955965975985996006016026036046056066076086096106116126136146156166176186196206216226236246256266276286296306316326336346356366376386396406416426436446456466476486496506516526536546556566576586596606616626636646656666676686696706716726736746756766776786796806816826836846856866876886896906916926936946956966976986997007017027037047057067077087097107117127137147157167177187197207217227237247257267277287297307317327337347357367377387397407417427437447457467477487497507517527537547557567577587597607617627637647657667677687697707717727737747757767777787797807817827837847857867877887897907917927937947957967977987998008018028038048058068078088098108118128138148158168178188198208218228238248258268278288298308318328338348358368378388398408418428438448458468478488498508518528538548558568578588598608618628638648658668678688698708718728738748758768778788798808818828838848858868878888898908918928938948958968978988999009019029039049059069079089099109119129139149159169179189199209219229239249259269279289299309319329339349359369379389399409419429439449459469479489499509519529539549559569579589599609619629639649659669679689699709719729739749759769779789799809819829839849859869879889899909919929939949959969979989991000100110021003100410051006100710081009101010111012101310141015101610171018101910201021102210231024102510261027102810291030103110321033103410351036103710381039104010411042104310441045104610471048104910501051105210531054105510561057105810591060106110621063106410651066106710681069107010711072107310741075107610771078107910801081108210831084108510861087108810891090109110921093109410951096109710981099110011011102110311041105110611071108110911101111111211131114111511161117111811191120112111221123112411251126112711281129113011311132113311341135113611371138113911401141114211431144114511461147114811491150115111521153115411551156115711581159116011611162116311641165116611671168116911701171117211731174117511761177117811791180118111821183118411851186118711881189119011911192119311941195119611971198119912001201120212031204120512061207120812091210121112121213121412151216121712181219122012211222122312241225122612271228122912301231123212331234123512361237123812391240124112421243124412451246124712481249125012511252125312541255125612571258125912601261126212631264126512661267126812691270127112721273127412751276127712781279128012811282128312841285128612871288128912901291129212931294129512961297129812991300130113021303130413051306130713081309131013111312131313141315131613171318131913201321132213231324132513261327132813291330133113321333133413351336133713381339134013411342134313441345134613471348134913501351135213531354135513561357135813591360136113621363136413651366136713681369137013711372137313741375137613771378137913801381
  1. %define ENABLE_SNMP_SUBAGENT 0
  2. %define ENABLE_MGMT 1
  3. %define uid 24
  4. %define gname haclient
  5. %define uname hacluster
  6. Summary: Heartbeat subsystem for High-Availability Linux
  7. Summary(ja): 高可用性 Linux のための HeartBeat サブシステム
  8. Name: heartbeat
  9. Version: 2.0.8
  10. Release: 6%{_dist_release}
  11. Group: System Environment/Daemons
  12. License: GPL/LGPL
  13. URL: http://linux-ha.org/
  14. Source0: http://linux-ha.org/download/%{name}-%{version}.tar.gz
  15. Patch0: heartbeat-fedora-pam.patch
  16. Patch1: heartbeat-fedora-chkconfig.patch
  17. BuildRoot: %{_tmppath}/%{name}-%{version}-root
  18. BuildRequires: glib2-devel
  19. BuildRequires: iputils
  20. BuildRequires: libnet-devel
  21. BuildRequires: libtool-ltdl-devel
  22. BuildRequires: net-snmp-devel
  23. BuildRequires: bzip2-devel
  24. BuildRequires: ncurses-devel
  25. %if %{ENABLE_MGMT}
  26. BuildRequires: gnutls-devel > 2
  27. BuildRequires: pam-devel
  28. BuildRequires: python-devel
  29. BuildRequires: swig
  30. BuildRequires: curl-devel
  31. BuildRequires: libuuid-devel
  32. BuildRequires: libxml2-devel
  33. %endif
  34. Requires: stonith = %{version}-%{release}
  35. Requires: pils = %{version}-%{release}
  36. Requires: ldirectord = %{version}-%{release}
  37. Requires: pygtk2 >= 2.4
  38. Requires(post): /sbin/chkconfig
  39. Requires(preun): /sbin/chkconfig
  40. %if %{ENABLE_MGMT}
  41. Requires: gettext
  42. %endif
  43. %description
  44. heartbeat is a basic high-availability subsystem for Linux-HA.
  45. It will run scripts at initialization, and when machines go up or down.
  46. This version will also perform IP address takeover using gratuitous ARPs.
  47. It supports "n-node" clusters with significant capabilities for managing
  48. resources and dependencies.
  49. In addition it continues to support the older release 1 style of
  50. 2-node clustering.
  51. It implements the following kinds of heartbeats:
  52. - Serial ports
  53. - UDP/IP multicast (ethernet, etc)
  54. - UDP/IP broadcast (ethernet, etc)
  55. - UDP/IP heartbeats
  56. - "ping" heartbeats (for routers, switches, etc.)
  57. (to be used for breaking ties in 2-node systems)
  58. %package -n ldirectord
  59. Summary: Monitor daemon for maintaining high availability resources
  60. Summary(ja): 高可用リソースを管理するための監視デーモン
  61. Group: System Environment/Daemons
  62. Requires: ipvsadm
  63. Requires: %{name} = %{version}
  64. Requires(post): /sbin/chkconfig
  65. Requires(preun): /sbin/chkconfig
  66. %description -n ldirectord
  67. ldirectord is a stand-alone daemon to monitor services of real
  68. for virtual services provided by The Linux Virtual Server
  69. (http://www.linuxvirtualserver.org/). It is simple to install
  70. and works with the heartbeat code (http://www.linux-ha.org/).
  71. %package -n stonith
  72. Summary: Provides an interface to Shoot The Other Node In The Head
  73. Group: System Environment/Daemons
  74. Requires: pils = %{version}-%{release}
  75. %description -n stonith
  76. The STONITH module (a.k.a. STOMITH) provides an extensible interface
  77. for remotely powering down a node in the cluster. The idea is quite simple:
  78. When the software running on one machine wants to make sure another
  79. machine in the cluster is not using a resource, pull the plug on the other
  80. machine. It's simple and reliable, albeit admittedly brutal.
  81. #'
  82. %package -n pils
  83. Summary: Provides a general plugin and interface loading library
  84. Group: System Environment/Daemons
  85. %description -n pils
  86. PILS is an generalized and portable open source Plugin and Interface Loading
  87. System. PILS was developed as part of the Open Cluster Framework reference
  88. implementation, and is designed to be directly usable by a wide variety of
  89. other applications.
  90. PILS manages both plugins (loadable objects), and the interfaces these plugins
  91. implement. PILS is designed to support any number of plugins implementing any
  92. number of interfaces.
  93. %package devel
  94. Summary: Heartbeat development package
  95. Summary(ja): Heartbeat 開発パッケージ
  96. Group: Development/Libraries
  97. Requires: heartbeat = %{version}-%{release}
  98. %description devel
  99. Heartbeat development package
  100. %prep
  101. %setup -q
  102. %patch0 -p1
  103. %patch1 -p1
  104. %build
  105. # disable-fatal-warnings flag used to disable gcc4.x warnings of 'difference in signedness'
  106. CFLAGS=${RPM_OPT_FLAGS} \
  107. %configure \
  108. --disable-fatal-warnings \
  109. --disable-static \
  110. %if %{ENABLE_MGMT}
  111. --enable-mgmt
  112. %else
  113. --disable-mgmt
  114. %endif
  115. make
  116. %install
  117. rm -rf $RPM_BUILD_ROOT
  118. mkdir -p $RPM_BUILD_ROOT
  119. make DESTDIR=$RPM_BUILD_ROOT install
  120. (
  121. cd $RPM_BUILD_ROOT/etc/ha.d/resource.d
  122. ln -s /usr/sbin/ldirectord ldirectord
  123. )
  124. mkdir -p $RPM_BUILD_ROOT/%{_sysconfdir}/ha.d/conf
  125. # cleanup
  126. [ -d $RPM_BUILD_ROOT/usr/man ] && rm -rf $RPM_BUILD_ROOT/usr/man
  127. [ -d $RPM_BUILD_ROOT/usr/share/libtool ] && rm -rf $RPM_BUILD_ROOT/usr/share/libtool
  128. find $RPM_BUILD_ROOT -type f -name *.la -exec rm -f {} ';'
  129. sed -i -e '1i# -*-Shell-script-*-' $RPM_BUILD_ROOT/%{_libdir}/heartbeat/ocf-shellfuncs
  130. sed -i -e '1i# -*-Shell-script-*-' $RPM_BUILD_ROOT/%{_sysconfdir}/ha.d/shellfuncs
  131. chmod -x $RPM_BUILD_ROOT/%{_libdir}/heartbeat/ocf-shellfuncs
  132. chmod -x $RPM_BUILD_ROOT/%{_sysconfdir}/ha.d/shellfuncs
  133. chmod -x $RPM_BUILD_ROOT/%{_libdir}/heartbeat-gui/pymgmt.py
  134. %clean
  135. if [ -n "${RPM_BUILD_ROOT}" -a "${RPM_BUILD_ROOT}" != "/" ]; then
  136. rm -rf $RPM_BUILD_ROOT
  137. fi
  138. %pre
  139. /usr/sbin/groupadd -g %{uid} %{gname} &>/dev/null || :
  140. /usr/sbin/useradd -M -r -u %{uid} -s /sbin/nologin -d /var/lib/heartbeat/cores/hacluster \
  141. -c 'heartbeat user' -g %{gname} %{uname} &>/dev/null || :
  142. %post
  143. /sbin/ldconfig
  144. /sbin/chkconfig --add heartbeat
  145. %preun
  146. /sbin/chkconfig --del heartbeat
  147. %postun
  148. /sbin/ldconfig
  149. test "$1" != 0 || /usr/sbin/userdel %{uname} &>/dev/null || :
  150. test "$1" != 0 || /usr/sbin/groupdel %{gname} &>/dev/null || :
  151. %post -n ldirectord
  152. /sbin/chkconfig --add ldirectord
  153. %preun -n ldirectord
  154. /sbin/chkconfig --del ldirectord
  155. %post -n stonith -p /sbin/ldconfig
  156. %postun -n stonith -p /sbin/ldconfig
  157. %files
  158. %doc %{_datadir}/doc/%{name}-%{version}
  159. %defattr(-,root,root)
  160. %dir %{_sysconfdir}/ha.d
  161. %{_sysconfdir}/ha.d/harc
  162. %config(noreplace) %{_sysconfdir}/ha.d/shellfuncs
  163. %{_sysconfdir}/ha.d/rc.d
  164. %config(noreplace) %{_sysconfdir}/ha.d/README.config
  165. %{_libdir}/heartbeat
  166. %{_libdir}/heartbeat-gui
  167. %{_libdir}/libapphb.so.*
  168. %{_libdir}/libccmclient.so.*
  169. %{_libdir}/libcib.so.*
  170. %{_libdir}/libclm.so.*
  171. %{_libdir}/libcrmcommon.so.*
  172. %{_libdir}/libtransitioner.so.*
  173. %{_libdir}/libhbclient.so.*
  174. %{_libdir}/liblrm.so.*
  175. %{_libdir}/libpengine.so.*
  176. %{_libdir}/libplumb.so.*
  177. %{_libdir}/libplumbgpl.so.*
  178. %{_libdir}/librecoverymgr.so.*
  179. %{_libdir}/libstonithd.so.*
  180. %{_prefix}/lib/ocf
  181. %{_sysconfdir}/ha.d/resource.d/
  182. %exclude %{_sysconfdir}/ha.d/resource.d/ldirectord
  183. %{_sysconfdir}/init.d/heartbeat
  184. %config(noreplace) %{_sysconfdir}/logrotate.d/heartbeat
  185. %dir %{_var}/lib/heartbeat
  186. %dir %{_var}/lib/heartbeat/cores
  187. %dir %attr (0755, root, root) %{_var}/lib/heartbeat/cores/root
  188. %dir %attr (0755, nobody, nobody) %{_var}/lib/heartbeat/cores/nobody
  189. %dir %attr (0755, hacluster, haclient) %{_var}/lib/heartbeat/cores/hacluster
  190. %dir %{_var}/run/heartbeat
  191. %attr (0755, hacluster, haclient) %{_bindir}/cl_status
  192. %{_bindir}/cl_respawn
  193. %{_sbindir}/crmadmin
  194. %{_sbindir}/cibadmin
  195. %{_sbindir}/ccm_tool
  196. %{_sbindir}/crm_diff
  197. %{_sbindir}/crm_uuid
  198. %{_sbindir}/crm_mon
  199. %{_sbindir}/crm_sh
  200. %{_sbindir}/iso8601
  201. %{_sbindir}/crm_master
  202. %{_sbindir}/crm_standby
  203. %{_sbindir}/crm_attribute
  204. %{_sbindir}/crm_resource
  205. %{_sbindir}/crm_verify
  206. %{_sbindir}/attrd_updater
  207. %{_sbindir}/crm_failcount
  208. %{_sbindir}/ocf-tester
  209. %dir %attr (755, hacluster, haclient) %{_var}/run/heartbeat/ccm
  210. %dir %attr (755, hacluster, haclient) %{_var}/run/heartbeat/crm
  211. %dir %attr (755, hacluster, haclient) %{_var}/lib/heartbeat/crm
  212. %dir %attr (755, hacluster, haclient) %{_var}/lib/heartbeat/pengine
  213. %{_mandir}/man1/cl_status.1*
  214. %{_mandir}/man1/ha_logger.1*
  215. %{_mandir}/man1/hb_standby.1*
  216. %{_mandir}/man1/hb_takeover.1*
  217. %{_mandir}/man1/hb_addnode.1*
  218. %{_mandir}/man1/hb_delnode.1*
  219. %{_mandir}/man8/heartbeat.8*
  220. %{_mandir}/man8/apphbd.8*
  221. %{_mandir}/man8/ha_logd.8*
  222. %{_mandir}/man8/cibadmin.8*
  223. %{_mandir}/man8/crm_resource.8*
  224. %if %{ENABLE_SNMP_SUBAGENT}
  225. /LINUX-HA-MIB.mib
  226. %endif
  227. %if %{ENABLE_MGMT}
  228. %{_libdir}/libhbmgmt.so.*
  229. %{_libdir}/libhbmgmtclient.so.*
  230. %{_libdir}/libhbmgmtcommon.so.*
  231. %{_libdir}/libhbmgmttls.so.*
  232. %config(noreplace) %{_sysconfdir}/pam.d/hbmgmtd
  233. %{_datadir}/locale/zh_CN/LC_MESSAGES/haclient.mo
  234. %endif
  235. %files -n ldirectord
  236. %doc doc/COPYING
  237. %doc doc/README
  238. %doc ldirectord/ldirectord.cf
  239. %defattr(-,root,root)
  240. %{_sysconfdir}/ha.d/conf
  241. %{_sbindir}/ldirectord
  242. %{_sbindir}/supervise-ldirectord-config
  243. %config(noreplace) %{_sysconfdir}/logrotate.d/ldirectord
  244. %{_sysconfdir}/init.d/ldirectord
  245. %{_sysconfdir}/ha.d/resource.d/ldirectord
  246. %{_mandir}/man8/ldirectord.8*
  247. %{_mandir}/man8/supervise-ldirectord-config.8*
  248. %files -n stonith
  249. %doc doc/COPYING
  250. %doc doc/README
  251. %defattr(-,root,root)
  252. %{_libdir}/*.so.*
  253. %{_libdir}/stonith/
  254. %{_sbindir}/stonith
  255. %{_sbindir}/meatclient
  256. %{_mandir}/man8/stonith.8*
  257. %{_mandir}/man8/meatclient.8*
  258. %files -n pils
  259. %doc doc/COPYING
  260. %doc doc/README
  261. %defattr(-,root,root)
  262. /usr/include/pils
  263. %{_libdir}/libpils.*
  264. %{_libdir}/pils/
  265. %files devel
  266. %doc %{_datadir}/doc/%{name}-%{version}
  267. %defattr(-,root,root)
  268. %{_includedir}/heartbeat/
  269. %{_includedir}/clplumbing/
  270. %{_includedir}/saf/
  271. %{_includedir}/ocf/
  272. %{_includedir}/stonith/
  273. %{_includedir}/pils/
  274. %{_libdir}/*.so
  275. %changelog
  276. * Sun Mar 20 2011 Yoji TOYODA <bsyamato@sea.plala.or.jp> 2.0.8-6
  277. - rebuild with openssl-1.0.0d
  278. - add BuildRequires: libuuid-devel, libxml2-devel
  279. * Sun Oct 3 2010 Ryoichi INAGAKI <ryo1@bc.wakwak.com> 2.0.8-5
  280. - rebuilt with new toolchain
  281. * Sun May 17 2009 Ryoichi INAGAKI <ryo1@bc.wakwak.com> 2.0.8-4
  282. - spec in UTF-8
  283. - changed devel Group to Development/Libraries
  284. * Tue May 05 2009 Satoshi IWAMOTO <satoshi.iwamoto@nifty.ne.jp> 2.0.8-3
  285. - new versioning policy
  286. - rebuild with gnutls 2.6.6, libtool 2.2.6 and curl 7.19.4
  287. - add BuildRequires: curl-devel
  288. * Sun Jan 06 2008 Shu KONNO <owa@bg.wakwak.com> 2.0.8-0vl2
  289. - rebuilt with net-snmp-5.4.1
  290. * Sat May 19 2007 Daisuke SUZUKI <daisuke@linux.or.jp> 2.0.8-0vl1
  291. - new upstream release
  292. - build with new environment
  293. * Mon Oct 30 2006 Daisuke SUZUKI <daisuke@linux.or.jp> 2.0.7-0vl2
  294. - rebuild with gnutls-1.4
  295. - add Japanese summaries
  296. * Mon Oct 23 2006 Daisuke SUZUKI <daisuke@linux.or.jp> 2.0.7-0vl1
  297. - initial build for Vine Linux based on FE package.
  298. * Wed Aug 30 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.7-2
  299. - rebuild for Fedora Extras 6
  300. * Wed Aug 16 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.7-1
  301. - upstream version 2.0.7
  302. * Sat Jul 15 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.6-2
  303. - added BuildReqs: ncurses-devel
  304. * Fri Jul 14 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.6-1
  305. - upstream version 2.0.6
  306. * Fri Jun 16 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.5-2
  307. - bump for gnutls change in devel
  308. * Thu Apr 27 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.5-1
  309. - upstream version 2.0.5
  310. - removed patch2 - ownership of /heartbeat/crm/cib.xml is no longer
  311. set in cts/CM_LinuxHAv2.py.in
  312. * Wed Mar 29 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.4-2
  313. - Version 2.0.4
  314. * Wed Mar 1 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-9
  315. - changed user creation
  316. - added patch2 heartbeat-2.0.3-fedora-ccmuser.patch
  317. * Wed Mar 1 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-8
  318. - specifically excluded ldirectord symlink from heartbeat package
  319. - removed user and group deletion in postun
  320. - renamed subpackages ldirectord, pils and stonith to lose prefix heartbeat
  321. by using -n
  322. * Tue Feb 28 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-7
  323. - fixed more rpmlint errors and warnings
  324. * Sat Feb 25 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-6
  325. - fixed number of rpmlint warnings and errors (still ignores some..)
  326. - generate 'predictable' uid and gid with fedora-usermgmt to use with
  327. configure flag -with-ccmuser-id and groupadd, useradd
  328. - added Buildreq's: libtool-ltdl-devel, fedora-usermgmt-setup
  329. net-snmp-devel, bzip2-devel
  330. - removed *.so duplication in heartbeat and heartbeat-devel
  331. - changed file sections
  332. * Fri Feb 24 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-5
  333. - useradd with fedora-usermgmt
  334. - added *.so file to -devel sub-package
  335. * Sat Feb 18 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-4
  336. - removed all perl requires; should be picked up by rpmbuild automagically
  337. - changed system user creation hacluster part to use baseid:
  338. (http://fedoraproject.org/wiki/Packaging/UserCreation)
  339. * Thu Feb 16 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-3
  340. - removed Requires: python and gnutls
  341. - changed _libdir/ocf -> _prefix/lib/ocf
  342. - reversed subpackages depend on basepackage
  343. - removed Req swig (kept BuildReq)
  344. - added Req pygtk2
  345. * Wed Feb 15 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-2
  346. - fixes for various rpmlint errors and warnings
  347. - fixed setup -q
  348. - make subpackages depend on basepackage, not reverse
  349. - clean buildroot at beginning of install
  350. - replaced a number of hardcoded paths with RPM macros
  351. - Changed Group from Networking/Daemons to System Environment/Daemons
  352. - enable mgmt option
  353. * Sun Feb 12 2006 Joost Soeterbroek <fedora@soeterbroek.com> - 2.0.3-1
  354. - rebuilt for Fedora Extras
  355. * Fri Feb 10 2006 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  356. + Version 2.0.3 - Bug fixes and significant new features.
  357. + Management Daemon/Library and GUI client
  358. + provide a management library for manamgement daemon and CIM provider
  359. + provide a management daemon and a basic GUI management tool
  360. + CIM enablement
  361. + CIM (Common Information Model) enablement - works with
  362. sblim-sfcb, OpenWBEM, and Pegasus CIMOMs
  363. - not yet compiled into our binary RPMs because of dependencies
  364. + CRM (Cluster Resource Manager) General
  365. + All shutdowns go via the PE/TE - preserves inter-resource ordering
  366. + Support for future changes to the CIB (depreciation of cib_fragment)
  367. + Overhaul of IPC and HA channel callback logic
  368. + Many improvments to the quality and quantity (reduced) of logging
  369. + CRMd
  370. + Timerless elections - when everyone has voted we're done
  371. + Use the replace notification from the CIB to re-update our copy with
  372. our view of our peers.
  373. + Reliably detect if the LRM connection is still active.
  374. + Elections
  375. + newer versions defer to older ones in DC elections
  376. (opposite of current behavior)
  377. + this means that only once the complete cluster has been upgraded will
  378. we start acting like the new version and accept new config options
  379. + it also means newer PE's and TE's (the most complex pieces) don't need
  380. "act like the old version" options and can rely on all slaves being at
  381. least as up-to-date as they are
  382. + people can run mixed clusters as long as they want
  383. (until they want the new PE features)
  384. + new DCs only update the version number in the CIB if they have a
  385. higher value
  386. + nodes that start and have a lower version than that stored in the CIB
  387. shut themselves down (the CRM part anyway)
  388. + this prevents an admin from introducing old nodes back into an upgraded
  389. cluster. It probably doesn't fully understand the config and may not
  390. support the actions the PE/TE requires.
  391. + CIB (Common Information Base daemon)
  392. + Make sure "query only" connections cant modify the CIB
  393. + Periodically dump some stats about what the CIB has been doing.
  394. + Verify there are no memory leaks
  395. + Performance enhancements
  396. + Prevent a single CIB client from blocking everyone else
  397. + Clients Can be notified of full CIB replacements
  398. + record_config_changes option in ha.cf for those worried about
  399. the amount of logging. Defaults to "on".
  400. + suppress_cib_writes CIB option replaced with in enable_config_writes ha.cf
  401. (enable_config_writes to be removed in 2.0.4)
  402. + Never write the status section to disk
  403. + Check permissions for the on-disk CIB at startup
  404. + Dont trash unreadable on-disk CIBs
  405. + Fix for updates made against the whole CIB (not just one section)
  406. + PEngine (Policy Engine)
  407. + Many improvements to the handling of resource groups
  408. + Support "anonymous" clones
  409. + Fix stonith ordering
  410. + Order DC shutdowns after everyone else's
  411. + Support short resource names (for group and clone resources)
  412. + The ordering and colocation of grouped resources is now optional
  413. + Support probing new nodes for active resources.
  414. + All "probe" actions are controlled by the PE.
  415. + No resource may be started until the probing is complete.
  416. + Do not probe for resources we know to be active on unprobed nodes
  417. + When looking for monitor ops, only mark it optional if it was already
  418. active on the node we're interested in.
  419. + Detect changes to class/type/provider/parameters and force a restart
  420. of the resource
  421. + New record_pengine_inputs option in ha.cf for those worried about
  422. the amount of logging. Defaults to "on".
  423. + Differentiate between config and processing errors
  424. + reduces the frequency that we need to log the complete CIB
  425. + Make notify for master/slave work
  426. + New CIB option: stop_orphan_actions (boolean)
  427. If a resource is no longer defined, we can optionally stop it
  428. + New CIB option: stop_orphan_actions (boolean)
  429. If a monitor op for a given interval is no longer defined, we can
  430. optionally stop it
  431. + Add support for time and phase-of-the-moon based constraints
  432. + Improved failure handling: avoiding false positives
  433. + Always create orphaned resources - so they show up in crm_mon
  434. + Do not require sequential clone numbers starting at 0
  435. + TEngine (transition engine)
  436. + Detect old stonith ops
  437. + CLIs (Command Line interfaces)
  438. + Create a --one-shot option for crm_mon
  439. + Switch a number of CLI tools to use the new syncronous connections
  440. + Log errors to stderr where they will be seen and therefore useful
  441. + Support migration and un-migration of resources and resource groups
  442. + Create crm_verify for checking configuration validity
  443. + Simplify the passing of XML to cibadmin
  444. + Known open bugs worth mentioning:
  445. + 1075, 1080, 1081, 1084, 1085, 1064, 1069, 756, 984
  446. + 1050, 1082, 1037, 1079
  447. * Thu Sep 22 2005 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  448. + Version 2.0.2 - small bug fix only release
  449. + Fixed a bug in ping directive - it works again
  450. + Added a check to BasicSanityCheck to check ping and ping_group directives
  451. + fixed cl_status nodestatus to return 0 if a node has status "ping"
  452. + fixed a memory leak in the CRM's LRM interface code
  453. + fixed code which deterimines which version of the CRM becomes
  454. the DC when basic CIB schema versions differ. It now prefers
  455. the older version to be DC instead of the newer version.
  456. * Wed Sep 14 2005 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  457. + Version 2.0.1 -
  458. + Communication Layer
  459. + netstring encoding format is changed to be more efficient
  460. + add compression capability for big messages
  461. + Add man pages for hb_standby/hb_takeover
  462. + The assert triggered by 2.0.0 has been fixed
  463. + CIB can now contain XML comments and/or be in DOS format
  464. + Includes implementation of the ISO8601 date format
  465. + New CLI tools for changing cluster preferences, node attributes
  466. and node standby
  467. + Improved recovery and placement of group resources
  468. + Detection of failed nodes by the Policy Engine is fixed
  469. + New Policy Engine features
  470. http://www.linux-ha.org/ClusterResourceManager/DTD1.0/Annotated :
  471. sections 1.5.[8,9,10,12]
  472. + Constraints and instance attributes can now be active conditionally
  473. + Rules can now contain other rules
  474. + Date/Time based expressions are supported
  475. + Cloned resources can now optionally be notified before and after
  476. any of its peers are stopped or started.
  477. + The cluster can re-evaluate the configuration automatically after
  478. a defined interval of idleness
  479. + Removed a flow control message which was very annoying when operating
  480. in a mixed 1.x/2.x environment
  481. -- Known Bugs :-( --
  482. - Bug 859 - FSA took too long to complete action - fully recovered from
  483. - Bug 882 - IPC channel not connected during shutdown - harmless
  484. - Bug 879 - Failed actions cause extra election - harmless
  485. Each of these occurs about once or twice in 5000 test iterations
  486. - This is probably > 10K failovers
  487. - rsc_location constraints cannot have rules that contain other rules
  488. (fixed in CVS after release)
  489. * Fri Jul 29 2005 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  490. + Version 2.0.0 - First stable release of the next generation of the Linux-HA project
  491. + Basic Characteristics described here:
  492. http://linux-ha.org/FactSheetv2
  493. + Core infrastructure improvments:
  494. + Messaging (message acknowledging and flow control)
  495. + Logging (logging daemon)
  496. + Release 1.x style (2-node) clusters fully supported
  497. + Multi-node support (so far up to 16-node clusters tested)
  498. See http://linux-ha.org/GettingStartedV2 for more information
  499. + New components:
  500. + Cluster Information Base (replicated resource configuration)
  501. + Cluster Resource Manager (supporting 1->N nodes)
  502. + Modular Policy Engine (controlling resource placement)
  503. + Local Resource Manager (policy free, not cluster aware)
  504. + Stonith Daemon (stand-alone fencing subsytem)
  505. + Support for OCF and LSB resource agents
  506. + Support for composite resource types (groups, clones)
  507. + Support for a rich set of resource location and ordering constraints
  508. + Conversion tool for existing haresources
  509. + Resources monitored by request
  510. + Resource "maintenance" mode
  511. + Several failback, failure and "No Quorum" behaviours to choose from
  512. (global defaults and per action or resource)
  513. + Sample cluster state and configuration monitoring tools
  514. Known issues in 2.0.0:
  515. - Under some rare circumstances the cluster manager will time out
  516. while stabilizing a new cluster state. This appears to be
  517. otherwise harmless - the cluster is actually fine.
  518. http://www.osdl.org/developer_bugzilla/show_bug.cgi?id=770
  519. - Under some rare circumstances, a dev assert will be triggered
  520. in unpack.c. This results in the pengine getting restarted.
  521. This is annoying, but not a disaster.
  522. http://www.osdl.org/developer_bugzilla/show_bug.cgi?id=797
  523. * Tue May 23 2005 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  524. + Version 1.99.5 - Near-final beta of 2.0.0 release
  525. + many bug fixes - code looks very stable at this point
  526. -- well tested at this point on 4 and 8 node clusters.
  527. * Thu Apr 07 2005 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  528. + Version 1.99.4 - Near-final beta of 2.0.0 release
  529. + many bug fixes since 1.99.1
  530. + new external STONITH model - fully supports scripting interface
  531. + tested through 12 node clusters successfully
  532. + No serious defects found in testing
  533. + Easier-to-understand locational constraints model
  534. + Many bug fixes of many kinds
  535. + Important bug fixes to OCF IPaddr resource agent
  536. + Resources are monitored only on request
  537. + See http://wiki.linux-ha.org/ClusterResourceManager/Setup
  538. for basic ideas about getting started.
  539. + Release 1 style (2-node) clusters still fully supported
  540. + Release 2 style clusters support 1-N node clusters
  541. (where N is probably something like 8-32)
  542. * Tue Mar 20 2005 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  543. + Version 1.99.3 - Near-final beta "technology preview" of 2.0.0 release
  544. + many bug fixes since 1.99.1
  545. + tested through 12 node clusters with reasonable success
  546. + new STONITH API
  547. * Sun Feb 20 2005 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  548. + Version 1.99.2 - Near-final beta "technology preview" of 2.0.0 release
  549. + Many many many changes. Far too many to describe here.
  550. + See http://wiki.linux-ha.org/ClusterResourceManager/Setup
  551. for certain basic ideas about getting started.
  552. * Mon Oct 11 2004 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  553. + Version 1.99.1 - *early* beta series - preparing for 2.0.0
  554. + Andrew provided a number of fixes to the CRM and 2.0 features
  555. + Fixed a problem with retrying failed STONITH operations
  556. * Mon Oct 11 2004 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  557. + Version 1.99.0 - *early* beta series - preparing for 2.0.0
  558. + All STABLE changes noted below have been ported to this branch
  559. + Included in this release is a beta of the next generation of Heartbeat
  560. resource manager developed by Andrew Beekhof.
  561. http://linuxha.trick.ca/NewHeartbeatDesign is a good place to learn
  562. more about this effort. Please examine crm/README, crm/test/README
  563. and crm/crm-1.0.dtd for example usage and configuration.
  564. + Also included is the L(ocal) R(esource) M(anager) developed by IBM China
  565. which is an integral part of the NewHeartbeatDesign.
  566. + Known caveats:
  567. - STONITH as a whole has seen a code cleanup and should be tested
  568. carefully.
  569. - The external STONITH plug-in has undergone major surgery and
  570. probably doesn't work yet.
  571. - the new CRM is not perfectly stable with 3 nodes yet.
  572. + PLEASE see http://osdl.org/developer_bugzilla/enter_bug.cgi?product=Linux-HA
  573. and use it to report quirks and issues you find!
  574. * Sat Sep 18 2004 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  575. + Version 1.2.3 (stable)
  576. + fixed a serious error which causes heartbeat to misbehave after about
  577. 10 months of continuous operation
  578. + Made our ARP packets more RFC compliant
  579. + Extended apcmastersnmp code to deal with new devices
  580. + fixed a bug concerning simultaneous stops of both machines causing one
  581. of them to not shut down.
  582. + added an option to suppress reporting on packet corruption
  583. + fixed it so that we don't create the FIFO by the RPM
  584. + made cl_status setgid so anyone can run it, and fixed exit codes
  585. + eliminated a serious memory leak associated with client code
  586. + packaged doc files which had been missed before
  587. + fixed many many small bugs and memory leaks detected by BEAM
  588. + added several new test cases
  589. + fixed longstanding bug in plugin unloading
  590. + fixed a shutdown hang problem
  591. + several fixes for Solaris, FreeBSD
  592. + Solaris packaging now included in base
  593. + fixed a bug related to the apache resource agent not handling
  594. quoted parameters
  595. + added use_apphbd parameter to have heartbeat register
  596. with apphbd instead of watchdog device when desired
  597. + changed apphbd to default its config file to /etc
  598. + added snmp subagent code
  599. + added hbaping communications plugin
  600. + added external STONITH plugin
  601. + ldirectord: fixed a bug where real servers that were are
  602. present in multiple virtual services will only be added
  603. to one virtual service.
  604. * Mon May 11 2004 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  605. + Version 1.2.2 (stable)
  606. + Fixed several format string errors in communication plugins
  607. + Fixed a bug which kept us from diagnosing errors in non-aliased interfaces
  608. + Fixed a bug in ipaddr which caused an infinite loop when auto_failback on
  609. + Updated Debian things...
  610. + Added IPv6addr resource agent
  611. + Added ibmhmc STONITH plugin
  612. + Added cl_status command
  613. + Fixed a bug regarding restarts when auto_failback is on...
  614. + Fixed a couple of bugs in sha1 authentication method for very long keys
  615. + Fixed a bug in the portblock resource agent so that it no longer blocks
  616. ports on the loopback interface
  617. + Increased the time allowed for split brain test before it declares failure
  618. + Version 1.2.1 (stable)
  619. + Netstrings can now be used for our on-the-wire data format
  620. + Perl/SWIG bindings added for some heartbeat libraries
  621. + Significant improvements to SAF data checkpointing API
  622. + Implemented unbuffered ipcsocket code for SAF APIs
  623. + Many Solaris fixes -- except for ipfail, Solaris works
  624. + Significant library restructuring
  625. + Watchdog device NOWAYOUT is now overridded if defaulted
  626. + Watchdog device now kills machine instantly after deadtime
  627. instead of after one minute
  628. + Hostnames should now be treated case-independently...
  629. + Added new client status APIs - client_status() and cstatus_callback()
  630. + Fixed bug with auto_failback and quick full restarts
  631. + We now automatically reboot when resources fail to stop correctly...
  632. + We now check the status of the configured STONITH device hourly...
  633. + STONITH operations repeat after a 5 second delay, not immediately...
  634. + Added hb_takeover command - complement to hb_standby
  635. + Added documentation on how to use evlog/TCP to enable testing to
  636. take place without losing messages due to UDP message forwarding
  637. + Several new tests from Mi, Jun - split brain, bandwidth, failure
  638. detection time.
  639. + Fix to LVM resource from Harald Milz <hm@muc.de>
  640. + Fixed FreeBSD authentication problems breaking ipfail
  641. + Fixed .so loading on Debian
  642. + Fixed false complaints about resource scripts (from Jens Schmalzing)
  643. + Fixed false stop failure from LinuxSCSI (from Jens Schmalzing <j.s@lmu.de>)
  644. * Thu Apr 15 2004 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  645. + Version 1.3.0 - beta series
  646. + Netstrings can now be used for our on-the-wire data format
  647. + Perl/SWIG bindings added for some heartbeat libraries
  648. + Significant improvements to SAF data checkpointing API
  649. + Implemented unbuffered ipcsocket code for SAF APIs
  650. + Many Solaris fixes -- except for ipfail, Solaris works
  651. + Significant library restructuring
  652. + Watchdog device NOWAYOUT is now overridded if defaulted
  653. + Watchdog device now kills machine instantly after deadtime
  654. instead of after one minute
  655. + Hostnames should now be treated case-independently...
  656. + Added new client status APIs - client_status() and cstatus_callback()
  657. + Fixed bug with auto_failback and quick full restarts
  658. + We now automatically reboot when resources fail to stop correctly...
  659. + We now check the status of the configured STONITH device hourly...
  660. + STONITH operations repeat after a 5 second delay, not immediately...
  661. + Added hb_takeover command - complement to hb_standby
  662. + Added documentation on how to use evlog/TCP to enable testing to
  663. take place without losing messages due to UDP message forwarding
  664. + Several new tests from Mi, Jun - split brain, bandwidth, failure
  665. detection time.
  666. + Fix to LVM resource from Harald Milz <hm@muc.de>
  667. * Tue Feb 16 2004 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  668. + Version 1.2.0
  669. + Replaced the nice_failback option with the auto_failback option.
  670. THIS OBSOLETES THE NICE_FAILBACK OPTION. READ THE DOCS FOR HOW
  671. TO UPGRADE SMOOTHLY.
  672. + Added a new feature to hb_standby which allows you to give up
  673. any specific category of resources: local, foreign, or all.
  674. The old behavior is "all" which is the default.
  675. This allows you to put a auto_failback no cluster into
  676. an active/active configuration on demand.
  677. + ipfail now works properly with auto_failback on (active/active)
  678. + ipfail now has "hysteresis" so that it doesn't respond immediately
  679. to a network failure, but waits a little while so that the
  680. damage can be properly assessed and extraneous takeovers avoided
  681. + Added new ping node timeout directive "deadping"
  682. + Made sure heartbeat preallocated stack and heap, and printed a
  683. message if we allocate heap once we're started up...
  684. + IPMILan STONITH plugin added to CVS
  685. + Added IPaddr2 resource script
  686. + Made the APC smart UPS ups code compatible with more UPSes
  687. + Added a (preliminary?) ordered messaging facility from Yi Zhu
  688. + Changed IPaddr's method of doing ARPs in background so that
  689. certain timing windows were closed.
  690. + Added OCF (wrapper) resource script
  691. + Allow respawn programs to take arguments
  692. + Added pinggroups (where any node being up is OK)
  693. + SIGNIFICANT amount of internal rearchitecture.
  694. + Many bug fixes.
  695. + Several documentation updates.
  696. * Tue Feb 10 2004 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  697. + Version 1.1.5
  698. + ipfail now has "hysteresis" so that it doesn't respond immediately
  699. to a network failure, but waits a little while so that the
  700. damage can be properly assessed and extraneous takeovers avoided
  701. + Several fixes to cl_poll()
  702. + More fixes to the IPC code - especially handling data reception
  703. after EOF
  704. + removed some unclean code from GSource for treating EOF conditions
  705. + Several bugs concerning hanging when shutting down early during startup
  706. + A few BasicSanityCheck bug fixes
  707. + CTS now allows a single machine to be able to monitor several clusters
  708. + Most former CTS options are now either unneeded or on the command line
  709. + Increased number of ARPs and how long they're being sent out
  710. + Fixed uncommon (authorization) memory leak
  711. + Some Solaris portability fixes.
  712. + Made init script handle standby correctly for new config files
  713. + Improved the fast failure detection test
  714. + Added some backwards compatibility for nice_failback and some default
  715. authentication directives
  716. + Corrected the 1.1.4 change log
  717. * Fri Jan 22 2004 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  718. + Version 1.1.4
  719. + ipfail now works properly with auto_failback on (active/active)
  720. + Changed the API to use sockets (IPC library) instead of FIFOs.
  721. + Added new apiauth directives to provide authorization information
  722. formerly provided by the FIFO permissions.
  723. + Added Intel's implementation of the SAF data checkpointing API and daemon
  724. + Added a cleanup suggested by Emily Ratliff.
  725. + IPMILan STONITH plugin added to CVS
  726. + Added IPaddr2 resource script
  727. + Various cleanups due to horms.
  728. + Fixed authentication to work on 64-bit platforms(!)
  729. + Fixed the cl_poll() code to handle corner cases better
  730. + Made heartbeat close watchdog device before re-execing itself
  731. + New CTS improvements from Mi, Jun <jun.mi@intel.com>
  732. + Various minor bug fixes.
  733. . Several shutdown bugs addressed
  734. . fixed sendarp to make a pid file, so we can shut it down
  735. when we shut everything else down in case it's still running.
  736. . Lots of minor bug fixes to IPC code
  737. . Lots of minor bug fixes to ipctest program
  738. . made BasicSanityCheck more tolerant of delays
  739. . Fixed IPC code to authenticate based on ints, not int*s.
  740. . Check properly for strnlen instead of strlen...
  741. . Several signed/unsigned fixes
  742. . A few uninitialized vars now are inited
  743. . Switched to compiling lex/yacc sources the automake way
  744. . Lots of minor CTS fixes...
  745. + ldirectord bug fixes:
  746. . When new real servers are added on initialisation or when
  747. the configuration file is reread they are marked with status
  748. of -1 (uninitialised) so they will be checked and inserted
  749. into the virtual service as required
  750. . All checks use the checkport if set, otherwise the port set for
  751. the individual real server. This was the case for http and
  752. connect checks, but others had variations on this theme.
  753. . When the configuration file is reread because it changed
  754. on disk and autoreload is set, check the real servers
  755. immediately rather than waiting for checkinterval to expire
  756. . Already running message sent to stderr instead of stdout
  757. . Support alternate server in real-server specific URL
  758. . Treat the same real server with different weights as a different
  759. real server. Fixes bug reported by Philip Hayward whereby the same
  760. real-server would always have the same weight, regardless of
  761. the ldirectord.cf
  762. * Fri Sep 26 2003 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  763. + Version 1.1.3
  764. + Bugfix for heartbeat starting resources twice concurrently if
  765. auto_failback was set to "legacy".
  766. + Bugfix for messages getting lost if messages were sent in quick
  767. succession. (Kurosawa Takahiro)
  768. + Bugfix for Filesystem resource checking for presence of filesystem
  769. support before loading the module.
  770. + BasicSanityCheck extended to cover more basic tests.
  771. + Bugfix for findif not working correctly for CIDR netmasks.
  772. + Minor bugfix for ldirectord recognizing new schedulers correctly and
  773. timeout settings are now being honoured.
  774. + Enhanced the message giving a better explanation of how to set up node
  775. names properly when current node not found in the ha.cf file
  776. + Send a message to the cluster whenever we have a node which doesn't
  777. need STONITHing - even though it's gone down. This fix needed
  778. by CCM, which is in turn needed by EVMS.
  779. + Enhanced the messages for missing ha.cf and missing haresources files
  780. explaining that sample config files are found in the documentation.
  781. + Fix for memory leak from Forrest Zhao<forrest.zhao@intel.com>
  782. + Added a (preliminary?) ordered messaging facility from Yi Zhu
  783. + FAQ updates
  784. + Added Xinetd resource script
  785. + Added OCF (wrapper) resource script
  786. + Allow respawn programs to take arguments
  787. + Added pinggroups (where any node being up is OK)
  788. + fixed ldirectord negotiatetimeout for HTTP
  789. + fixed a bug which caused -d flag to be ignored
  790. + failing resource scripts are now ERRORs not WARNings
  791. + now shuts down correctly when auto_failback == legacy
  792. * Mon Jul 13 2003 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  793. + Version 1.1.2
  794. + Replaced the nice_failback option with the auto_failback option.
  795. THIS OBSOLETES THE NICE_FAILBACK OPTION. READ THE DOCS FOR HOW
  796. TO UPGRADE SMOOTHLY.
  797. + Changed IPaddr to not do ARPs in background, and shortened time
  798. between ARPs. Also made these things tunable...
  799. + changed our comm ttys to not become our controlling TTYs
  800. + Enhanced the ServeRAID script to fix a critical bug by using a new feature
  801. + Added a new DirectoryMap to CVS - tells where everything is...
  802. + significantly enhanced the BasicSanityCheck script, and the tests
  803. it calls.
  804. + added a new option to use a replacement poll function for improved
  805. real-time performance.
  806. + added the ability to have a cluster node's name be different
  807. from it's uname -n
  808. + Moved where CTS gets installed to /usr/lib/heartbeat/cts
  809. + Big improvements to the CTS README from IBM test labs in Austin.
  810. + bug fixes to the WTI NPS power switch
  811. + new client API calls:
  812. return arbitrary configuration parameters
  813. return current resource status
  814. + Added a new clplumbing function: mssleep()
  815. + added new capabilities for supporting pseudo-resources
  816. + added new messages which come out after initial takeover is done
  817. (improves CTS results)
  818. + LOTS of documentation updates.
  819. + fixed a security vulnerability
  820. + fixed a bug where heartbeat would shut down while in the middle
  821. of processing resource movement requests.
  822. + changed compilation flags to eliminate similar future security
  823. issues
  824. + went to even-more-strict gcc flags
  825. + fixed several "reload" bugs. Now reload works ;-)
  826. + fixed STONITH bug when other node never heard from.
  827. + Minor bug fixes (cleaned up corrupted message)
  828. + Two different client API bugs fixed.
  829. + changed the configure script to test which warning flags are
  830. supported by the current gcc.
  831. + enhanced the API test program to test new capabilities...
  832. * Wed May 21 2003 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  833. + Version 1.1.1
  834. + Significant restructuring of the processes in heartbeat
  835. + Added a new feature to hb_standby which allows you to give up
  836. any specific category of resources: local, foreign, or all.
  837. The old behavior is "all" which is the default.
  838. This allows you to put a nice_failback cluster into
  839. an active/active configuration
  840. + Enhancements to the ServeRAID code to make it work with the new
  841. (supported) version of IPSSEND from the ServeRAID folks...
  842. + Added STONITH code for the Dell remote access controller
  843. + Fixed a major bug which kept it from taking over correctly after 246
  844. days or so
  845. + Fixed a major bug where heartbeat didn't lock itself into memory
  846. properly
  847. + Added new ping node timeout directive "deadping"
  848. + Made sure heartbeat preallocated stack and heap, and printed a
  849. message if we allocate heap once we're started up...
  850. + Minor heartbeat API bug fixes
  851. + Minor documentation fixes
  852. + Minor fix to allow IP addresses with /32 masks...
  853. + Fixed a timing window for !nice_failback resource acquisition
  854. + Added several CCM bug fixes
  855. + Made the APC smart UPS ups code compatible with more UPSes
  856. + Fixed a bug in respawn
  857. + Enhanced internal checking for malloc errors...
  858. + Added IP alias search optimization from Sean Reifscheneider
  859. * Wed Mar 19 2003 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  860. + Version 1.0.2:
  861. + Fixed comment errors in heartbeat init script to allow it to run on RH 8.0
  862. + Changed apphbd to use poll(2) instead of sigtimedwait(2)
  863. + Put missing files into tarball
  864. + Documentation improvements for IPaddr and other things
  865. + Fixed an error in hb_standby which kept it from working if releasing
  866. resources takes more than 10 seconds
  867. + Added a fix to allow heartbeat to run on systems without writable disk
  868. (like routers booting from CD-ROM)
  869. + Added configuration file for apphbd
  870. + Added fix from Adam Li to keep recoverymgr stop looping at high priority
  871. + Added fix to ServeRAID resource to make it work with (new) supported
  872. hardware
  873. + Added Delay resource script
  874. + Added fix to Filesystem to allow it to support NFS mounts and allow
  875. user to specify mount options
  876. + Added fix to IPaddr to make tmp directory for restoring loopback device
  877. + Added fix to ipcsocket code to deal correctly with EAGAIN when sending
  878. message body
  879. * Mon Feb 17 2003 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  880. + Version 1.0.1:
  881. + Fixed some compile errors on different platforms, and library versions
  882. + Disable ccm from running on 'ping' nodes
  883. + Put in Steve Snodgrass' fix to send_arp to make it work on non-primary
  884. interfaces.
  885. * Thu Feb 13 2003 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  886. + Version 1.0.1 beta series
  887. 0.4.9g:
  888. + Changed default deadtime, warntime, and heartbeat interval
  889. + Auto* tool updates
  890. + VIP loopback fixes for IP address takeover
  891. + Various Solaris and FreeBSD fixes
  892. + added SNMP agent
  893. + Several CCM bug fixes
  894. + two new heartbeat API calls
  895. + various documentation fixes, including documentation for ipfail
  896. + Numerous minor cleanups.
  897. + Fixed a few bugs in the IPC code.
  898. + Fixed the (IPC) bug which caused apphbd to hang the whole machine.
  899. + Added a new IPC call (waitout)
  900. + Wrote a simple IPC test program.
  901. + Clarified several log messages.
  902. + Cleaned up the ucast communications plugin
  903. + Cleaned up for new C compilers
  904. + Fixed permissions bug in IPC which caused apphbd to not be usable by all
  905. + Added a new rtprio option to the heartbeat config file
  906. + updated apphbtest program
  907. + Changed ipfail to log things at same level heartbeat does
  908. * Sat Nov 30 2002 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  909. + Version 0.5 beta series (now renamed to 1.0.1 beta series).
  910. 0.4.9f:
  911. + Added pre-start, pre-stop, post-stop and pre-stop constructs in init script
  912. + various IPC fixes
  913. + Fix to STONITH behavior: STONITH unresponsive node right after we reboot
  914. + Fixed extreme latency in IPC code
  915. + various configure.in cleanups
  916. + Fixed memory leak in IPC socket code
  917. + Added streamlined mainloop/IPC integration code
  918. + Moved more heartbeat internal communication to IPC library
  919. + Added further support for ipfail
  920. + Added supplementary groups to the respawn-ed clients
  921. + Added standby to init script actions
  922. + Lots of minor CCM fixes
  923. + Split (most) resource management code into a separate file.
  924. + Fixes to accommodate different versions of libraries
  925. + Heartbeat API client headers fixup
  926. + Added new API calls
  927. + Simplified (and fixed) handling of local status. This would sometimes cause
  928. obscure failures on startup.
  929. + Added new IPsrcaddr resource script
  930. KNOWN BUGS:
  931. + apphbd goes into an infinite loop on some platforms
  932. * Wed Oct 9 2002 Alan Robertson <alanr@unix.sh> (see doc/AUTHORS file)
  933. 0.4.9e:
  934. + Changed client code to keep write file descriptor open at all times
  935. (realtime improvement)
  936. + Added a "poll replacement" function based on sigtimedwait(2), which
  937. should be faster for those cases that can use it.
  938. + Added a hb_warntime() call to the application heartbeat API.
  939. + Changed all times in the configuration file to be in milliseconds
  940. if specified with "ms" at the end. (seconds is still the default).
  941. + Fixes to serious security issue due to Nathan Wallwork <nwallwo@pnm.com>
  942. + Changed read/write child processes to run as nobody.
  943. + Fixed a bug where ping packets are printed incorrectly when debugging.
  944. + Changed heartbeat code to preallocate a some heap space.
  945. + CCM daemon API restructuring
  946. + Added ipc_channel_pair() function to the IPC library.
  947. + Changed everything to use longclock_t instead of clock_t
  948. + Fixed a bug concerning the ifwalk() call on ping nodes in the API
  949. + Made apphbd run at high priority and locked into memory
  950. + Made a library for setting priority up.
  951. + Made ucast comm module at least be configurable and loadable.
  952. + Fixed a startup/shutdown timing problem.
  953. 0.4.9d:
  954. + removed an "open" call for /proc/loadavg (improve realtime behavior)
  955. + changed API code to not 1-char reads from clients
  956. + Ignored certain error conditions from API clients
  957. + fixed an obscure error message about trying to retransmit a packet
  958. which we haven't sent yet. This happens after restarts.
  959. + made the PILS libraries available in a separate package
  960. + moved the stonith headers to stonith/... when installed
  961. + improved debugging for NV failure cases...
  962. + updated AUTHORS file and simplified the changelog authorship
  963. (look in AUTHORS for the real story)
  964. + Added Ram Pai's CCM membership code
  965. + Added the application heartbeat code
  966. + Added the Kevin Dwyer's ipfail client code to the distribution
  967. + Many fixes for various tool versions and OS combinations.
  968. + Fixed a few bugs related to clients disconnecting.
  969. + Fixed some bugs in the CTS test code.
  970. + Added BasicSanityCheck script to tell if built objects look good.
  971. + Added PATH-like capabilities to PILS
  972. + Changed STONITH to use the new plugin system.
  973. + *Significantly* improved STONITH usage message (from Lorn Kay)
  974. + Fixed some bugs related to restarting.
  975. + Made exit codes more LSB-compliant.
  976. + Fixed various things so that ping nodes don't break takeovers.
  977. 0.4.9c and before:
  978. + Cluster partitioning now handled correctly (really!)
  979. + Complete rearchitecture of plugin system
  980. + Complete restructure of build system to use automake and port things
  981. to AIX, FreeBSD and solaris.
  982. + Added Lclaudio's "standby" capability to put a node into standby
  983. mode on demand.
  984. + Added code to send out gratuitous ARP requests as well as gratuitous
  985. arp replies during IP address takeover.
  986. + Suppress stonith operations for nodes which went down gracefully.
  987. + Significantly improved real-time performance
  988. + Added new unicast heartbeat type.
  989. + Added code to make serial ports flush stale data on new connections.
  990. + The Famous CLK_TCK compile time fixes (really!)
  991. + Added a document which describes the heartbeat API
  992. + Changed the code which makes FIFOs to not try and make the FIFOs for
  993. named clients, and several other minor API client changes.
  994. + Fixed a fairly rare client API bug where it would shut down the
  995. client for no apparent reason.
  996. + Added stonith plugins for: apcmaster, apcmastersnmp switches, and ssh
  997. module (for test environments only)
  998. + Integrated support for the Baytech RPC-3 switch into baytech module
  999. + Fixes to APC UPS plugin
  1000. + Got rid of "control_process: NULL message" message
  1001. + Got rid of the "controlfifo2msg: cannot create message" message
  1002. + Added -h option to give usage message for stonith command...
  1003. + Wait for successful STONITH completion, and retry if its configured.
  1004. + Sped up takeover code.
  1005. + Several potential timing problems eliminated.
  1006. + Cleaned up the shutdown (exit) code considerably.
  1007. + Detect the death of our core child processes.
  1008. + Changed where usage messages go depending on exit status from usage().
  1009. + Made some more functions static.
  1010. + Real-time performance improvement changes
  1011. + Updated the faqntips document
  1012. + Added a feature to heartbeat.h so that log messages get checked as
  1013. printf-style messages on GNU C compilers
  1014. + Changed several log messages to have the right parameters (discovered
  1015. as a result of the change above)
  1016. + Numerous FreeBSD, Solaris and OpenBSD fixes.
  1017. + Added backwards compatibility kludge for udp (versus bcast)
  1018. + Queued messages to API clients instead of throwing them away.
  1019. + Added code to send out messages when clients join, leave.
  1020. + Added support for spawning and monitoring child clients.
  1021. + Cleaned up error messages.
  1022. + Added support for DB2, ServeRAID and WAS, LVM, and Apache (IBMhttp too),
  1023. also ICP Vortex controller.
  1024. + Added locking when creating new IP aliases.
  1025. + Added a "unicast" media option.
  1026. + Added a new SimulStart and standby test case.
  1027. + Diddled init levels around...
  1028. + Added an application-level heartbeat API.
  1029. + Added several new "plumbing" subsystems (IPC, longclock_t, proctrack, etc.)
  1030. + Added a new "contrib" directory.
  1031. + Fixed serious (but trivial) bug in the process tracking code which caused
  1032. it to exit heartbeat - this occured repeatably for STONITH operations.
  1033. + Write a 'v' to the watchdog device to tell it not to reboot us when
  1034. we close the device.
  1035. + Various ldirectord fixes due to Horms
  1036. + Minor patch from Lorn Kay to deal with loopback interfaces which might
  1037. have been put in by LVS direct routing
  1038. + Updated AUTHORS file and moved list of authors over
  1039. * Fri Mar 16 2001 Alan Robertson <alanr@unix.sh>
  1040. + Version 0.4.9
  1041. + Split into 3 rpms - heartbeat, heartbeat-stonith heartbeat-ldirectord
  1042. + Made media modules and authentication modules and stonith modules
  1043. dynamically loadable.
  1044. + Added Multicast media support
  1045. + Added ping node/membership/link type for tiebreaking. This will
  1046. be useful when implementing quorum on 2-node systems.
  1047. (not yet compatible with nice_failback(?))
  1048. + Removed ppp support
  1049. + Heartbeat client API support
  1050. + Added STONITH API library
  1051. + support for the Baytech RPC-3A power switch
  1052. + support for the APCsmart UPS
  1053. + support for the VACM cluster management tool
  1054. + support for WTI RPS10
  1055. + support for Night/Ware RPC100S
  1056. + support for "Meatware" (human intervention) module
  1057. + support for "null" (testing only) module
  1058. + Fixed startup timing bugs
  1059. + Fixed shutdown sequence bugs: takeover occured before
  1060. resources were released by other system
  1061. + Fixed various logging bugs
  1062. + Closed holes in protection against replay attacks
  1063. + Added checks that complain if all resources aren't idle on startup.
  1064. + IP address takeover fixes
  1065. + Endian fixes
  1066. + Removed the 8-alias limitation
  1067. + Takeovers now occur faster (ARPs occur asynchronously)
  1068. + Port number changes
  1069. + Use our IANA port number (694) by default
  1070. + Recognize our IANA port number ("ha-cluster") if it's in /etc/services
  1071. + Moved several files, etc. from /var/run to /var/lib/heartbeat
  1072. + Incorporated new ldirectord version
  1073. + Added late heartbeat warning for late-arriving heartbeats
  1074. + Added detection of and partial recovery from cluster partitions
  1075. + Accept multiple arguments for resource scripts
  1076. + Added Raid1 and Filesystem resource scripts
  1077. + Added man pages
  1078. + Added debian package support
  1079. * Fri Jun 30 2000 Alan Robertson <alanr@unix.sh>
  1080. + Version 0.4.8
  1081. + Incorporated ldirectord version 1.9 (fixes memory leak)
  1082. + Made the order of resource takeover more rational: Takeover is now
  1083. left-to-right, and giveup is right-to-left
  1084. + Changed the default port number to our official IANA port number (694)
  1085. + Regularized more messages, eliminated some redundant ones.
  1086. + Print the version of heartbeat when starting.
  1087. + Print exhaustive version info when starting with debug on.
  1088. + Hosts now have 3 statuses {down, up, active} active means that it knows
  1089. that all its links are operational, and it's safe to send cluster
  1090. messages
  1091. + Significant revisions to nice_failback (mainly due to lclaudio)
  1092. + More SuSE-compatibility. Thanks to Friedrich Lobenstock <fl@fl.priv.at>
  1093. + Tidied up logging so it can be to files, to syslog or both (Horms)
  1094. + Tidied up build process (Horms)
  1095. + Updated ldirectord to produce and install a man page and be
  1096. compatible with the fwmark options to The Linux Virtual Server (Horms)
  1097. + Added log rotation for ldirectord and heartbeat using logrotate
  1098. if it is installed
  1099. + Added Audible Alarm resource by Kirk Lawson <lklawson@heapy.com>
  1100. and myself (Horms)
  1101. + Added init script for ldirectord so it can be run independently
  1102. of heartbeat (Horms)
  1103. + Added sample config file for ldirectord (Horms)
  1104. + An empty /etc/ha.d/conf/ is now part of the rpm distribution
  1105. as this is where ldirectord's configuration belongs (Horms)
  1106. + Minor startup script tweaks. Hopefully, we should be able to make core
  1107. files should we crash in the future. Thanks to Holger Kiehl for diagnosing
  1108. the problem!
  1109. + Fixed a bug which kept the "logfile" option from ever working.
  1110. + Added a TestCluster test utility. Pretty primitive so far...
  1111. + Fixed the serial locking code so that it unlocks when it shuts down.
  1112. + Lock heartbeat into memory, and raise our priority
  1113. + Minor, but important fix from lclaudio to init uninited variable.
  1114. * Sat Dec 25 1999 Alan Robertson <alanr@unix.sh>
  1115. + Version 0.4.7
  1116. + Added the nice_failback feature. If the cluster is running when
  1117. the primary starts it acts as a secondary. (Luis Claudio Goncalves)
  1118. + Put in lots of code to make lost packet retransmission happen
  1119. + Stopped trying to use the /proc/ha interface
  1120. + Finished the error recovery in the heartbeat protocol (and got it to work)
  1121. + Added test code for the heartbeat protocol
  1122. + Raised the maximum length of a node name
  1123. + Added Jacob Rief's ldirectord resource type
  1124. + Added Stefan Salzer's <salt@cin.de> fix for a 'grep' in IPaddr which
  1125. wasn't specific enough and would sometimes get IPaddr confused on
  1126. IP addresses that prefix-matched.
  1127. + Added Lars Marowsky-Bree's suggestion to make the code almost completely
  1128. robust with respect to jumping the clock backwards and forwards
  1129. + Added code from Michael Moerz <mike@cubit.at> to keep findif from
  1130. core dumping if /proc/route can't be read.
  1131. * Mon Nov 22 1999 Alan Robertson <alanr@unix.sh>
  1132. + Version 0.4.6
  1133. + Fixed timing problem in "heartbeat restart" so it's reliable now
  1134. + Made start/stop status compatible with SuSE expectations
  1135. + Made resource status detection compatible with SuSE start/stop expectations
  1136. + Fixed a bug relating to serial and ppp-udp authentication (it never worked)
  1137. + added a little more substance to the error recovery for the HB protocol.
  1138. + Fixed a bug for logging from shell scripts
  1139. + Added a little logging for initial resource acquisition
  1140. + Added #!/bin/sh to the front of shell scripts
  1141. + Fixed Makefile, so that the build root wasn't compiled into pathnames
  1142. + Turned on CTSRTS, enabling for flow control for serial ports.
  1143. + Fixed a bug which kept it from working in non-English environments
  1144. * Wed Oct 13 1999 Alan Robertson <alanr@unix.sh>
  1145. + Version 0.4.5
  1146. + Mijta Sarp added a new feature to authenticate heartbeat packets
  1147. using a variety of strong authentication techniques
  1148. + Changed resource acquisition and relinquishment to occur in heartbeat,
  1149. instead of in the start/stop script. This means you don't *really*
  1150. have to use the start/stop script if you don't want to.
  1151. + Added -k option to gracefully shut down current heartbeat instance
  1152. + Added -r option to cause currently running heartbeat to reread config files
  1153. + Added -s option to report on operational status of "heartbeat"
  1154. + Sped up resource acquisition on master restart.
  1155. + Added validation of ipresources file at startup time.
  1156. + Added code to allow the IPaddr takeover script to be given the
  1157. interface to take over, instead of inferring it. This was requested
  1158. by Lars Marowsky-Bree
  1159. + Incorporated patch from Guenther Thomsen to implement locking for
  1160. serial ports used for heartbeats
  1161. + Incorporated patch from Guenther Thomsen to clean up logging.
  1162. (you can now use syslog and/or file logs)
  1163. + Improved FreeBSD compatibility.
  1164. + Fixed a bug where the FIFO doesn't get created correctly.
  1165. + Fixed a couple of uninitialized variables in heartbeat and /proc/ha code
  1166. + Fixed longstanding crash bug related to getting a SIGALRM while in malloc
  1167. or free.
  1168. + Implemented new memory management scheme, including memory stats
  1169. * Thu Sep 16 1999 Alan Robertson <alanr@unix.sh>
  1170. + Version 0.4.4
  1171. + Fixed a stupid error in handling CIDR addresses in IPaddr.
  1172. + Updated the documentation with the latest from Rudy.
  1173. * Wed Sep 15 1999 Alan Robertson <alanr@unix.sh>
  1174. + Version 0.4.3
  1175. + Changed startup scripts to create /dev/watchdog if needed
  1176. + Turned off loading of /proc/ha module by default.
  1177. + Incorporated bug fix from Thomas Hepper <th@ant.han.de> to IPaddr for
  1178. PPP configurations
  1179. + Put in a fix from Gregor Howey <ghowey@bremer-nachrichten.de>
  1180. where Gregor found that I had stripped off the ::resourceid part
  1181. of the string in ResourceManager resulting in some bad calls later on.
  1182. + Made it compliant with the FHS (filesystem hierarchy standard)
  1183. + Fixed IP address takeover so we can take over on non-eth0 interface
  1184. + Fixed IP takeover code so we can specify netmasks and broadcast addrs,
  1185. or default them at the user's option.
  1186. + Added code to report on message buffer usage on SIGUSR[12]
  1187. + Made SIGUSR1 increment debug level, and SIGUSR2 decrement it.
  1188. + Incorporated Rudy's latest "Getting Started" document
  1189. + Made it largely Debian-compliant. Thanks to Guenther Thomsen, Thomas
  1190. Hepper, I単aki Fern叩ndez Villanueva and others.
  1191. + Made changes to work better with Red Hat 6.1, and SMP code.
  1192. + Sometimes it seems that the Master Control Process dies :-(
  1193. * Sat Aug 14 1999 Alan Robertson <alanr@unix.sh>
  1194. + Version 0.4.2
  1195. + Implemented simple resource groups
  1196. + Implemented application notification for groups starting/stopping
  1197. + Eliminated restriction on floating IPs only being associated with eth0
  1198. + Added a uniform resource model, with IP resources being only one kind.
  1199. (Thanks to Lars Marowsky-Bree for a good suggestion)
  1200. + Largely rewrote the IP address takeover code, making it clearer, fit
  1201. into the uniform resource model, and removing some restrictions.
  1202. + Preliminary "Getting Started" document by Rudy Pawul
  1203. + Improved the /proc/ha code
  1204. + Fixed memory leak associated with serial ports, and problem with return
  1205. of control to the "master" node.
  1206. (Thanks to Holger Kiehl for reporting them, and testing fixes!)
  1207. * Tue Jul 6 1999 Alan Robertson <alanr@unix.sh>
  1208. + Version 0.4.1
  1209. + Fixed major memory leak in 0.4.0 (oops!)
  1210. + Added code to eliminate duplicate packets and log lost ones
  1211. + Tightened up PPP/UDP startup/shutdown code
  1212. + Made PPP/UDP peacefully coexist with "normal" udp
  1213. + Made logs more uniform and neater
  1214. + Fixed several other minor bugs
  1215. + Added very preliminary kernel code for monitoring and controlling
  1216. heartbeat via /proc/ha. Very cool, but not really done yet.
  1217. * Wed Jun 30 1999 Alan Robertson <alanr@unix.sh>
  1218. + Version 0.4.0
  1219. + Changed packet format from single line positional parameter style
  1220. to a collection of {name,value} pairs. A vital change for the future.
  1221. + Fixed some bugs with regard to forwarding data around rings
  1222. + We now modify /etc/ppp/ip-up.local, so PPP-udp works out of the box
  1223. (at least for Red Hat)
  1224. + Includes the first version of Volker Wiegand's Hardware Installation Guide
  1225. (it's pretty good for a first version!)
  1226. * Wed Jun 09 1999 Alan Robertson <alanr@unix.sh>
  1227. + Version 0.3.2
  1228. + Added UDP/PPP bidirectional serial ring heartbeat
  1229. (PPP ensures data integrity on the serial links)
  1230. + fixed a stupid bug which caused shutdown to give unpredictable
  1231. results
  1232. + added timestamps to /var/log/ha-log messages
  1233. + fixed a couple of other minor oversights.
  1234. * Sun May 10 1999 Alan Robertson <alanr@unix.sh>
  1235. + Version 0.3.1
  1236. + Make ChangeLog file from RPM specfile
  1237. + Made ipresources only install in the DOC directory as a sample
  1238. * Sun May 09 1999 Alan Robertson <alanr@unix.sh>
  1239. + Version 0.3.0
  1240. + Added UDP broadcast heartbeat (courtesy of Tom Vogt)
  1241. + Significantly restructured code making it easier to add heartbeat media
  1242. + added new directives to config file:
  1243. + udp interface-name
  1244. + udpport port-number
  1245. + baud serial-baud-rate
  1246. + made manual daemon shutdown easier (only need to kill one)
  1247. + moved the sample ha.cf file to the Doc directory
  1248. * Sat Mar 27 1999 Alan Robertson <alanr@unix.sh>
  1249. + Version 0.2.0
  1250. + Make an RPM out of it
  1251. + Integrated IP address takeover gotten from Horms
  1252. + Added support to tickle a watchdog timer whenever our heart beats
  1253. + Integrated enough basic code to allow a 2-node demo to occur
  1254. + Integrated patches from Andrew Hildebrand <andrew@pdi.com> to allow it
  1255. to run under IRIX.
  1256. - Known Bugs
  1257. - Only supports 2-node clusters
  1258. - Only supports a single IP interface per node in the cluster
  1259. - Doesn't yet include Tom Vogt's ethernet heartbeat code
  1260. - No documentation
  1261. - Not very useful yet :-)
  1262. ###########################################################