ESX3 Installation
Requirements
Media
Current
(Versions current as of Feb 09)
- Memtest bootable CD (v3.2 or later)
- ESX v3.5 update 1 install CD (.\Software_Repository\ESX\esx-3.5.0_Update_1-82663.iso)
- Hardware monitoring agent
- DELL OpenManage v5.4 install (.\Software_Repository\DELL\OM_5.4.0_ManNode_A01.tar.gz)
- HP Insight Manager v8.1 install (.\Software_Repository\HP\hpmgmt-8.1.1-vmware3x.tar)
Info
- Management IP address, mask, and VLAN ID (if applicable)
- VMKernel IP address, mask
- Default gateway, and DNS server IP’s
- Fully qualified domain name (e.g. esxp1.company.com)
ESX Software Installation Procedure
Hardware Checks
- Install any required cards, memory upgrades etc
- Default BIOS options are fine, though of note;
- Hyperthreading should be disabled if available
- CPU Virtualisation Technology should be enabled (disabled for ESX's versions prior to v3.5, enabled by default on DELL's, disabled by default on HP's)
- If a DELL with a pre-configured OS, boot the system up and ensure the system is as expected, especially;
- Number of CPU's and speed
- Memory size
- Network cards
- Physical storage (local hard-drives and SAN)
- Reboot the system to Memtest, which should auto-start. Leave to run for at least one pass (eg overnight). Any errors must be resolved prior installation of ESX.
- Configure local hard-drive RAID virtual disk (configuration not normally required on purchased for purpose DELL servers)
- If not using SAN storage for VM's its recommended that you use RAID 10, requires 4 or more disks, ensures optimum performance
- If using SAN storage, local disk pack is used for ESX software, should be at least RAID 1 if 2 disks, RAID 5 recommended if 3 disks.
- Configure only 1 RAID pack, do not carve up physical storage into multiple logical drives
ESX Software Install
If you are (re)installing an ESX that's connected to an existing SAN partition you must disconnect the SAN cables prior to install. The ESX install will write new UUID's to the SAN partition making existing ESX's unable to access the SAN via the original UUID
- Insert ESX install CD and reboot
- If it’s the 1st usage of the CD, perform the media check (takes about 5 mins), otherwise Skip
- Select GUI setup mode
- Select United Kingdom keyboard
- Select appropriate mouse
- Acknowledge Yes to erase ALL DATA and create new partitions
- Accept EULA
- Check disk size is as expected. Select recommended partition config, and confirm erasing existing partitions
- Edit suggested partitioning such that;
- boot 100 MB -> 200 MB
- swap 544 MB -> 1024 MB
- log 1992 MB -> 2048 MB
- delete VMFS partition
- Leave bootloader config as is
- Enter full network config for management interface;
- IP and mask
- Default gateway
- DNS servers
- Domain name - you must enter a fully qualified domain name (otherwise new VM deployments can fail)
- If the management IP is on a trunked VLAN switchport you must enter the VLAN (802.1q) ID
- Uncheck 'Create a default network for virtual machines'
- In Location tab select Europe\London, select UTC in UTC Offset tab and in the Map tab leave ‘System clock uses UTC’ checked
- Enter root password (syndrome can only be used for test hosts)
- Review config (especially double check the partitioning and network config, it’s a pain to change these afterwards)
- Install will complete, eject the CD, and await reboot
- Logon as root and issue the following command:
esxcfg-firewall -o 53,tcp,out,dnsClientTCP
Create Local User Accounts
- Disable password expiry
esxcfg-auth --passmaxdays=99999
- Check the login.defs file has been updated
more /etc/login.defs
- Confirm the following the exists:
PASS_MAX_DAYS 99999
- Create local user accounts for console access - NOTE that setting a password is optional and not required if an Active Directory account userid is going to be used.
useradd
usernamepasswd
username you'll then be asked for a password
Remaining tasks can be completed remotely
Post ESX Installation Server Config
Config NTP Sources
Sets up the system to use ntp-svr-a and ntp-svr-b, can be configured via the VI Client but doesn't seem to work reliably
- Login to system, and use root privelidges (
su -
) - Enable NTP through service console firewall
esxcfg-firewall --enableService ntpClient
- Backup ntp config file
cp /etc/ntp.conf /etc/ntp.conf.bak
- Edit file using VI, so that in the appropriate sections…
# Prohibit general access to this service.
restrict default kod nomodify trap
# server mytrustedtimeserverip
server ntp-svr-a
server ntp-svr-b
- Backup step tickers file
cp /etc/ntp/step-tickers /etc/ntp/step-tickers.bak
- Edit file using VI, so that…
ntp-svr-a
ntp-svr-b
- Restart NTP service
service ntpd restart
- Service should restart successfully (see below), otherwise fallback config file and restart
Shutting down ntpd: [FAILED]
ntpd: Synchronizing with time server: [ OK ]
Starting ntpd: [ OK ]
- Check system time / clock
Date
- Configure service to start automatically
chkconfig ntpd on
- Check service has been entered into run level config
chkconfig --list ntpd
ntpd 0:off 1:off 2:on 3:on 4:on 5:on 6:off
Config SNMP Trap Destinations
Remember that Netcool needs to be made aware of the sending ESX server for traps to populate views.
- Login to system, and use root privelidges (
su -
) - Enable SNMP through firewall
esxcfg-firewall --enableService snmpd
- Backup SNMP config file
cp /etc/snmp/snmpd.conf /etc/snmp/snmpd.conf.bak
- Edit file snmpd.conf using VI, so that is eactly as shown here:
# syscontact Server Systems Team # # Create a Read only community called sstesx that is only accessible by authorised pollers. # rocommunity sstesx poller1 rocommunity sstesx poller2
# # Send traps to both Netcool pollers with a community name of ssttrap. # trapsink receiver1 ssttrap trapsink receiver2 ssttrap authtrapenable 1 # # Enable VMware MIB modules. # dlmod SNMPESX /usr/lib/vmware/snmp/libSNMPESX.so # # Allow Dell's Systems Management Data Engine SNMP to connect to snmpd using SMUX # smuxpeer .1.3.6.1.4.1.674.10892.1 #
- Start SNMP service
service snmpd restart
- Service should start successfully (see below), otherwise fallback config file and restart
Starting snmpd: [ OK ]
- Set SNMP agent to start automatically
chkconfig snmpd on
- Check service has been entered into run level config
chkconfig --list snmpd
snmpd 0:off 1:off 2:on 3:on 4:on 5:on 6:off
Install Centennial
- Copy across the installation files to the ESX
- EG
pscp C:\Software_Repository\Unix_Centennial\* user@ESXserver:/home/user/centennial
- EG
- On the ESX console become root, change into the directory, set execution permissions and run the installation procedure.
su -
cd /home/user/centennial
(substitute user appropriately).chmod 775 *
./deploy_mpclient.sh redhat_i386
- Confirm services are running...
service disco status
Enable Active Directory Authentication
- Enter the following command. Note that 'enablead' is not a typo it being 'enable ad' and not 'enabled'.
esxcfg-auth --enablead --addomain=ukprod.company.com --addc=adserver.company.com
Install Hardware Monitoring Agents
Use the DELL or HP sections as appropriate!
Install DELL OpenManage
Previously done by CD, latest version is only available via tar file (currently).
- Copy the tar file onto the ESX using PSCP
- Generic command:
pscp C:\Software_Repository\DELL\OM_5.4.0_ManNode_A01.tar.gz user@ESXserver:/home/user
- Generic command:
- On the ESX, change directory so you're in your home directory
cd /home/user
- Uncompress the file
tar -zxvf OM_5.4.0_ManNode_A01.tar.gz
- Change directory into the created folders
cd linux/supportscripts
- Install OpenManage
./srvadmin-install.sh –b –w –r –s
- Enable OpenMange through the ESX server’s firewall
esxcfg-firewall -o 1311,tcp,in,OpenManageRequest
- Start the OpenManage services
srvadmin-services.sh start
, dsm_sa_datamgr32d can take a few mins to start
- Confirm the OpenManage services are running
srvadmin-services.sh status
- All processes should be in a running state
- Log out (exit), and return to ESX screen (Alt + F11)
Check monitored by / Add to DELL IT Assistant
- On ITA machine, run up IT Assistant, and login as administrator
- Check to see if the ESX appears in the list of servers (this is refreshed in one hour intervals, so you might need to check back later)
To add an ESX
- Go to Discovery and Monitoring | Ranges
- Assuming the IP of the ESX isn't covered by one of the existing ranges, right-click over 'Include Ranges
- Add ESX's IP address (or range of IP addresses if the first of a new cluster of ESX's)
- Accept all default options, except step 7, select Perform both Discovery and Inventory
Install HP InSight Manager
- Copy the tar file onto the ESX using PSCP
- Generic command:
pscp C:\Software_Repository\HP\hpmgmt-8.1.1-vmware3x.tar user@ESXserver:/home/user
- Generic command:
- On the ESX, change directory so you're in your home directory
cd /home/user
- Uncompress the file
tar -zxvf hpmgmt-8.1.1-vmware3x.tar
- Change directory into the created folders
cd hpmgmt/811
- Install InSight
./installvm811.sh --install
- Accept all default options, including firewall ports
Install NIC and HBA monitor script
Note: Now using ESXmon_v3 script
- Create a SST cron folder
mkdir /etc/cron.sst
- Make contents of RTS cron folder runnable every minute, edit /etc/crontab to append the following line
* * * * * root run-parts /etc/cron.sst
- Copy the monitor script, ESXmon_v3, into the ESX server from the VC in a similar fashion to #Install Centennial
pscp C:\Software_Repository\ESXmon_v2 user@ESXserver:/home/user/
- Move the file into the cron folder
mv /home/user/ESXmon_v3 /etc/cron.sst/
- Make the files executable
chmod +x /etc/cron.sst/*
Monitor script gets run every 1 min, it will send traps for all HBA's and NIC's when 1st run. It takes 2 mins for the changes to the crontab file to take effect, therefore (assuming Netcool knows about the ESX server) you should see some initial traps within 3 mins.
The script creates a number of hba* and nic* files, therefore if they exist the script is being run. If you delete these files, the script will have no record of the previous HBA and NIC states, and so will generate traps when next run as their states will change from Unknown to Up/Down.
A log file is also generated, ESXmon.log, this should log any HBA/NIC state changes, and all SNMP traps sent.
VMWare Configuration
Setup Management and Network Connectivity
- Add to Virtual Centre server
- Highlight the cluster the host should be a member of
- Right-click and select Add Host...
- Use a fully qualified domain name, and the root user/pass
- Configure network connectivity (following steps may need to change if your config is non standard)
- Go to Networking (on the Configuration tab) and select Properties... of vSwitch0 (you may have to scroll the screen to the right).
- On the Ports tab Add... the required networks
- VM Port Group names must be consistent across all ESX's in a cluster otherwise HA failover will not work
- Use the table below as a reference for consistent Port Group names
- On the Network Adapters tab, add the unassigned adapter.
Subnet mask crib sheet;
- 16 - 255.255.0.0
- 21 - 255.255.248.0
- 24 - 255.255.255.0
- 25 - 255.255.255.128
- 26 - 255.255.255.192
License ESX server
- Go to the Licensed Features option, and edit the ESX Server License Type
- Check the ESX Server Standard option, the ESX will request required licenses from the Virtual Centre server
Configure ESX Firewall
Note that if you've issued command line esxcfg-firewall commands and not rebooted, the results of these commands may not show up via the GUI. If in doubt, give the ESX a bounce.
- Go to the Security Profile and select Properties...
- Tick/Check the following options (do not untick options not specified)
- SNMP Server
- NFS Client
- NTP Client
Setup Storage
- Set-up the local VMFS partition
- Go to the Storage option, and select Add Storage...
- Select Disk/LUN
- Select the local device
- Tick the Use free space option
- Give datastore same name as ESX
- Accept default partition formatting options
- Set-up the NFS drive (shared storage on NFSserver for ISO's etc)
- Go to the Storage (SCSI, SAN, and NFS) option, and select Add Storage...
- Select Network File System and enter
- Server: NFSserver
- Folder: NFS_Share
- Datastore Name: NFSserver
Troubleshooting / Known Issues
Can't add to Virtual Centre
Virtual Centre server unable to connect to ESX host...
- Restart the ESX Managment agent
Test
Recorded procedure for installing / configuring, anything in this section has been attempted on one or more test ESX's, but isn't general installation.