...
- 1) Personnel cost to maintain the internet networking / hardware.
- 2) UPS Power
- 3) Electricity
- 4) No Worries about backup server
- 5) You don't need to worry about scaling
- 6) Everything is virtual, that means, at the click of one button, we can have disaster recovery off-site… to Japan / USA or other places.
...
...
To trouble Shoot Server Slow Unresponsive, always refer to basic guide at /wiki/spaces/WU/pages/11568251
The steps to checks:
...
Jboss memory allocation
If RAM usage is very high, check the memory allocation for jboss : /usr/java/jboss-4.0.1RC2/server/default/deploy/jbossweb-tomcat50.sar/server.xml
See if the xms is too low or xmx too high - compared to the RAM size.
...
If jboss-stop doesn't run anything
use ps aux|grep java
- kill -9 2304
- /etc/init.d/postgresql-9.2.4 restart
- jboss-start
- Check previous date server log :
- cd /usr/java/jboss/server/default/log/
- ls -lhrt Check the specific date of the server log
...
- Refer to /wiki/spaces/WU/pages/37945457
...
PostgreSQL states:
...
- If you are not able to connect to the wsemp database, most probably your postgres has not started yet.
- run command as root, /etc/init.d/postgresql-9.2.4 start and try psql command
...
To look at running query
SELECT pg_stat_get_backend_pid(s.backendid) AS procpid,
pg_stat_get_backend_activity(s.backendid) AS current_query
FROM (SELECT pg_stat_get_backend_idset() AS backendid) AS s;
...
Postgresql Log
...
Located at /var/lib/pgsql/data/serverlog
...
Errors are usually very apparent, they will tell you it's an error
You can locate the related table / query that might be causing the problem
...
- Check the whether the VACUMM is running.
- To check as empbackup users. run vim backup.log and you need to make sure the latest backup log shows the VACUMM is done.
- To check as empbackup users. run vim backup.log and you need to make sure the latest backup log shows the VACUMM is done.
- Inform customer that you will repopulate database again at night. In same cases, the slowness is gone after we re-populate their database. This can't be done at working hours unless customer ask you to
...
There are many reasons that could lead to the users experiencing slowness, by understanding the root cause of the system slowness, we will be able to troubleshoot effectively. In order to resolve the problems effectively, it is extremely important for the users to provide the details. To some users, slow means if a page is loaded more than 2 seconds, while slowness could mean 20 seconds for the same page to be loaded. Instead of using qualitative words (big , small, slow, fast, good, not good), we will use quantitative words while communicating with the engineers who is going to help you in seconds, minutes, hours, date / time etc. Also, in order to be specific, we are going to indicate which page is slow, and which page (function / screen / module) takes how many seconds to load, and how many seconds to generate a certain report. And if we are generating the report, we will indicate exactly which date range, which branches / company, what category filter or settings we use to generate the reports. All this information is essential for us to help you identify the root cause quickly, and resolve the problems effectively.
Possible Root Cause of the System Slowness, Detection, and Solutions
...
Client Device
...
The fastest way to check if the slowness is caused by the client device, is to access the same website / functions with multiple other devices. If Accessibility from other devices are normal, then the probability of slowness caused by the slowness in the device itself is almost certain.
Regardless of whether it is mobile or desktop devices, see below for more steps to identify the root cause:
- Android Devices
- To monitor your android device performance, you could download Apps that does the trick for you, there are plenty of apps available in the market, there are some recommendations here : https://www.androidpit.com/best-apps-for-monitoring-system-performance-on-your-android-device
- iOS
- There are many system monitoring Apps in the iOS world as well, examples as below:
- Mac
- Out of the box, we could use the Activity Monitor that comes with Mac OS
- Windows
- There are many different versions of Windows out there, and there are plenty of resources for different versions of Windows, some of the useful resources as below:
Apart from System Performance monitoring, it is never alien to our technical support team that some of the client devices that have been infected by Virus is known to perform badly. Usually, when we detected high CPU usage, and high Network Bandwidth, we would also be recommending the users to install anti-virus software to detect the potential security threat to the devices.
Another possibility is the web browser. When it comes to web-browser usage, settings, the following areas could be checked:
- DNS settings
- Too many tabs
- Cache
- Proxy Settings
Depends on the root cause from the troubleshooting of the client device, users could upgrade the RAM, Processor, or perform anti-virus scanning accordingly.
At the time of reading, the following guide may be out-dated, but google is your best friends, there are plenty of resources available online to resolve the problems that you are having with optimizing your devices.
...
- http://www.hongkiat.com/blog/increase-android-performances/
- http://www.infoworld.com/article/2892444/android/android-tune-up-how-to-boost-performance-while-you-wait-for-lollipop.html
...
...
- http://www.thexlab.com/faqs/performance.html
- http://lifehacker.com/5896699/how-to-speed-up-clean-up-and-revive-your-mac
...
When devices are not connected directly to the internet, it is possible it goes thru routers, switches, proxy servers and other possible network routes to access the Wavelet EMP on the cloud. There are a few ways to check and detect if the problems are caused by the Local Area Network:
- Non-technical approach
- Getting help from another users from another location to access the website
- Use a different device that directly connect to the internet, for example you are having problems with your computer accessing the website, but you can test with your phone (without connecting to Wifi, use direct internet connection, 3G etc)
- Using Ping
- Ping your router IP address (not the google and not other external websites / servers)
- Monitoring the network performance on the networking devices
Widget Connector url https://www.youtube.com/watch?v=zTuM6NJyk94 - this hyperlink provides tutorial for D-Link routers, but other routers should be similar as well, refer to the respective user manual for other routers.
- Other devices within the same local area network
- Sometimes, there's no problem with your device or routers, but there could be another device within the local area network that is infected by Virus, and consuming significant amount of bandwidth. It is also possible that somewhere in the LAN (Local Area Network), there's another users installed some applications like Torrent that download and share large files, or watching youtube, or consuming some live streaming content.
...
On the client side, for easier troubleshoot. Please ask customer to use right tools such as Teamviewer. Top 3 reasons from client side
- The Obvious: The URL has a typo.
- Ask the user to double-check the URL. This may seem obvious but it is very critical that the user puts in the correct URL. It is better to ask the user to send you the URL so that you can examine and try it yourself. There are three things to note in the URL.
- Is the protocol correct? (i.e) http. Your Web Server admin could have very well blocked port 80 altogether
- Is the domain name and the resource being accessed correctly spelled?
- Is the Port number correct ? (if any). The browser will default http to 80 or http to 8080.
...
- Root cause : There is some inconsistent connection between application and database server. You can tested by ssh to application server, and make connection from application to db server. Don't idle your connection. If you lost connection in between 10-15 minutes, there is problem with the network
- We can advise customer to install another segregated switch, with additional cables between the application and database server
...
Troubleshoot Internet connection problems
http://windows.microsoft.com/en-us/windows-vista/troubleshoot-internet-connection-problems
Guide to set DNS
https://developers.google.com/speed/public-dns/docs/using?hl=en
...
If you are connected from "Public Internet", you might experienced having difficulties of accessing URL which causes by:
- Enhanced Protected Mode
- Browser History
- Browser Add-On
- Proxy and DNS Settings
- Browser need to be reset
- Check whether a third-party service, program, or anti-virus is conflicting with browser
- Temporarily disabling the firewall
- Updating older drivers and editing registry key TabProcGrowth
- Checking Windows Updates for drivers
- Restore or refresh your PC
...
Refer to this wiki for solutions
https://support.microsoft.com/en-us/kb/956196
...
- Identify the customer.
- Always refer to complete customer server list https://docs.google.com/spreadsheets/d/1nfHlOJYQhhxe_aMSOLJd6zC1cL9dptmPPf8t7zMwllE/edit#gid=0
- Getting server URL from the customer, sometimes it might be IP address. Note that don't confuse the server IP address with the IP address of local network which most of time is similar to 192.168.X.Y.
- Identify the location of the server.
- Customer own host or Datae Center, please contact the person in charge from Data Center
- Internet connectivity of data center or customer host ( Point 1 to 4)
- Customer server condition.
- ssh port is accessible? Port forwarding access
- backup database can be restored and manually restored to wavelet cloud or AWS
- If customer server located at AWS, login to aws.amazon.com using (wavelet-production or wavelet-asia) account and go to EC2 Instance and check the instance status
Amazon Asia Server Account
https://wavelet-asia.signin.aws.amazon.com/consoleAmazon Production Server Account
https://wavelet-production.signin.aws.amazon.com/console- Refer to
If customer located at Azure, login to
- Azure Portal https://portal.azure.com/
- Refer to /wiki/spaces/WU/pages/11567303
- If customer servers located at Wavelet Cloud office, refer to
- Customer own host or Datae Center, please contact the person in charge from Data Center
...
...
If the network out too high, use netstat to observe which IP that causing high network out. kill and chmod the process will help
...
- Customer server moved under different network http://intranet.wavelet.asia/projects/tech/wiki/When_Customer_server_is_moved_under_a_different_network
- Check IP Config http://intranet.wavelet.asia/projects/tech/wiki/Check_Ip_Config
- Set Static IP via terminal http://intranet.wavelet.asia/projects/tech/wiki/Setting_Static_IP_Via_Terminal
- Solve Session Time Out error in Firefox http://intranet.wavelet.asia/projects/tech/wiki/Solve_Session_Time_Out_error_in_Firefox_(credit_to_Annie_Tan_YMS)
- Linux System Security http://intranet.wavelet.asia/projects/tech/wiki/System_Security
...
...
- Server Operating Systems : Ubuntu, Centos, Amazon Linux AMI
- Useful LINUX command
- The different between Ubuntu and Centos
- Ubuntu : apt-get
- Centos : yum
- The different between Ubuntu and Centos
...
There are many reasons that could lead to the users experiencing slowness, by understanding the root cause of the system slowness, we will be able to troubleshoot effectively. In order to resolve the problems effectively, it is extremely important for the users to provide the details. To some users, slow means if a page is loaded more than 2 seconds, while slowness could mean 20 seconds for the same page to be loaded. Instead of using qualitative words (big , small, slow, fast, good, not good), we will use quantitative words while communicating with the engineers who is going to help you in seconds, minutes, hours, date / time etc. Also, in order to be specific, we are going to indicate which page is slow, and which page (function / screen / module) takes how many seconds to load, and how many seconds to generate a certain report. And if we are generating the report, we will indicate exactly which date range, which branches / company, what category filter or settings we use to generate the reports. All this information is essential for us to help you identify the root cause quickly, and resolve the problems effectively.
Possible Root Cause of the System Slowness, Detection, and Solutions
No | Possible Root Cause | Troubleshooting Guide | Solutions | |||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | Client Device
| The fastest way to check if the slowness is caused by the client device, is to access the same website / functions with multiple other devices. If Accessibility from other devices are normal, then the probability of slowness caused by the slowness in the device itself is almost certain. Regardless of whether it is mobile or desktop devices, see below for more steps to identify the root cause:
Apart from System Performance monitoring, it is never alien to our technical support team that some of the client devices that have been infected by Virus is known to perform badly. Usually, when we detected high CPU usage, and high Network Bandwidth, we would also be recommending the users to install anti-virus software to detect the potential security threat to the devices. Another possibility is the web browser. When it comes to web-browser usage, settings, the following areas could be checked:
| Depends on the root cause from the troubleshooting of the client device, users could upgrade the RAM, Processor, or perform anti-virus scanning accordingly. At the time of reading, the following guide may be out-dated, but google is your best friends, there are plenty of resources available online to resolve the problems that you are having with optimizing your devices.
| |||||||||||||||
2 | Client Local Area Network | When devices are not connected directly to the internet, it is possible it goes thru routers, switches, proxy servers and other possible network routes to access the Wavelet EMP on the cloud. There are a few ways to check and detect if the problems are caused by the Local Area Network:
| ||||||||||||||||
3 | Client Internet Connectivity | On the client side, for easier troubleshoot. Please ask customer to use right tools such as Teamviewer. Top 3 reasons from client side
| Troubleshoot Internet connection problems | |||||||||||||||
4 | Public Internet | If you are connected from "Public Internet", you might experienced having difficulties of accessing URL which causes by:
| Refer to this wiki for solutions | |||||||||||||||
5 | Data Center Network |
|
| |||||||||||||||
6 | Server Network |
|
| |||||||||||||||
7 | Server Operating Systems |
|
| |||||||||||||||
8 | Server Application Server | To trouble Shoot Server Slow Unresponsive, always refer to basic guide at /wiki/spaces/WU/pages/11568251 The steps to checks:
| ||||||||||||||||
9 | Server Database Server |
| common error is when postgres cannot start as postmasterpid has invalid data | |||||||||||||||
10 | Server Hard Disk |
|
Note: Don't delete anything inside /var/lib/pgsql/data, we stored database inside that folder. It's normal for that folder have big size |
| |
11 | Wavelet EMP Itself | Server slowness can be found only from specific module from EMP caused by Inefficient code or SQL queries
|
|
|
| |||
12 | Usage of EMP |
|
|