I can tell what happened. The production site had multiple load-balanced web servers configured as a web farm, all pointing to the same code base (whereas the staging site had only one web server), and not every request was routed to the same server. So even if the first hit caused one server to run an expensive query and load the data into its cache, subsequent requests were likely to be sent to a different server that didn't have the data in its cache. That server then performed the same expensive operation to load the data into its own in-memory cache, even though the data was already available in another server's memory.
Can anyone help me translate this? I really can't work out what it means.
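The production environment runs several load-balanced servers, and not every request is routed to the same one. So when the first request comes in and the load balancer routes it to server A, server A puts the data for that request into its cache. When other requests arrive, they won't necessarily be routed to server A (you have several load-balanced servers), so the other servers end up doing the same work server A did, even though server A may already hold the same data in its cache.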
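To make that concrete, here is a rough simulation, not the original site's code. The WebServer class, the simulate function, the round-robin routing and the "report" key are all made up for illustration; a real load balancer may route differently.

```python
from itertools import cycle


class WebServer:
    """Hypothetical web server that keeps its own private in-memory cache."""

    def __init__(self, name):
        self.name = name
        self.cache = {}
        self.expensive_queries = 0

    def handle(self, key):
        # On a cache miss, this server runs the expensive query itself,
        # because it cannot see what the other servers have cached.
        if key not in self.cache:
            self.expensive_queries += 1
            self.cache[key] = f"data-for-{key}"  # stand-in for the slow query
        return self.cache[key]


def simulate(server_count, request_count, key="report"):
    """Send identical requests through an assumed round-robin load balancer."""
    servers = [WebServer(f"server-{i}") for i in range(server_count)]
    balancer = cycle(servers)
    for _ in range(request_count):
        next(balancer).handle(key)
    return [server.expensive_queries for server in servers]


staging = simulate(server_count=1, request_count=6)
production = simulate(server_count=3, request_count=6)

print(staging)     # [1]
print(production)  # [1, 1, 1]
```

With one server (staging) the expensive query runs once. With three servers it runs once per server, which is the repeated work described above.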
That's all the English text is saying. Switching to a distributed cache service should fix it. Give Memcached a try.
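A minimal sketch of the shared-cache idea, under some assumptions: SharedCache is just an in-process dict standing in for a distributed cache such as Memcached (it is not a real Memcached client), and WebServer, handle and the "report" key are hypothetical names. In a real deployment each server would talk to the cache service over the network instead.

```python
class SharedCache:
    """In-process stand-in for a distributed cache (assumption, not a real client)."""

    def __init__(self):
        self._store = {}

    def get(self, key):
        return self._store.get(key)

    def set(self, key, value):
        self._store[key] = value


class WebServer:
    """Hypothetical web server that reads and writes one cache shared by the farm."""

    def __init__(self, name, cache):
        self.name = name
        self.cache = cache
        self.expensive_queries = 0

    def handle(self, key):
        value = self.cache.get(key)
        if value is None:
            # Only the first server to miss pays for the expensive query.
            self.expensive_queries += 1
            value = f"data-for-{key}"  # stand-in for the slow query
            self.cache.set(key, value)
        return value


shared = SharedCache()
servers = [WebServer(f"server-{i}", shared) for i in range(3)]

# Spread six identical requests across the three servers in turn.
for i in range(6):
    servers[i % len(servers)].handle("report")

per_server = [server.expensive_queries for server in servers]
total_queries = sum(per_server)

print(per_server)     # [1, 0, 0]
print(total_queries)  # 1
```

Because every server checks the same cache, whichever server handles the first request loads the data once and the others reuse it.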