Translate this website:
Search this website:


BC/DRCloud StorageComplianceData CentresDeduplicationDisk/RAID/Tape/SSDsEthernet StorageSAN/NASTiered StorageVirtualization

Intelligent cacheing for optimised storage

Optimising for the virtual reality. By Andrew Buss, Service Director, Freeform Dynamics

 

Date: 1 Oct 2010

Virtualisation is an accepted part of server and data centre strategy. One of the biggest challenges is optimising the storage infrastructure to keep up with the changing workloads and storage demands. What are some of the key considerations to think about when investing in storage for the virtual data centre?           

Optimising storage assets for virtual workloads

Storage flexibility is often cited as one of the biggest challenges with a large-scale migration to server virtualisation. Moving to a dynamic storage architecture that can move and cache data may help to deliver performance where it is needed while keeping the costs in check.

Cacheing has long been used as a means to improving performance in computing. Anytime there is a step change in price, performance or features there are choices and trade-offs to be made that can dramatically affect the final implementation. One of the most common systems of cacheing, and one that works very effectively, is that of the CPU and memory system of modern PCs and servers. The CPU is where the work happens, and needs fast access to data wherever possible.

Early CPUs ran at about the same speed as memory so loads did not take many cycles. CPU speeds soon increased and now far outpace the speed of memory. The result is that today there are large delays when reading from and writing to memory.

If not addressed, processing would be held up for long periods and performance would suffer. For this reason, CPUs have cache memory built in that can hold recently used data so that it is almost instantly available.

At first, CPUs incorporated a single level of cache that was small as die space was at a premium. As memory sizes grew and applications used more resources, more cache memory was added to the CPU. But simply adding more was not the ideal solution, as access times grew with increasing cache size, while constraints such as cost and die size limits mean that cache memory remains a very limited resource. So today we have first, second and even third levels of cache as the different levels work together to get the best trade off of size, performance and cost.

Different CPUs, even from the same generation and vendor, have quite different cache setups depending on the expected usage model and price point. The trick to getting the performance is to balance the way the data is put into the cache and then replaced based on the usage history. Advanced techniques can also help the cache to perform well – for example, pre-fetching data can bring data into the cache and close to the CPU before it is necessary so that it is readily available when needed.

That’s great, but how and why is it relevant to the future of storage? Storage is an integral and expensive part of the computing infrastructure. It is also composed of a range of vastly different technologies, price points, features and performance, making it an ideal candidate for cacheing.

Another major issue is that as demand increases for yet more storage, applications are demanding high performance to satisfy the always-on generation of users. Many IT managers find that the storage infrastructure struggles to keep up with the demands of new technologies such as server virtualisation, as we can see below.

Mass storage is cheap, at least to acquire if not to manage, and plentiful, but performance and reliability leave much to be desired for high-end performance. Enterprise drives give good performance and reliability, but capacities are smaller and the prices higher. At the top end, solid state drives give excellent performance but the price points mean that they are generally unsuited to all but the most demanding applications.

It is possible to architect the storage system into various tiers, and allocate storage for applications on the appropriate tier to provide the performance and reliability needed, but it is a pretty blunt approach. Usage patterns may change by month, week, day or even hour. Virtualised workloads may ramp or down unpredictably, which may leave the storage struggling to adapt. Trying to cope with this by optimising manually may result in over-provisioning certain tiers of the storage system, rather making best use of the overall capabilities of the tier.

So this is where the CPU analogy comes into play. Caching can help to dynamically optimise the storage tiers by not requiring which tier the data resides in to be fixed and static. Instead, a common storage controller acts as the front end to all the storage tiers, and is able to determine in real time which data is used by various applications, and what operations are performed on it. It is able to move data between tiers, ensuring that data that is most frequently used or modified is placed in the higher performing tiers automatically, increasing utilisation of the most expensive and high performance storage tiers and lifting performance across the board. Taking another leaf from the learnings of CPU caching, prefetching data based on policy before it is needed can also move data into the top performing tiers in advance of it being accessed so that it is immediately available with high performance when needed, but without it taking up valuable space in the times when it is not critical. Such an example could be payroll processing, which is an activity that takes place in a short window every month that has a high business impact and risk when running, but that is generally inactive the rest of the month. Of course, this should not impact on the ability to also decide which data should always reside in a particular tier. Data can be pinned in place so that vital business applications can get the performance guarantees that they need, while the least used or least “important” data can be prevented from polluting the cache in the higher level tiers.

The value of investing in a caching architecture is best realised if it can encompass the entire storage stack, moving to a virtualised model so that data is not tied to physical storage and where management tools and automation take centre stage. This may take some time to come to full fruition as there is a large amount of installed storage that will be in place and in active use for many years. Depending on the cacheing solution, it may not be suitable to utilise this existing storage kit. Ripping it all out and replacing it is unlikely to be an option, so it will make sense to phase any cacheing implementation in gradually as new investments are made at the highest tiers of the storage hierarchy and to incorporate lower tiers as they are modernised.

www.freeformdynamics.com

ShareThis

« Previous article

Next article »

Tags: Virtualization

Related White Papers

11 Jan 2012 | White Papers

The Infoblox VMware vCenter Orchestrator Plug-in by Infoblox

The ease and speed with which enterprises can deploy virtual machines has outpaced their ability to provide IP address services to them in a timely fashion. ... Download white paper

23 Nov 2011 | White Papers

Automated Storage Tiering on Infortrend’s ESVA Solution by Infortrend

This white paper introduces automated storage tiering on Infortrend’s ESVA storage solutions. Automated storage tiering can generate significant advant... Download white paper

Read more White Papers»

Related News

21 May 2013 | BC/DR

21 May 2013 | BC/DR

20 May 2013 | BC/DR

16 May 2013 | BC/DR

Read more News »
Related SNS UK TV & Audio

16 Jan 2012 | Virtualization

Ciena and Colt - Network Encryption Managed Service

Colt has worked closely with Ciena to offer a new Network Encryption Service that provides secure, high-bandwidth transport with very low latency.

26 Sep 2011 | BC/DR

Introduction to Remote Replication

Shannon Lasiter introduces the Remote Replication option on Dell's PowerVault MD36x0f storage array.

29 Aug 2011 | BC/DR

ATSB: X5000 at HP DISCOVER

The HP Storage X5000 Network Storage System is a Converged System that runs Windows Storage Server 2008 R2 and was announced at HP Discover. In this vlog, HPStorageGuy talks to HP and Microsoft about the new product.

More SNS UK TV»

More Audio»

Related Web Exclusives

11 Feb 2013 | BC/DR

  • A look into the future

    Now that 2012 is nearly over, I guess it’s time to start looking at what’s coming down the track in 2013. Here are my top five predictions for th... Read more

4 Feb 2013 | BC/DR

4 Feb 2013 | BC/DR

17 Dec 2012 | BC/DR

Read more Web Exclusives»

Related Magazine Articles

October 2010 | SAN/NAS

October 2010 | Cloud Storage

  • The waiting is over!

    Don’t Miss SNW Europe, Datacenter Technologies and Virtualization World, 26th and 27th October 2010, Congress Frankfurt; where can you meet over 70 org... Read more

October 2010 | Virtualization

October 2010 | Ethernet Storage

  • In search of the Holy Grail

    SNS-UK talks with Simon Ragg, Managing Director of S3 (Simplified Storage Solutions), about the many challenges and opportunities facing the reseller in a co... Read more

Read more Magazine Articles»

Related Supplements

1 Oct 2008 | Virtualization

Discovering Business Continuity in a Virtualized Environment

At first, organisations saw VMware server virtualization mainly as a way to save money on their hardware and power budgets. Now though, innovative users have realised that virtualization can make vital contributions in many other ways as well - in particular, they are using it to improve application availability and enhance their disaster recovery capabilities.

Click here to learn more »

Read more Supplements »

Recruitment

Latest IT jobs from leading companies.

 

Click here for full listings»