What is Cache Memory? Cache Memory in Computers, Explained (2024)

What is cache memory?

Cache memory is a chip-based computer component that makes retrieving data from the computer's memory more efficient.

It acts as a temporary storage area that the computer's processor can retrieve data from easily. This temporary storage area, known as a cache, is more readily available to the processor than the computer's main memory source, typically some form of dynamic random access memory (DRAM).

Cache memory is sometimes called CPU (central processing unit) memory because it is typically integrated directly into the CPU chip or placed on a separate chip that has a separate bus interconnect with the CPU. Therefore, it is more accessible to the processor, and able to increase efficiency, because it's physically close to the processor.

In order to be close to the processor, cache memory needs to be much smaller than main memory. Consequently, it has less storage space. It is also more expensive than main memory, as it is a more complex chip that yields higher performance.

What it sacrifices in size and price, it makes up for in speed. Cache memory operates 10 to 100 times faster than RAM, requiring only a few nanoseconds to respond to a CPU request.

The name of the actual hardware that is used for cache memory is high-speed static random access memory (SRAM). The name of the hardware that is used in a computer's main memory is DRAM.

Cache memory is not to be confused with the broader term cache. Caches are temporary stores of data that can exist in both hardware and software. Cache memory refers to the specific hardware component that enables computers to create caches at various levels of the network.

Types of cache memory

Cache memory is fast and expensive. Traditionally, it is categorized as "levels" that describe its closeness and accessibility to the microprocessor. There are three general cache levels:

L1 cache, or primary cache, is extremely fast but relatively small, and is usually embedded in the processor chip as CPU cache.

L2 cache, or secondary cache, is often more capacious than L1. L2 cache may be embedded on the CPU, or it can be on a separate chip or coprocessor and have a high-speed alternative system bus connecting the cache and CPU. That way it doesn't get slowed by traffic on the main system bus.

L3 cache is typically larger and slower than L1 and L2 but still faster than main memory. In multicore processors, each core commonly has its own L1 and L2 cache, while all cores share a single L3 cache.

Cache memory mapping

Caching configurations continue to evolve, but cache memory traditionally works under three different configurations:

Direct mapped cache has each block mapped to exactly one cache memory location. Conceptually, a direct mapped cache is like rows in a table with three columns: the cache block that contains the actual data fetched and stored, a tag with all or part of the address of the data that was fetched, and a flag bit that shows the presence in the row entry of a valid bit of data.
Fully associative cache mapping is similar to direct mapping in structure but enables a memory block to be mapped to any cache location rather than to a prespecified cache memory location.
Set associative cache mapping can be viewed as a compromise between direct mapping and fully associative mapping in which each block is mapped to a subset of cache locations. It is sometimes called N-way set associative mapping, which provides for a location in main memory to be cached to any of "N" locations in the L1 cache.
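
The three mappings above can be illustrated with a short Python sketch. This is a toy model under assumed values, not a description of any real processor: the block size, the block count, the oldest-first replacement rule and the names `split_address` and `Cache` are all hypothetical. It shows how an address splits into a tag, a set index and an offset, and how two addresses that keep evicting each other in a direct mapped cache can coexist in a 2-way set associative cache.

```python
# Illustrative cache geometry (assumed values, not from any specific CPU).
BLOCK_SIZE = 64   # bytes per cache block
NUM_BLOCKS = 8    # total blocks in the cache


def split_address(address, ways):
    """Split a byte address into (tag, set_index, offset) for an N-way cache.

    ways=1 models a direct mapped cache; ways=NUM_BLOCKS models a fully
    associative cache with a single set.
    """
    num_sets = NUM_BLOCKS // ways
    offset = address % BLOCK_SIZE
    block_number = address // BLOCK_SIZE
    set_index = block_number % num_sets
    tag = block_number // num_sets
    return tag, set_index, offset


class Cache:
    """Toy N-way set associative cache that tracks tags only, not data."""

    def __init__(self, ways):
        self.ways = ways
        # Each set is a list of tags, oldest first (simple FIFO replacement).
        self.sets = [[] for _ in range(NUM_BLOCKS // ways)]

    def access(self, address):
        """Return True on a hit, False on a miss (the block is then loaded)."""
        tag, set_index, _ = split_address(address, self.ways)
        lines = self.sets[set_index]
        if tag in lines:
            return True
        if len(lines) == self.ways:
            lines.pop(0)  # evict the oldest block in this set
        lines.append(tag)
        return False


# Two addresses that share a set index but have different tags.
a, b = 0, BLOCK_SIZE * NUM_BLOCKS
pattern = [a, b, a, b]

direct = Cache(ways=1)
two_way = Cache(ways=2)
direct_hits = [direct.access(x) for x in pattern]
two_way_hits = [two_way.access(x) for x in pattern]
print("direct mapped:", direct_hits)
print("2-way set associative:", two_way_hits)
```

In the direct mapped case, each access evicts the block the next access needs, so every access misses. With two ways per set, both blocks fit and the repeated accesses hit.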

Data writing policies

Data can be written to memory using a variety of techniques, but the two main ones involving cache memory are the following:

Write-through. Data is written to both the cache and main memory at the same time.
Write-back. Data is only written to the cache initially. Data may then be written to main memory, but this does not need to happen and does not inhibit the interaction from taking place.

The way data is written to the cache impacts data consistency and efficiency. For example, when using write-through, more writing needs to happen, which causes latency upfront. When using write-back, operations may be more efficient, but data may not be consistent between the main and cache memories.

One way a computer tracks data consistency is by examining the dirty bit. The dirty bit is an extra bit kept with each cache block that indicates whether the block has been modified since it was loaded. A set dirty bit means the cached copy is newer than the copy in main memory, so the block must be written back to main memory before it is replaced. This situation arises under a write-back policy, because the data is written to the two storage areas asynchronously.
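
As a rough illustration of the two write policies, the following Python sketch models a cache in front of a dictionary that stands in for main memory. Everything here is an assumption made for the example: the `ToyCache` class, its two-line capacity, and the oldest-first eviction rule are hypothetical. It counts how often main memory is written under each policy and uses a per-line dirty bit to decide what needs writing back.

```python
class ToyCache:
    """Single-level cache over a dict standing in for main memory.

    policy is "write-through" or "write-back". Capacity and eviction order
    (oldest line first) are simplifications chosen for this sketch.
    """

    def __init__(self, memory, policy, capacity=2):
        self.memory = memory
        self.policy = policy
        self.capacity = capacity
        self.lines = {}          # address -> [value, dirty_bit]
        self.memory_writes = 0   # number of writes that reached main memory

    def _write_memory(self, address, value):
        self.memory[address] = value
        self.memory_writes += 1

    def _evict_if_full(self):
        if len(self.lines) < self.capacity:
            return
        victim = next(iter(self.lines))       # oldest inserted line
        value, dirty = self.lines.pop(victim)
        if dirty:                             # only modified lines are written back
            self._write_memory(victim, value)

    def write(self, address, value):
        if address not in self.lines:
            self._evict_if_full()
        if self.policy == "write-through":
            self.lines[address] = [value, False]
            self._write_memory(address, value)    # memory updated on every write
        else:
            self.lines[address] = [value, True]   # mark dirty, defer the memory write

    def flush(self):
        """Write every dirty line back to main memory and clear its dirty bit."""
        for address, line in self.lines.items():
            if line[1]:
                self._write_memory(address, line[0])
                line[1] = False


wt_mem, wb_mem = {}, {}
wt = ToyCache(wt_mem, "write-through")
wb = ToyCache(wb_mem, "write-back")
for i in range(10):
    wt.write(0x10, i)
    wb.write(0x10, i)

stale = wb_mem.get(0x10)   # main memory has not seen the write-back data yet
wb.flush()
print("write-through memory writes:", wt.memory_writes)
print("write-back memory writes:", wb.memory_writes)
```

The write-back cache touches main memory far less often, at the cost of main memory being out of date until the dirty line is flushed or evicted.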

Specialization and functionality

In addition to instruction and data caches, other caches provide specialized system functions. According to some definitions, the L3 cache's shared design makes it a specialized cache. Other definitions keep the instruction cache and the data cache separate and refer to each as a specialized cache.

Translation lookaside buffers are also specialized memory caches whose function is to record virtual address to physical address translations.

Still other caches are not, technically speaking, memory caches at all. Disk caches, for instance, can use DRAM or flash memory to provide data caching similar to what memory caches do with CPU instructions. If data is frequently accessed from the disk, it is cached into DRAM or flash-based silicon storage technology for faster access time and response.

Specialized caches are also available for applications such as web browsers, databases, network address binding and client-side Network File System protocol support. These types of caches might be distributed across multiple networked hosts to provide greater scalability or performance to an application that uses them.

Locality

The ability of cache memory to improve a computer's performance relies on the concept of locality of reference. Locality describes various situations that make a system more predictable. Cache memory takes advantage of these situations to create a pattern of memory access that it can rely upon.

There are several types of locality. Two key ones for cache are:

Temporal locality. This is when the same resources are accessed repeatedly in a short amount of time.
Spatial locality. This refers to accessing various data or resources that are near each other.
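
To make the two kinds of locality concrete, here is a small Python sketch that replays three made-up address traces through a tiny simulated cache. The cache geometry, the least-recently-used replacement rule and the `hit_rate` helper are assumptions for illustration only; real hardware differs in many details.

```python
from collections import OrderedDict


def hit_rate(addresses, block_size=8, num_blocks=4):
    """Replay an address trace through a tiny LRU cache; return hits / accesses."""
    cache = OrderedDict()   # block number -> None, least recently used first
    hits = 0
    for address in addresses:
        block = address // block_size
        if block in cache:
            hits += 1
            cache.move_to_end(block)        # mark as most recently used
        else:
            if len(cache) == num_blocks:
                cache.popitem(last=False)   # evict the least recently used block
            cache[block] = None
    return hits / len(addresses)


# Spatial locality: neighboring addresses fall in the same cached block.
sequential = list(range(64))
# Temporal locality: the same few addresses are reused again and again.
repeated = [0, 100, 200] * 20
# Neither: every access lands in a block that has not been seen before.
scattered = [i * 8 for i in range(64)]

for name, trace in [("sequential", sequential),
                    ("repeated", repeated),
                    ("scattered", scattered)]:
    print(name, hit_rate(trace))
```

The first two traces hit in the cache most of the time, while the scattered trace never does, which is why predictable access patterns matter so much to cache performance.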

Performance

Cache memory is important because it improves the efficiency of data retrieval. It stores program instructions and data that are used repeatedly in the operation of programs or information that the CPU is likely to need next. The computer processor can access this information quicker from the cache than from the main memory. Fast access to these instructions increases the overall speed of the program.

Aside from its main function of improving performance, cache memory is a valuable resource for evaluating a computer's overall performance. Users can do this by looking at cache's hit-to-miss ratio. Cache hits are instances in which the system successfully retrieves data from the cache. A cache miss is when the system looks for the data in the cache, can't find it, and looks somewhere else instead. In some cases, users can improve the hit-miss ratio by adjusting the cache memory block size -- the size of data units stored.
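
The effect of block size on hits and misses can be sketched in Python. The example below assumes a direct mapped cache with a fixed 64-byte capacity and two invented traces; the `hit_ratio` function and all sizes are hypothetical. The ratio is computed here as hits divided by total accesses, which is one common way to express the hit-miss relationship.

```python
def hit_ratio(addresses, block_size, capacity=64):
    """Replay a byte-address trace through a direct mapped cache.

    capacity is the total cache size in bytes, so a larger block_size
    means fewer, wider blocks. Returns hits / (hits + misses).
    """
    num_blocks = capacity // block_size
    stored_tags = [None] * num_blocks   # one tag per cache line
    hits = misses = 0
    for address in addresses:
        block = address // block_size
        index = block % num_blocks
        tag = block // num_blocks
        if stored_tags[index] == tag:
            hits += 1
        else:
            misses += 1
            stored_tags[index] = tag    # load the block, replacing the old one
    return hits / (hits + misses)


# Hypothetical trace 1: a sequential scan of 256 bytes.
scan = list(range(256))
# Hypothetical trace 2: alternating between two far-apart addresses.
ping_pong = [0, 68] * 8

scan_results = {size: hit_ratio(scan, size) for size in (4, 16, 64)}
ping_pong_results = {size: hit_ratio(ping_pong, size) for size in (4, 16, 64)}
print("scan:", scan_results)
print("ping-pong:", ping_pong_results)
```

For the sequential scan, larger blocks raise the hit ratio because each miss brings in more neighboring data. For the alternating trace, larger blocks leave fewer lines in the cache, so the two addresses collide and the hit ratio drops. That tradeoff is why block size has to be tuned to the workload.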

Improved performance and ability to monitor performance are not just about improving general convenience for the user. As technology advances and is increasingly relied upon in mission-critical scenarios, having speed and reliability becomes crucial. Even a few milliseconds of latency could potentially lead to enormous expenses, depending on the situation.

Cache vs. main memory

DRAM serves as a computer's main memory, holding the data and instructions that the CPU works on after they are retrieved from storage. Both DRAM and cache memory are volatile memories that lose their contents when the power is turned off. DRAM is installed on the motherboard, and the CPU accesses it through a bus connection.

DRAM is slower than L1, L2 or L3 cache memory, with the gap largest for L1, and much less expensive. It provides faster data access than flash storage, hard disk drives (HDDs) and tape storage. It came into use in the last few decades to provide a place to store frequently accessed disk data to improve I/O performance.

DRAM must be refreshed every few milliseconds. Cache memory, which also is a type of random access memory, does not need to be refreshed. It is built directly into the CPU to give the processor the fastest possible access to memory locations and provides nanosecond speed access time to frequently referenced instructions and data. SRAM is faster than DRAM, but because it's a more complex chip, it's also more expensive to make.

Cache vs. virtual memory

A computer has a limited amount of DRAM and even less cache memory. When a large program or multiple programs are running, it's possible for memory to be fully used. To compensate for a shortage of physical memory, the computer's OS can create virtual memory.

To do this, the OS temporarily transfers inactive data from DRAM to disk storage. This approach increases virtual address space by using active memory in DRAM and inactive memory in HDDs to form contiguous addresses that hold both an application and its data. Virtual memory lets a computer run larger programs or multiple programs simultaneously, and each program operates as though it has unlimited memory.

To copy virtual memory into physical memory, the OS divides memory into pages that each contain a fixed number of addresses. Pages that are not in use are stored on disk in a page file or swap file. When they're needed, the OS copies them from the disk to main memory and translates the virtual memory address into a physical one. These translations are handled by a memory management unit.
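
The translation step can be sketched in Python as follows. This is a simplified model under stated assumptions: the 4,096-byte page size, the `ToyMMU` class, its two-entry translation lookaside buffer and the oldest-first replacement rule are all hypothetical, and a real memory management unit does this work in hardware.

```python
PAGE_SIZE = 4096  # bytes per page; a common size, assumed here


class ToyMMU:
    """Translates virtual addresses using a page table plus a small TLB."""

    def __init__(self, page_table, tlb_entries=2):
        self.page_table = page_table   # virtual page number -> physical frame number
        self.tlb = {}                  # recently used subset of page_table
        self.tlb_entries = tlb_entries
        self.tlb_hits = 0
        self.tlb_misses = 0

    def translate(self, virtual_address):
        page, offset = divmod(virtual_address, PAGE_SIZE)
        if page in self.tlb:
            self.tlb_hits += 1
            frame = self.tlb[page]
        else:
            self.tlb_misses += 1
            if page not in self.page_table:
                # A real OS would handle this by loading the page from disk.
                raise LookupError(f"page fault on virtual page {page}")
            frame = self.page_table[page]
            if len(self.tlb) == self.tlb_entries:
                self.tlb.pop(next(iter(self.tlb)))   # drop the oldest entry
            self.tlb[page] = frame
        return frame * PAGE_SIZE + offset


# Hypothetical mapping: virtual page 0 -> frame 5, virtual page 1 -> frame 9.
mmu = ToyMMU({0: 5, 1: 9})
physical = mmu.translate(4100)   # virtual page 1, offset 4
again = mmu.translate(4200)      # same page, so the TLB already holds the mapping
print(physical, again, mmu.tlb_hits, mmu.tlb_misses)
```

The second lookup is served from the translation lookaside buffer without consulting the page table, which is the same caching idea applied to address translations.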

Implementation and history

Mainframes used an early version of cache memory, but the technology as it is known today began to be developed with the advent of microcomputers. With early PCs, processor performance increased much faster than memory performance, and memory became a bottleneck, slowing systems.

In the 1980s, the idea took hold that a small amount of more expensive, faster SRAM could improve the performance of the less expensive, slower main memory. Initially, the memory cache was separate from the system processor and not always included in the chipset. Early PCs typically had from 16 KB to 128 KB of cache memory.

With 486 processors, Intel added 8 KB of memory to the CPU as L1 memory. As much as 256 KB of external L2 cache memory was used in these systems. Pentium processors saw the external cache memory double again to 512 KB on the high end. They also split the internal cache memory into two caches: one for instructions and the other for data.

Processors based on Intel's P6 microarchitecture, introduced in 1995, were the first to incorporate L2 cache memory into the CPU and enable all of a system's cache memory to run at the same clock speed as the processor. Prior to the P6, L2 memory external to the CPU was accessed at a much slower clock speed than the rate at which the processor ran and slowed system performance considerably.

Early memory cache controllers used a write-through cache architecture, where data written into cache was also immediately updated in RAM. This approach minimized data loss, but also slowed operations. Later 486-based PCs developed the write-back cache architecture, where RAM isn't updated immediately. Instead, data is stored on cache and RAM is updated only at specific intervals or under certain circumstances where data is missing or old.

Editor's note: This article was revised in 2024 by TechTarget editors to improve the reader experience.

FAQs

What is cache memory in simple words?

Cache memory is an extremely fast memory type that acts as a buffer between RAM and the CPU. Cache Memory holds frequently requested data and instructions so that they are immediately available to the CPU when needed.

How to clear cache memory?

In Chrome

On your computer, open Chrome.
At the top right, click More, then Delete browsing data.
Choose a time range, like Last hour or All time.
Select the types of information you want to remove.
Click Delete data. If you delete cookies while signed in to Chrome, you won't be signed out of your Google Account.

How much cache memory is good for a laptop?

A good base for the L1 cache is 64 KB per core. You can find this out by dividing the total amount by the number of cores in the CPU. A 256 KB L2 cache per core is perfectly acceptable, but gamers might benefit from 512 KB per core. And anything between 32 and 96 MB of L3 cache is perfectly fine for most purposes.

What is the primary function of a cache memory in a computer?

Cache memory is used to improve the processing speed and efficiency of a computer system. It does this by temporarily storing frequently accessed or recently used data from the main memory (RAM) in a smaller, faster storage area that is closer to the CPU.

What happens if I delete cache memory?

It can free up more room for RAM and also help limit how websites use data on your phone to track you. Clearing cache on Android can also help fix issues and improve performance. That's because cached files can become corrupted or overloaded. Clearing cache can force a browser or app to update and work as it should.

Is it safe to clear cache?

It's not bad to clear your cached data now and then. Some refer to this data as “junk files,” meaning it just sits and piles up on your device. Clearing the cache helps keep things clean, but don't rely on it as a solid method for making new space.

Does clearing the cache delete passwords?

Cache refers to a temporary storage area on your device that stores recently visited web pages, so technically if you only clear the cached images and files, there are no passwords to worry about.

How do I empty the cache on my computer?

How to Clear Your Cache and. ...
In most computer-based web browsers, to open menus used to clear your cache, cookies, and history, press Ctrl-Shift-Delete (Windows) or Command-Shift-Delete (Mac). ...
If you don't see instructions below for your specific version or browser, search your browser's Help menu for "clear cache".

Is it a good idea to remove all cookies?

The cache and cookies should be cleared regularly for several reasons. Depending on your settings, the cache can grow quite big, use a lot of disk space on your computer and cause slow web browsing. The cache can also cause issues when viewing new versions of previously visited web pages.

What happens if you have too much cache memory?

A full cache takes up memory, and a device whose memory is nearly full runs more slowly. Clearing the backlog could make your device work quicker.

Where is cache memory located in a computer?

Cache memory is sometimes called CPU (central processing unit) memory because it is typically integrated directly into the CPU chip or placed on a separate chip that has a separate bus interconnect with the CPU.

How do I check my cache?

Open DevTools > Application > Storage > Cache Storage, then click a cache to view its contents.

What are the disadvantages of cache memory?

Disadvantages of Cache Memory

It is quite expensive.
The storage capacity is limited.

Why is cache expensive?

The more cache memory a computer has, the faster it runs. However, because of its high-speed performance, cache memory is more expensive to build than RAM.

What happens to data stored in RAM when a computer is turned off?

RAM is volatile, meaning it loses its stored data when the power is turned off or interrupted. This characteristic is a result of how RAM functions at the hardware level.

How do you explain cache to a child?

Kids Definition

: a place for hiding, storing, or preserving treasure or supplies.
: something hidden or stored in a cache.
: a computer memory with very short access time.

What happens when cache memory is full?

Have you ever noticed that your Android device starts to slow down after a while of using it? Or that some of your apps start to crash unexpectedly? This could be due to the accumulation of cached data on your device. This can slow down your device, cause apps to crash, and lead to other performance issues.

Is cache memory good or bad?

Cache memory is important because it improves the efficiency of data retrieval. It stores program instructions and data that are used repeatedly in the operation of programs or information that the CPU is likely to need next.

What happens when cache memory is removed from a computer?

Clearing your cache removes the temporary data your device has stored from tasks you repeat. You can choose to keep saved data such as logins, credentials and bank details, but data tied to tasks performed, for example websites visited in a browser, will be cleared.