The SSD Anthology: Understanding SSDs and New Drives from OCZ
by Anand Lal Shimpi on March 18, 2009 12:00 AM EST
Posted in: Storage
Disappointed, I went back to OCZ
OCZ was worried. The last time I reviewed one of their SSDs I was truthful about it, and it hurt their sales considerably. Customers were returning drives, and to OCZ’s credit, they stepped up and even accepted some returns themselves - something that most manufacturers wouldn’t have done. Regardless of what they had told me, there was some admission of fault there. Those JMicron drives were nothing short of crap.
As soon as OCZ started getting word that I wasn’t pleased with Vertex, they went into a state of panic. These drives all do very well in synthetic HDD tests like HDTach and ATTO. That’s generally all they’re reviewed in, so that’s all they’re tested in. But now OCZ was hearing that the Vertex wasn’t passing some of my tests and they had no idea what it was failing or why.
I tend to do a good job of keeping what tests I run secret until the review is published, so there isn’t any unfair optimization. I purposefully introduce new tests to our performance suites to help keep manufacturers honest and optimizing for real world usage scenarios rather than specific benchmarks. OCZ had no idea what I was running, but they knew that the Vertex wasn’t doing well.
Summit, on the other hand, was performing just fine, but that’s an expensive drive. Vertex was supposed to be good; it should’ve been good. There’s no reason for it to be performing this poorly. I ran the infamous Iometer test to see what was going on:
Iometer 4KB Random Writes, IOqueue=1, 8GB sector space | IOs per second | MB/s | Average Latency | Maximum Latency |
Original Pre-release OCZ Vertex | 20.7 | 0.08 MB/s | 48.2 ms | 484.5 ms |
How on earth is this acceptable at all? Average latency of 48.2ms and a maximum latency as bad as the Apex and G.Skill Titan drives? I’ve heard some SSD vendors dismiss the Iometer results but let me caution you against that. What these numbers are telling us is that on average, when your OS goes to write a 4KB file somewhere on your drive, it’ll take nearly 50ms. That’s 4.5x longer than a 5400 RPM 2.5” notebook drive and that’s the average case. What part of that sounds acceptable? Anyone who tells you otherwise is delusional.
I thought for sure that the drive was broken and that we’d made no progress since last fall. But the drive hadn’t launched yet; while there were glowing reviews of it, no one had wasted any money. I wrote an email to Ryan Petersen, OCZ’s CEO. I described my findings and told him that while the Vertex’s performance was better than any of the JMicron solutions, it was unacceptable for anything other than perhaps extremely light, single-tasking usage.
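The Iometer figures hang together arithmetically, which is a useful sanity check on the table. The following is a minimal sketch, assuming a queue depth of 1 (each write completes before the next is issued) and decimal megabytes; the helper names are illustrative and are not part of Iometer.

```python
# Relate IOPS, throughput, and average latency for a queue-depth-1 workload.
# Inputs are the pre-release Vertex figures from the Iometer table.

def throughput_mb_per_s(iops, io_size_bytes):
    """Throughput in MB/s, assuming 1 MB = 1,000,000 bytes."""
    return iops * io_size_bytes / 1_000_000

def implied_latency_ms(iops, queue_depth=1):
    """Average time per I/O implied by the IOPS figure, in milliseconds."""
    return queue_depth * 1000.0 / iops

IOPS = 20.7
IO_SIZE = 4 * 1024  # 4KB random writes

mb_s = throughput_mb_per_s(IOPS, IO_SIZE)
latency = implied_latency_ms(IOPS)
print(f"{mb_s:.2f} MB/s, about {latency:.1f} ms per write")
```

About 20 writes per second at 4KB each comes to roughly 0.08 MB/s, and one write roughly every 48 ms, in line with the measured average latency.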
I told him it sucked. He said that wasn’t fair. We argued over email but he came back and asked me what I needed to see to make the drive better.
I told him I’d need an average response time in the sub-1ms range and a max latency no worse than Intel’s 94ms. I didn’t think it would be possible. I was prepared for OCZ to hate me once more. He told me to give him a couple of days.
250 Comments
punjabiplaya - Wednesday, March 18, 2009
Great info. I'm looking to get an SSD but was put off by all these setbacks. Why should I put away my HDDs and get something a million times more expensive that stutters?
This article is why I visit AT first.
Hellfire26 - Wednesday, March 18, 2009
Anand, when you filled up the drives to simulate a full drive, did you also write to the extended area that is reserved? If you didn't, wouldn't the Intel SLC drive (as an example) not show as much of a performance drop, versus the MLC drive? As you stated, Intel has reserved more flash memory on the SLC drive, above the stated SSD capacity.
I also agree with GourdFreeMan that the physical block size needs to be reduced. Due to the constant erasing of blocks, the Trim command is going to reduce the life of the drive. Of course, drive makers could increase the size of the cache and delay using the Trim command until the number of blocks to be erased equals the cache available. This would more efficiently rearrange the valid data still present in the blocks that are being erased (fewer writes). Microsoft would have to design the Trim command so it would know how much cache was available on the drive, and drive makers would have to specifically reserve a portion of their cache for use by the Trim command.
I also like Basilisk's comment about increasing the cluster size, although if you increase it too much, you are likely to be wasting space and increasing overhead. Surely, even if MS only doubles the cluster size for NTFS partitions to 8KB, write cycles to SSDs would be reduced. Also, there is the difference between 32-bit and 64-bit operating systems to consider. However, I don't have the knowledge to say whether Microsoft can make these changes without running into serious problems with other aspects of the operating system.
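The wasted-space side of the cluster size trade-off mentioned above can be put in rough numbers. This is a sketch only: the file sizes are made up for illustration, and it assumes every file occupies whole clusters, ignoring file system details such as very small files being stored inside metadata.

```python
# Slack space: bytes allocated on disk but not used by file contents.

def allocated_bytes(file_size, cluster_size):
    """Space a file occupies when rounded up to whole clusters."""
    clusters = -(-file_size // cluster_size)  # ceiling division
    return clusters * cluster_size

def slack_bytes(file_sizes, cluster_size):
    """Total wasted bytes across all files for a given cluster size."""
    return sum(allocated_bytes(s, cluster_size) - s for s in file_sizes)

# Hypothetical file sizes in bytes, chosen only to show the effect.
sizes = [100, 4096, 5000, 12_000]

slack_4k = slack_bytes(sizes, 4096)
slack_8k = slack_bytes(sizes, 8192)
print(f"4KB clusters waste {slack_4k} bytes, 8KB clusters waste {slack_8k} bytes")
```

For this small sample, doubling the cluster size more than doubles the slack, because the files that fit a 4KB cluster exactly now leave half of an 8KB cluster empty. The sketch says nothing about whether larger clusters would actually reduce write cycles.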
Anand Lal Shimpi - Wednesday, March 18, 2009
I only wrote to the LBAs reported to the OS. So on the 80GB Intel drive that's from 0 - 74.5GB.
I didn't test the X25-E as extensively as the rest of the drives so I didn't look at performance degradation as closely just because I was running out of time and the X25-E is sooo much more expensive. I may do a standalone look at it in the near future.
Take care,
Anand
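The 80GB and 74.5GB figures in the reply above are consistent with each other if the drive label counts decimal gigabytes while the OS reports binary units. A short worked conversion, under that assumption:

```python
# Convert a drive's advertised decimal capacity to the binary units an OS reports.

def decimal_gb_to_gib(gb):
    """10^9-byte gigabytes expressed in 2^30-byte units."""
    return gb * 10**9 / 2**30

reported = decimal_gb_to_gib(80)
print(f"{reported:.1f}")  # 74.5
```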
gss4w - Wednesday, March 18, 2009
Has anyone at Anandtech talked to Microsoft about when the "Trim" command will be supported in Windows 7? Also it would be great if you could include some numbers from Windows 7 beta when you do a follow-up.
One reason I ask is that I searched for "Windows 7 ssd trim" and I saw a presentation from WinHEC that made it sound like support for the trim command would be a requirement for SSD drives to meet the Windows 7 logo requirements. I would think if this were the case then Windows 7 would have support for trim. However, this article made it sound like support for Trim might not be included when Windows 7 is initially released, but would be added later.
ryedizzel - Thursday, March 19, 2009
I think it is obvious that Windows 7 will support TRIM. The bigger question this article points out is whether or not the current SSDs will be upgradeable via firmware - which is more important for consumers wanting to buy one now.
Martimus - Wednesday, March 18, 2009
It took me an hour to read the whole thing, but I really enjoyed it. It reminded me of the time I spent testing circuitry and doing root cause analysis.
alpha754293 - Wednesday, March 18, 2009
I think it would be interesting if you could test the drives on the "desktop/laptop/consumer" front by writing an 8 GiB file using 4 KiB block sizes, etc. for the desktop pattern, and then also test the drives with larger file sizes and larger block sizes for the server/workstation pattern.
You present some very, very good arguments and points, and I found your article to be thoroughly researched and well put.
So I do have to commend you on that. You did an excellent job. It is thoroughly enjoyable to read.
I'm currently looking at a 4x 256 GB Samsung MLC on Solaris 10/ZFS for apps/OS (for PXE boot), and this does a lot of the testing; but I would be interested to see how it would handle more server-type workloads.
korbendallas - Wednesday, March 18, 2009
If the implementation of the Trim command is as you described here, it would actually kind of suck.
"The third step was deleting the original 4KB text file. Since our drive now supports TRIM, when this deletion request comes down the drive will actually read the entire block, remove the first LBA and write the new block back to the flash:"
First of all, it would create a new phenomenon called Erase Amplification. This would negatively impact the lifetime of a drive.
Secondly, you now have worse delete performance.
Basically, an SSD 4kB block can be in 3 different states: erased, data, garbage. A block enters the garbage state when a block is "overwritten" or the Trim command marks the contents as invalid.
The way I would imagine it working, marking block content as invalid is all the Trim command does.
Instead, the drive would spend idle time finding the 512kB pages with the most garbage blocks. Once such a page is found, all the data blocks from that page would be copied to another page, and the page would be erased. Doing it this way maximizes the number of garbage blocks being converted to erased.
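The scheme described in this comment can be sketched as a toy model. This is an illustration under stated assumptions, not how any real controller works: the `Drive` class and its method names are hypothetical, the 4kB units and 512kB erase units follow the sizes used in the comment, and logical-to-physical address mapping is left out entirely.

```python
from enum import Enum

class State(Enum):
    ERASED = 0
    DATA = 1
    GARBAGE = 2

UNITS_PER_ERASE_UNIT = 128  # 512kB erase unit / 4kB unit

class Drive:
    def __init__(self, n_erase_units):
        self.units = [[State.ERASED] * UNITS_PER_ERASE_UNIT
                      for _ in range(n_erase_units)]

    def write(self, eu, idx):
        """Program one erased 4kB unit with data."""
        if self.units[eu][idx] is not State.ERASED:
            raise ValueError("can only write to an erased unit")
        self.units[eu][idx] = State.DATA

    def trim(self, eu, idx):
        """Trim only marks contents invalid; nothing is erased here."""
        if self.units[eu][idx] is State.DATA:
            self.units[eu][idx] = State.GARBAGE

    def collect_once(self):
        """Idle-time pass: reclaim the erase unit holding the most garbage.

        Returns the number of garbage units converted to erased."""
        victim = max(range(len(self.units)),
                     key=lambda i: self.units[i].count(State.GARBAGE))
        garbage = self.units[victim].count(State.GARBAGE)
        if garbage == 0:
            return 0
        live = self.units[victim].count(State.DATA)
        # Erased slots elsewhere that can receive the still-valid data.
        spare = [(i, j) for i, unit in enumerate(self.units) if i != victim
                 for j, s in enumerate(unit) if s is State.ERASED]
        if len(spare) < live:
            raise RuntimeError("not enough erased space to relocate data")
        for i, j in spare[:live]:
            self.units[i][j] = State.DATA
        self.units[victim] = [State.ERASED] * UNITS_PER_ERASE_UNIT
        return garbage

drive = Drive(3)
for idx in range(10):
    drive.write(0, idx)
for idx in range(4):
    drive.write(1, idx)
for idx in range(8):
    drive.trim(0, idx)   # 8 garbage units in erase unit 0
drive.trim(1, 0)         # 1 garbage unit in erase unit 1
reclaimed = drive.collect_once()
print(reclaimed)
```

In this run the collector picks erase unit 0, copies its two live units elsewhere, and performs a single erase. Picking the erase unit with the most garbage is what keeps the amount of copied data per erase low.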
alpha754293 - Wednesday, March 18, 2009
BTW...you might be able to simulate the drive as well using Cygwin where you go to the drive and run the following:
$ dd if=/dev/random of=testfile bs=1024k count=76288
I'm sure that you can come up with fancier shell scripts and stuff that uses the random number generator for the offsets (and if you really want it to work well, partition it so that when it does it, it takes up the entire initial 74.5 GB partition, and when you're done "dirtying" the data using dd and offset in a random pattern, grow the partition to take up the entire disk again.)
Just as a suggestion for future reference.
I use parts of that to some (varying) degree for when I do my file/disk I/O subsystem tests.
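The "fancier script" idea in this comment might look something like the sketch below. It is an assumption-laden illustration rather than the procedure used in the article: the function names are hypothetical, it writes through the file system (so it cannot control where data physically lands on the drive), and the demo uses a 1 MiB file instead of the 74.5 GiB that the dd command's 76288 blocks of 1024k add up to.

```python
import os
import random
import tempfile

def fill_file(path, total_bytes, chunk=1024 * 1024):
    """Sequentially fill a file with random data, 1 MiB at a time."""
    with open(path, "wb") as f:
        remaining = total_bytes
        while remaining > 0:
            n = min(chunk, remaining)
            f.write(os.urandom(n))
            remaining -= n

def dirty_random(path, writes, io_size=4096, seed=None):
    """Overwrite `writes` aligned io_size chunks at random offsets in the file."""
    rng = random.Random(seed)
    slots = os.path.getsize(path) // io_size
    with open(path, "r+b") as f:
        for _ in range(writes):
            f.seek(rng.randrange(slots) * io_size)
            f.write(os.urandom(io_size))
        f.flush()
        os.fsync(f.fileno())  # ask the OS to push the writes to the device

# Small demo in a temporary directory; scale total_bytes up for a real run.
with tempfile.TemporaryDirectory() as tmp:
    test_path = os.path.join(tmp, "testfile")
    fill_file(test_path, 1024 * 1024)
    size_before = os.path.getsize(test_path)
    dirty_random(test_path, writes=64, seed=1)
    size_after = os.path.getsize(test_path)

print(size_before, size_after)
```

Because the random writes land inside the existing file, its size does not change; only its contents are "dirtied" with small scattered overwrites.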
nubie - Wednesday, March 18, 2009
I should think that most "performance" laptops will come with a Vertex drive in the near future.
Finally a performance SSD that comes near mainstream pricing.
Things are looking up, if more manufacturers get their heads out of the sand we should see prices drop as competition finally starts breeding excellence.