Pat Gelsinger from Intel shared at the IDF today that besides the 47 SSE4 instructions that will arrive with Penryn, the Nehalem processor will feature 7 additional SSE4 instructions:
“With Nehalem, we're delivering seven more new instructions that are part of it,” said Gelsinger. Application-specific instructions like POP counts and CRC-32, also new streaming instructions, specifically for workloads like XML. This is our first published data on that, where we see enormous reductions in the instruction count and over 3x performance improvement on XML-like workloads.”
One of the new optimisations will allow for 256 simultaneous compares in one instruction. Gelsinger said that this kind of improvement is what makes his heart beat and given the potential three fold performance increase, he’s understandably really excited by the potential of new instructions like this.
Otellini also showed off some more slides about QuickPath and the Nehalem project manager, Jim Brayton, also came on stage. Brayton said Nehalem will be real fast - but he wasn't at liberty to discuss the performance yet. More info at Bit Tech.