HPC

Home » HPC

Repost – Most Cloud Compute cycles will be HPC

in HPC by ronvanholst

I posted this at last year’s ComputerWorld Canada Blogging Idol contest, although a controversial position, it’s worth discussion:

A few months ago, I was chatting with a friend on the phone about HPC when he made the statement which is the title of this post.  I jotted it down, and it’s been rattling around in my brain ever since.  At first it seemed very odd, HPC is a neiche market, a small proportion of the overall IT market.  The more research I did the more it made sense, especially after I read a book called “The Fourth Paradigm: Data-Intensive Scientific Discovery”, free for download from Microsoft Research.  It’s not the typical marketing piece, but an excellent read if you enjoy Science and IT.  It speaks about four paradigms of Scientific enquiry and then goes on to expand on the new discipline of data-intensive computing.  The four paradigms of Scientific persuit are:

  1. empirical – describing natural phenomena, performing experiments
  2. theoretical – models, generalizations, developing ‘laws’ of science
  3. computational – simulation of complex phenomenon
  4. data exploration – eScience, discoveries are to be found in searching huge volumes of captured data from devices recording natural phenomena, experiments generating new results and simulations modelling problems in levels of detail beyond that acheiveable in paradigm 3.

In last year’s blog idol contest, I put some focus on the technology adoption curve and where I thought virtualization technology fell for 5 different classes of computing; MainframesServersDesktop,Embedded and High Performance Computing .  In the final category, HPC, I concluded that virtualization technology was in the earliest stages of adoption.  Compute virtualization is a pre-requisite technology to cloud computing (some might consider shared HPC resources using a batch scheduler without virtualization as cloud computing, but I think that’s stretching the definition of cloud too far).  But my experience in HPC to that point had only been up to paradigm 3.

Paradigm 4 is a poster child for cloud computing, and makes the case of HPC dominating cloud compute cycles an obvious conclusion the more you think about it.  HPC users are a small community, traditionally in the scientific and engineering disciplines.  With the advent of data-intensive science, this small community will consume a disproportionate amount of the computing capacity available in the world.  Not only that, but the HPC community is growing beyond science and engineering.  Business analytics are making their way into HPC to accelerate time to solution, exploring mountains of structured and unstructured data for patterns that provide intelligence towards competitive advantage.  I’ve also read about the potential for HPC in the Humanities.

We’ve got big data from science, engineering and now business too; open data from governments world wide as well as organizations like the World Bank.  So much data and so little time - HPC enables us to look at the data in different ways with thousands of scenarios, perhaps millions of scenarios being tested concurrently.

Is ‘Cloud Computing’ HPC?

in HPC by ronvanholst

Is ‘Cloud Computing’ = HPC?

That depends on who you talk to.

According to the U.S. National Institute of Standards and Technology who have published the official definition for cloud computing, cloud computing uses loosely coupled resources and HPC uses tightly coupled resources.  Thus in this case, cloud computing ≠ HPC.

Some cloud service providers have begun to offer compute resources with HPC attributes, so in this case cloud = HPC.

But then some HPC programs are designed to take advantage of large numbers of loosely coupled processes, so in this case cloud = HPC.

HPC is about performance and scalability. Cloud computing is about service delivery and scalability.  They are complementary rather than competing technologies.

What are Supercomputers used for?

in HPC, Innovation, Public ICT policy by ronvanholst

So if I expect people to vote for my proposal of a National Canadian Supercomputer centre, I should provide some examples of what we would do with such a machine.  Certainly not a comprehensive list, but a sampling of some interesting articles that I have read recently.

Health Sciences

Ivanov_cycleFig2We all hope technology will bring us breakthroughs in the prevention and treatment of disease.  Supercomputing lends a hand in many ways.  Here is an example of some basic research in the human proteome enabled by a supercomputer shared by universities in Ottawa and Kingston.  This type of basic research also known as bioinformatics, will accelerate the discovery on new cures.  There’s a small Canadian company, that is developing GPU based solutions for bioinformatics.  Genome research, which has similarities to proteome research, is being used to find cures for cancer.  Canada is participating in a global effort to share genome based cancer research.  This is an interesting article on the simulation of DNA repair mechanisms.  Gene research becomes a data mining exercise.

Product Design

Batteries are a foundational technology for many products and of special interest in new automobile design; this is a nice write up on how supercomputers are being used for better battery design.

Oil & Gas

The oil and gas industry uses supercomputer simulation for exploration (processing seismic data) and production (reservoir simulation), but this story talks about a new computer at the University of Regina that is being used for research into greener methods for petroleum processing, even the manufacturer of the machine is Canadian.

Space

ACT_TelescopeAlthough supercomputers are usually thought of for solving computationally intensive algorithms, this article talks about the processing needs of massive amounts of data for space research.  Canada also uses supercomputers for processing telescope data.

Environment

Weather simulation has been run on supercomputers for decades, but more recently, climate simulation is another growing area for supercomputer research.  Canada should play a leading role in modeling the changes in arctic climate.  This article makes reference to chemistry research using high performance computing for CO2 capture, and important component of climate modelling.  There is an interesting animation of an ocean current simulation that is attempting to predict the spread of the BP oil spill.  It has been suggested that disaster scenarios like this should be simulated as part of the due diligence for all new ocean oil drilling platforms.  With all the interest in drilling in the arctic, it would be in Canada’s best interest to not only model disaster scenarios for Canadian rigs in the arctic, but also the oil drilling sites of our arctic neighbours as well.

Canadian contributions to Supercomputing technology

Canada’s biggest supercomputer belongs to the SciNet consortium. Although this CBC article written last year states that it would be in the 15 machines in the world, it actually ranked 22nd when it first came online, and when the list was updated last month, it dropped to 28th.

I wrote a blog post on an innovative supercomputer built in Quebec.  Although this is a truly innovative supercomputer design, it is currently only ranked 72nd.

This article mentions Canada’s contribution to a multi-national effort to develop exascale computing software, although it is not specific on what Canadian institutions are participating in this development.

Although a CATA article from last year, it paints a sad picture for companies that develop new technology in Canada; the case in point here is a Canadian company that was developing a supercomputer system, of which I have written about in another post.  This company did not survive, but I wonder if it would have if it could have had a GoC reference account a year earlier than it did.

Back to the Future

in HPC, Innovation, Public ICT policy, social media by ronvanholst

I’m oDoc Brown Pours Beer into Mr. Fusion in Back to the Futureverdue for a blog post.  I’ve been very busy following up on some interesting opportunities and have also been “making my mark” with comments on a new Canadian blog based news service. This is a great site for Canadians wishing to make their mark by commenting on posts relevant to their expertise. One of my comments got a bit long and I noticed the posting lost my paragraph breaks, so I thought I would edit it a bit and elaborate further for my blog.

Remember “Mr. Fusion” from the movie “Back to the Future”?  Well the promise of cheap limitless energy from nuclear fusion has been “almost here” for a long time.

I’ve been reading articles on nuclear fusion since I was in high school.  In hind site it’s pretty funny, a couple of nerdy high school kids talking about nuclear fusion after reading about it in a Popular Science magazine they found in the library (PopSci have made their archives available on line BTW, so cool).  Thirty years later, I can’t honestly say that I know much more about nuclear fusion, but an article on the subject is just as temping now as then, so when I saw one in the Mark, I was hoping to discover a Canadian connection.  Failing to find one, I provided one of my own in my comments recaptured below:

Check out the National Ignition Facility home page, it has a great video of how this fusion reaction will be achieved.

I was hoping to see a Canadian angle in the above article on the subject of nuclear fusion. I don’ t know of any, but I’ll tell you about a Canadian connection that might have been.

So how is such a system designed with any confidence that it will work? More importantly, how can they be sure the reaction will not get out of control venting a nuclear cloud over suburban Livermore, California? The answer is supercomputing (often called High Performance Computing or HPC).

The Livermore facility is home to some of the largest supercomputers ever built by the US Department of Energy. When nuclear testing was banned, facilities like the Lawrence Livermore National Laboratory were built to turn the task of nuclear research over to computer simulation. To accurately simulate nuclear reactions in a computer requires machines with thousands of identical processors running in perfect synchronization. In March 2005 LLNL built Blue Gene/L with IBM the first supercomputer to exceed 100 TeraFLOPs of performance. But at LLNL, as soon as one machine is built, the plans for the next one begin.

A small Canadian start-up company from Ottawa was invited to propose a computer architecture for a PetaFLOP machine. This proposal was delivered along with a small (refrigerator sized) prototype supercomputer built in Kanata, Ontario by former Nortel engineers. Although the prototype machine was installed at LLNL, and early prototypes achieved many of the performance targets set by LLNL, a system was not purchased. That’s the Canadian connection to nuclear fusion that might have been.

Unfortunately the dream of a Canadian designed and built supercomputer died with the start-up that built them a few weeks ago when they ran out of funds. This company was able to sell a few systems for non-supercomputing tasks, but it was never able to win a supercomputing sale. I often wonder if the Canadian government had purchased a supercomputer from this little company, if it would have been enough to bring it to self-sustainability. Unfortunately our government does not have procurement vehicles to help along start-ups like this. Although Canada may not be directly involved in solving the science and engineering of nuclear fusion, there are other such “grand challenge” problems that are being solved by supercomputing technology.

Canada can have a role if it chooses to invest in supercomputing technology. We toss a bit of money for universities to build some nice systems to support academic research, but there is no national strategy to advance supercomputing technology for industry in Canada. It should be a component of Canada’s Digital Economy Strategy, it is for practically all other advanced nations. Maybe energy comes too easy for Canada, with clean hydro, lots of fossil fuels, and lots of uranium; the relative comfort of the present doesn’t impel us to invest as strongly in the future as we should. “Mr. Fusion” will not likely be invented in Canada, but if some of the new game changing technologies of the future are not invented here, Canada will cease to be one of the best places in the world to live.

Virtualization technology adoption in different computing classes

in HPC, Innovation by ronvanholst

I have expanding the content below into 5 posts for ComputerWorld Canada’s Blogging Idol competition, if you’re interested in this content, you will find my analysis there more thorough; part 1 introduction and mainframes, part 2 servers, part 3 desktop (where I got some good comments), part 4 embedded, and last but not least part 5 supercomputing or HPC.  I have spent some extra energy on this subject, as I have not seen anything like this searching the internet.

Virtualization on commodity computers is reaching market maturity, even though it’s an old concept borrowed from the days when mainframes ruled the computing landscape.  Today we have desktop virtualization, application virtualization, network virtualization (think VLANS), storage virtualization and so on.  What most people are talking about is server virtualization, that is, carving up one physical computer into a larger number of “virtual machines” (VM) providing what appears to be the resources of a computer to a user or an application, but doing this as many times as the computer can handle.  Four VMs to a computational unit (CPU – actually central processing unit) is a common ratio, but that ratio depends on a lot of things.  Today, processor chips have multiple CPUs (called multi-core), and business class computers often have multiple processor chips, so you could easily run 10-20 VMs with-out overloading a computer that costs less than a few thousand dollars today.  This is what has made made virtualization on COTS (Commercial Off The Shelf) computers such a strong value proposition in the last few years.  Some of the key value propositions are:

  1. Server Consolidation:  Most enterprises used to deploy one business application per compute server, because in many cases, relative to the cost of the software and the business value it brought, the cost of the computer was very small.  This led to computers in corporate computer rooms multiplying like rabbits.  Studies showed that most of these computers were used at less than 20% of their capacity; the operating costs of managing, housing and powering these systems far out-weighed the capital costs of purchasing the machines; so if you owned more that a handful, there was an opportunity for serious savings.  The Green IT movement is all over this as 8 machines running at 20% consume a lot more electricity than 2 machines running at 80%.
  2. Isolation:  Software inevitably crashes, whether through internal bugs or external viruses, and an application running all by itself in a VM should not crash its host computer or other VMs running there.
  3. Protection: Since a VM appears to the host computer as a simple file (encapsulating the application, operating system, data, configurations etc.), back-up is easy.  A study by VMware identified protection as a key value proposition for small and medium businesses.  Modern file systems and storage technologies can also largely automate this process with little or no impact to operational performance.  Thus when disaster strikes (some one spills their coffee on the hard drive, or your office gets hit by a tornado), that image just needs to be re-started some where else; the business process supported by that VM could be back up and running in as little as a few seconds, depending on how the business continuity plan was designed.
  4. Portability: An application is not tied to a specific computer.  When the computer becomes obsolete, it can be replaced with one that is more powerful or energy efficient.
  5. Heterogeneity:  Some applications run with only certain operating systems, or are only supported to certain versions, so on a single computer, you can run applications in Windows XP, Windows Vista, Red Hat Linux, etc.
  6. Rapid Deployment:  When business needs to deploy another IT service, a VM can be copied from library and be deployed in minutes.  Gone are the days when you had to wait days or even weeks for a server to be installed and deployed into service.
  7. Security:  VMs can be standardized and security updates can be more easily managed deployed.  Sensitive information can also be better managed.

Virtualization technology  has progressed along the technology adoption curve differently for different classes of computing.  VMs on mainframes are well along the “late majority” side of the market and that’s ancient history.  VMs on COTS are now well along the “early majority” side of the market driven there primarily by the first three value propositions noted above.

VMs in personal or desktop computing  appear to be earlier on the “early majority” curve than COTS;  I have heard that desktop virtualization is widely deployed to contractors and employees at the Government of Canada, and Fusion is popular among Apple Mac zealots who need to run Windows applications.  For the GoC, values 3, 6 and 7 make sense; and 5 is the value proposition for Mac users – having two machines in one.

ta4

 

There are two classes of computing for which virtualization is relatively new; embedded computing and high performance computing (HPC).  For now, let’s classify mobile computing (like smart phones) as a special case of embedded computing.

It seems to me that embedded computing would be in the “early adopter” phase.  There is less market maturity in this case as embedded computing have more recently adopted multi-core processors, embedded computing employs a much wider variety of processor types, CPUs are increasingly embedded in customer chips and applications are customized and tightly coupled to a specific target CPU architecture.  Since embedded processors have more recently adopted multi-core technology, consolidating embedded applications from multiple embedded processors onto a multi-core chip running VMs is a first step similar to the server consolidation value proposition that drove adoption of VMs on COTS.

Last but not least, my favorite computing class HPC. It’s still in the “innovators phase“, but should see a quick migration to the early adopter phase for commercial applications.  This is due to the value propositions to independent software vendors (ISVs) outlined in this blog post.  Improving deployment processes and reducing QA costs will help ISVs deploy better quality HPC applications at lower costs, but there will be many challenges to face.  If non-HPC applications have porting issues to virtualization, as discovered by the IT department at Sick Kids hospital in Toronto, HPC applications will have issues to a greater degree.  Most applications being deployed on VMs today are serial (having a single thread of execution), whereas HPC applications are almost entirely parallel, running many thread of execution simultaneously.  Whether parallelism is achieved through shared memory constructs or message passing libraries, this places greater challenges to VMs and the scheduling software that must optimally allocate the computational workload. Screenshot-1

 

I suspect it will take longer to advance along the adoption curve for “grand challenge” applications at large research facilities, as the performance and scalability requirements will be very challenging for virtualization to meet, and this is a small market segment.

For the newer players in the virtualization technology space, these should be interesting times, as this proven technology is re-applied to new challenges in different computing classes with unique requirements.

Gaming the Military?

in HPC by ronvanholst

Copied here is an impressive chart on the projected growth of HPC computational requirements in the global defense industry.  It looks like about an order of magnitude increase in demand for HPC cycles in the next three years; that’s definitely ahead of Moore’s Law.  Makes me wonder what the forecast is for Canada, I have been searching for data on HPC use in the Canadian forces, but haven’t found very much.habu

The point of the article is the emphasis on use of commodity Commercial Off The Shelf (COTS) components, to assemble large scale systems that are application specific – ie. the system is carefully tuned for one specific parallel program.  What is really interesting is the price/performance argument made for building an HPC cluster from Sony Playstations.  The cell processor in the PS3 is a very powerful compute engine, much more powerful that the AMD|Intel chip in your laptop.  Because of the manufacturing volume of the games market, you get a very high performance compute node for the fraction of the price of an equivalent performance compute server; and they don’t even need the bluray drive or hard drive, which would make the unit more energy efficient, less noisy and even cheaper.  That reminds me, the PS3 I bought the boys is really noisy, you’d think Sony could have done a better job with that; conductive cooling and a solid state drive would have been really cool (which by the way is also a requirement for military embedded systems).

Curious then that IBM has announced that there will be no further development of this amazing processor chip.  Curious as well that the machine we buy our kids to simulate war games, when configured as an HPC compute cluster, is a powerful tool to manage the real thing.

Finding your Voice

in HPC, social media by ronvanholst

8th2Well, Christmas is over, New Years is over and my birthday is over (oops that’s FaceBook content), so no more excuses for not publishing a new blog post.

Although my focus in this blog is HPC technology and how it can be used in Canada’s ICT strategy for innovation and competitiveness, I’d also like to begin commenting on the things that I am learning about Internet social media and its impact on personal branding and marketing.  On this track, I’d like to write some notes on one of Steven Covey’s books that I’m reading, “The 8th Habit, from Effectiveness to Greatness”.

Many years ago, at a company which used to be great, I took a course based on Dr. Covey’s highly acclaimed book, “The 7 Habits of Highly Effective People”.  The content resonated with me, as well as many of my colleagues.  Its a bit ironic that a company that made such a course easily available to its employees, became ineffective and failed so miserably, but that’s a bigger subject than I care to blog about.  I find that I am still applying what I learned, and that these principles for effectiveness have in fact become habits.  So when I saw “The 8th Habit, from Effectiveness to Greatness” at the local bookstore, I had to buy it.  I’m only part way into it, but enjoying it very much.

The connection to personal branding is that the 8th ‘habit’ is not really an additional habit, but adds a third dimension to the 7 habits framework, that is “Find Your Voice and Inspire Others to Find Theirs”.  It seems to me that is what WhyHire.me is also aiming to achieve.  Covey opens this subject with a threefold pain/problem/solution introduction.  In short: The pain in the workplace is that most people operate in an environment where their yearning for greatness is unfulfilled.  The problem is that most workplaces still have an industrial age mindset of people as resources that need to be managed.  The solution is to find your voice, and exercise your power to choose a course that unleashes your creativity.

He also emphasizes in the introduction that you learn the most when you turn around and teach others what you are learning; hence this blog post.  My goal is to establish my name, Ron Van Holst, the voice for High Performance Computing in Canada.

Green supercomputer innovation in Quebec

in HPC by ronvanholst

QC_silo_scaled

Update – unfortunately the Sun Microsystems links are broken, but the link to the other blog post is still good.

Sun Microsystems has a nice write up on the new high performance computer system they built at the University of Laval for the CLUMEQ consortium.  There is also a nicely done interactive video tour presenting a clever re-use of an obsolete cyclotron silo for a server room. Not only is a landmark building preserved, but it actually turns out to be an efficient way to build a data center or supercomputer. This blog post has a nice collection of graphics to illustrate the new system.

The “cylindrical machine” has three floors of equipment racks, each configured in a circle, with the cooling infrastructure in the basement. This allows for highly optimized air flow, which they modelled with Computational Fluid Dynamics (CFD) simulations. The “hot aisle” being the centre of the building. One of the speakers on the video shows alot of enthusiasm for the building itself being the CRAC (Computer Room Air Conditioner).  For most of the year the system uses “free air”, or outside air cooling.  Although there are cold water chillers, I’m assuming that this is a simple heat pump configuration, as the coils are reversed in the winter to transfer waste heat from the system into other buildings on campus to further improve their Power Usage Efficiency (PUE).  It is said that the entire system “breathes naturally”, and also incorporates Intel’s Intelligent Power technology, so that power consumption can be scaled to compute load. The high efficiency fans also dynamically adapt to cooling needs, and there is enough heat buffer in the facility to operate safely even if the chillers fail.

From a networking perspective, with three circles of equipment racks on three floors, and the networking equipment placed on the second floor, the 75 racks of equipment to all be connected with cables no longer than 10m, something that could never be acheived in a standard server room.  Not only does this save cabling costs, but should save some power by minimizing the optical power budget (I’m assuming optical as 10m is a stretch for copper at QDR Infiniband rates).  The cluster networking is configured as a fat tree with full bisectional bandwidth. This means that the bandwidth between any two servers is constant, thus there are no bandwidth bottlenecks anywhere in the system. This is an important attribute of a high performance cluster, as any imbalance in the system will directly impact “time to solution”. That’s quite an accomplishment for a cluster with 960 compute nodes.

A cheap place to build a big data centre?

in HPC, Public ICT policy by ronvanholst

There has been much talk about Iceland as a cost effective place to build large data centres, due to the access of cheap and reliable electricity, cool air, and an economy eager for new business opportunities.  A limiting factor would be fibre communications, some undersea cables have been built to Europe, but this is a limited resource.

Why not make Canada a destination for such large systems?  We have a better fibre infrastructure and close proximity to the world’s largest consumer economy.  Americans don’t have to be sold on the availability of cold air, as their meteorologists frequently remind them of this.  As for cheap power, although I’m not an expert, I would expect there are places where it can be had reliably and economically, perhaps even places where we have a surplus.  I’m thinking of the Chalk River facility up the Ottawa valley, where we have nuclear power from test reactors, but probably not much in the way of local demand for power (especially in an area where hydro electricity is easy to exploit).  You also should have infrastructure and expertise to support advanced technology.  Although our economy probably hasn’t been hit as hard as Iceland’s proportionately, as a nation we do need to make investments in forward looking businesses like better data centres and advanced computing facilities which improve Canadian competitiveness globally.

Update:

I found another good article on data centre location. It mentions several things like community incentives, local construction costs, cost of electricity, but also the need to avoid areas like airport flight paths, and proximity to volatile industries such as an explosives manufacturing site.  I suppose a nuclear test reactor wouldn’t be viewed so favourably for that type of analysis.

SSD, having your cake and eating it too!

in HPC by ronvanholst

ssd

It is fascinating that a technology originally developed for performance and utilized for that purpose in the planned Gordon supercomputer currently being built at the San Diego supercomputer center, is being used as a power saving technology. So we can have faster performance and save energy at the same time; it sounds almost too good to be true!  This article shows how MySpace uses FusionIO solid state drives that mount in a PCIe slot, not primarily for performance, but as a power saving strategy.  In this case, the higher cost of solid state storage must be offset by the savings in electricity. With this technology increasing in density and decreasing in price, that intersection was inevitable. The figure given is that the power consumed by solid state storage is 1% of an equivalent solution using spinning disks; keep in mind that this is more that the elimination of the spinning disk power consumption, but also the power cost of cooling, which in many cases can be roughly equal to the cost of powering the servers.  Makes you wonder if this has the potential to be a threat to the shared storage industry; shared storage provided data centres with better performance and reliability, as well as an easier method for sharing data and having a common back up scheme.  If local solid state storage provides a better solution for performance, reliability and scalability than shared storage while saving a lot of electricity, then this sounds like a game changer for the storage industry, if you can find a solution for sharing backing up data.  You’d think this would make for a rather complex file system requirements, as you would need to share local disk data across all servers. Although an older article, it seems that MySpace has already created a home grown distributed file system solution for that purpose a while back.  Maybe it’s time to sell stock in network storage companies, and move that money into solid state storage companies, as if solid state storage wasn’t a disruptive technology already.

Page 1 of 212