PGI News

The latest news on The Portland Group and PGI products.

18.11.14 PGI High Performance Computing Compilers Coming to IBM POWER Systems
14.2.14 PGI 2014 Brings New Level of Performance, Programming Simplicity to High Performance Computing
16.7.13 PGI Accelerator Compilers Add Support for AMD APUs and GPUs
13.2.13 The Portland Group Ships Major HPC Compilers and Development Tools Update
13.11.12 PGI Accelerator Compilers to Support New NVIDIA Tesla K20 Family of GPU Accelerators
13.11.12 PGI to Deliver OpenACC for Intel Xeon Phi
31.10.12 PGI and AMD Collaborate on APU Compilers to the Benefit of HPC Programmers
25.10.12 SGI to Resell PGI High-performance Compilers
19.6.12 PGI Ships PGI Accelerator Compilers with OpenACC
22.3.12 First PGI Compilers with Support for OpenACC Now Available
5.3.12 New Release of HPC Compilers and Development Tools Now Available from The Portland Group
14.12.11 The Portland Group Announces Availability of CUDA C/C++ Compiler for Multi-core x86
15.11.11 nCore and The Portland Group Team to Produce Training for Programming GPU Accelerators
26.10.11 The Portland Group Adds Support for AMD's "Bulldozer" Architecture
16.06.11 The Portland Group Updates Compilers to Deliver NVIDIA CUDA Architecture for x86 Platforms
17.02.11 The Portland Group Announces PGI 2011 HPC Compilers and Development Tools
21.09.10 PGI to Develop Compiler Based on NVIDIA CUDA C Architecture for x86 Platforms
20.07.10 The Portland Group Releases PGI Visual Fortran for Visual Studio 2010
20.05.10 Gaussian 09 Software Ported to 64-bit Mac OS X Using PGI Compilers
11.05.10 PGI Compilers Now Available on the Cray CX Line of Supercomputers
6.05.10 PGI Accelerator Compilers Enhanced for NVIDIA GPUs Based on New Fermi Architecture
17.11.09 PGI CUDA Fortran Now Available from The Portland Group
12.11.09 PGI Release 2010 Enables Scientists and Engineers to Harness the Computational Power of GPUs
5.08.09 PGI Visual Fortran 9.0 Adds Support for MPI Debugging on Windows HPC Server 2008 Clusters
23.06.09 PGI and NVIDIA Team To Deliver CUDA Fortran Compiler
23.06.09 New PGI 9.0 Compilers Simplify x64+GPU Programming

PGI High Performance Computing Compilers Coming to IBM POWER Systems

Santa Clara, Calif.
November 18, 2014

Optimizing Compilers Enable Developers to Easily Develop and Migrate Linux x86 Applications to GPU-Accelerated POWER Systems

NVIDIA today announced that it is developing an enhanced version of the widely used PGI optimizing compilers which will allow developers to quickly develop new applications or run Linux x86-based GPU-accelerated applications on IBM POWER CPU systems with minimal effort.

The PGI optimizing Fortran, C and C++ compilers for POWER will provide a user interface, language features, parallel programming features and optimization capabilities that are identical to those available on PGI Linux and x86 compilers.

The new compilers also will support high performance computing (HPC) systems based on the IBM POWER architecture, including the recently announced NVIDIA GPU-accelerated IBM POWER8 systems, and additional systems under development by members of the OpenPOWER Foundation.

"“Our goal is to let HPC developers recompile and run their applications on all major CPU and GPU-accelerated platforms with uniformly high performance using a common source code base," said Douglas Miles, director of PGI Compilers & Tools at NVIDIA. "We expect most GPU-accelerated x86 applications currently built with PGI compilers will port to GPU-accelerated POWER systems with a simple re-compile."

The POWER8 CPU is a massively multi-threaded processor, featuring 12 cores each capable of handling eight hardware threads simultaneously. Originally positioned for big data and cloud server applications, the POWER8 architecture is generating strong demand from HPC customers given its many performance-oriented features, such as a high-bandwidth CAPI port (Coherent Accelerator Processor Interface) and future support for the NVLink high-speed GPU interconnect.

"Porting and optimizing production HPC applications from one platform to another can be one of the most significant costs in the adoption of breakthrough hardware technologies," said Buddy Bland, project director of the Oak Ridge Leadership Computing Facility at Oak Ridge National Laboratory. "The PGI compiler has been our primary compiler on Jaguar and Titan since 2005. Having the PGI compiler suite available in the POWER environment will provide continuity and facilitate code portability of existing CPU-only and GPU-enabled Titan applications to our next major system."

"IBM’s Linux and x86 HPC customers have long had the luxury of leveraging the best capabilities and features from multiple HPC compiler solutions," said Dave Turek, vice president, Technical Computing OpenPOWER at IBM. "With the availability of PGI compilers alongside the widely used IBM XL optimizing compilers for POWER8, our customers will now have this same flexibility and advantage on current and next-generation IBM POWER System platforms as well."

Key features of the PGI compilers and tools for IBM POWER-based systems will include:

  • OpenACC directives for accelerators – Comprehensive support for OpenACC features in the PGI Accelerator native Fortran 2003, C11 and C++11 compilers on the latest generation of GPU accelerators from NVIDIA, including support for unified memory.
  • PGI CUDA Fortran extensions – Feature parity with CUDA Fortran on Linux/x86 platforms, offering the flexibility and power of the NVIDIA® CUDA® programming model in a native Fortran compiler for GPU-accelerated POWER systems.
  • Faster OpenMP performance – PGI compilers deliver an average of 75 percent faster performance on the latest SPEC© OMP2012 benchmark suite, compared to GCC 4[1].8 using the latest AVX-enabled multi-core x64 processors from Intel and AMD.
  • PGI optimization features – Fortran 2003, C11 and C++11 compilers with the full range of PGI multi-core optimizations including comprehensive loop optimizations, memory hierarchy optimizations, SIMD vectorization, function inlining, inter-procedural analysis and optimization, profile feedback and more.

For a complete list of the features and capabilities of PGI compilers and tools, visit http://www.pgroup.com/support/new_rel.htm.

Availability and Free Trial

NVIDIA will announce the availability of POWER support in the PGI compilers at a future date. PGI 2014 with x86 support is available today directly from NVIDIA and authorized resellers. New users can register for a free 30-day trial of PGI 2014 at www.pgroup.com.

Media Contact
George Millington
NVIDIA Corporation
(408) 562-7226
gmillington@nvidia.com

PGI Accelerator Compilers Add Support for AMD APUs and GPUs

Santa Clara, Calif.
February 14, 2014

New Release Adds OpenACC 2.0 Features for NVIDIA and AMD GPU Accelerators, Delivers Multi-core x64 Performance Gains

PGI, a leading suite of high-performance parallelizing compilers and development tools, now features support for the latest version of the OpenACC programming standard on accelerator platforms.

Available today, PGI® 2014 Compilers and Tools includes new capabilities for programming the recently announced NVIDIA® Tesla® K40 GPU accelerators using version 2.0 features of the OpenACC directives-based parallel programming specification. It also provides, for the first time, OpenACC support for AMD Radeon GPUs and APUs.

"We applaud PGI’s ability to extract performance from AMD discrete GPUs and APUs using OpenACC," said Suresh Gopalakrishnan, corporate vice president and general manager of the Server business at AMD. "It will help break down the remaining barriers to wide-scale accelerator adoption, and decouple the choice of accelerator programming model from the choice of accelerator hardware."

Key features of PGI 2014 Compilers and Tools include:

  • OpenACC 2.0 Features—PGI Accelerator native Fortran 2003, C99 and C++ compilers expand support for key OpenACC 2.0 features, including routine directive (procedure calls in accelerator regions), unstructured data lifetimes and others.
  • New NVIDIA® CUDA® Fortran Extensions—Add support for version 5.5 of the NVIDIA CUDA parallel programming platform, CUDA atomic functions and device-side debugging using Allinea DTT and TotalView from Rogue Wave.
  • Free PGI for OS X—Fortran 2003 and C99 compilers with all PGI multi-core x64 optimizations, command-line debugging and streamlined online documentation.

PGI 2014 compilers deliver an average of 75 percent faster performance on the latest SPEC® OMP2012 benchmark suite, compared to GCC using the latest AVX-enabled multi-core Intel and AMD x64 processors. Additional capabilities of PGI 2014 Compilers and Tools include full Fortran 2003 support, incremental Fortran 2008 features, updated libraries, support for the latest operating systems and a comprehensive suite of new and updated code examples and tutorials.

For a complete list of the features and capabilities of PGI 2014 Compilers and Tools, visit http://www.pgroup.com/support/new_rel.htm.

"The use of accelerators in high performance computing is now mainstream," said Douglas Miles, director of PGI Software at NVIDIA. "With PGI 2014, we are taking another big step toward our goal of providing platform-independent, multi-core and accelerator programming tools that deliver outstanding performance on multiple platforms without the need for extensive, device-specific tuning."

PGI 2014 is available today directly from NVIDIA and authorized resellers. A free 30-day trial of PGI 2014 is available for new users at www.pgroup.com. Registration is required.

Media Contact
George Millington
NVIDIA Corporation
(408) 562-7226
gmillington@nvidia.com

PGI Accelerator Compilers Add Support for AMD APUs and GPUs

Portland, Ore.
July 16, 2013

PGI Beta release supports OpenACC directive-based accelerator programming for AMD APUs and discrete GPUs

The Portland Group® (PGI), a wholly-owned subsidiary of STMicroelectronics and the leading independent supplier of compilers and tools for high-performance computing (HPC), today announced availability of a Beta release of the PGI Accelerator™ Fortran, C and C++ compilers with support for the OpenACC® API targeting AMD Accelerated Processing Units (APUs) and discrete Graphics Processing Units (dGPUs).

"One of PGI’s goals is to increase productivity and provide performance portability for applications developed and maintained by science and engineering domain experts," said Douglas Miles, Director of The Portland Group. "The OpenACC standard was developed in direct response to the HPC community’s interest for a vendor-neutral, platform-independent, directive-based accelerator programming model. Adding PGI Accelerator support for AMD APUs and GPUs is the latest step in the evolution of OpenACC and compiler technology for heterogeneous parallel computing at PGI."

Unveiled in November 2011, the OpenACC API was developed by PGI, Cray, and NVIDIA, with support from CAPS Entreprise. OpenACC is already supported by PGI compilers on NVIDIA© GPUs with the CUDA© parallel-programming architecture. The OpenACC 1.0 specification was developed in cooperation by the founding members and is based on the PGI Accelerator programming model. The OpenACC 2.0 specification has just recently been ratified.

Directive-based Programming

The OpenACC Application Programming Interface (API) describes a collection of compiler directives to specify loops and regions of code in standard C, C++ and Fortran. These regions can be offloaded from a host CPU to an attached accelerator, providing portability across operating systems, host CPUs and accelerators. By exposing parallelism to the compiler, directives allow the compiler to do the detailed work of mapping the computation onto the accelerator to deliver significant improvements to application performance. Using directives, developers can have a single code base that is multi-platform and multi-vendor compatible, a key advantage for multi-platform and multi-generation application development.

In a recent poll of over 1200 OpenACC evaluators, over 70% of the respondents found OpenACC easy to use and more than 75% reported seeing a speed-up when running on an accelerator.

"AMD is very pleased with the beta release of the PGI Accelerator Fortran, C and C++ compilers with support for the OpenACC API that targets AMD APUs and discrete GPUs," said Margaret Lewis, director, server software planning at AMD. " OpenACC is being adopted by HPC researchers and programmers as they look for easier ways to take advantage of the benefits of accelerated computing. OpenACC provides a straight forward means for programmers to accelerate their applications using familiar programming techniques. It also provides a path for legacy applications to maintain code portability and still take advantage of the newest high-performance heterogeneous parallel computing architectures."

The first Beta release of the PGI Accelerator compilers with support for the OpenACC standard on AMD dGPU and APU platforms is available now on a limited basis, with an open Beta release currently scheduled for later in 2013. Interested Beta testers can request access by contacting PGI directly at beta@pgroup.com. The Beta software includes a restricted-use license and the license agreement is available at www.pgroup.com/support/BTLA.

More information on the PGI Accelerator compilers with OpenACC support is available at www.pgroup.com/accel. More information on the OpenACC API and standard can be found at www.openacc.org.

Media Contact
Patrick Brooks
The Portland Group
(503) 628-2806

The Portland Group Ships Major HPC Compilers and Development Tools Update

Portland, Ore.
February 13, 2013

PGI 2013 delivers expanded support for programming HPC accelerators plus industry-leading multi-core x64 performance

The Portland Group® (PGI), a wholly-owned subsidiary of STMicroelectronics and the leading independent supplier of compilers and tools for high-performance computing, today announced that the 2013 release of the PGI® high-performance parallelizing compilers and development tools for Linux, Apple OS X and Microsoft Windows is now available. PGI 2013 includes new features and capabilities for programming the latest HPC accelerators using the OpenACC API. It also delivers significant performance gains on multi-core x64 processors.

"The high-performance computing landscape is evolving rapidly. With the recent introduction of new accelerators from NVIDIA, Intel and AMD, HPC users have more options than ever," said Douglas Miles, director of The Portland Group. "With PGI 2013, we are expanding support within our PGI Accelerator programming tools so developers wishing to access the huge potential performance of these new platforms can do so in a consistent, productive and portable way."

The 2013 release of the PGI Accelerator™ native Fortran 2003 and C99 compilers expands support for the OpenACC directive-based accelerator programming model through the addition of an all-new PGI Accelerator C++ compiler. All three compilers feature expanded support for the OpenACC standard as well as new PGI extensions for supporting multiple devices. PGI Accelerator compilers also now target the latest NVIDIA Tesla K20 and K20X GPUs. Support for targeting Intel Xeon Phi coprocessors and AMD APUs and discrete GPUs with OpenACC is planned for a future release. New CUDA Fortran extensions in PGI 2013 include support for textures as well as support for dynamic parallelism and separate compilation on suitable CUDA-enabled hardware. Both PGI Accelerator and CUDA Fortran now support the latest CUDA 5.0 software environment from NVIDIA in addition to supporting multiple devices from a single program or host thread.

In addition to expanded support for accelerators, PGI 2013 also delivers significantly faster performance on multi-core x64 processors including industry-leading OpenMP parallel performance on the new SPEC® OMP2012 benchmark suite* running on the latest AVX-enabled processors from AMD and Intel (see chart below). Overall performance on the SPEC CPU 2006 floating-point benchmarks is over 10% faster compared to the initial version of PGI 2012 released in February 2012. Similar performance gains have been seen on other HPC benchmarks as well.

SPEC OMP 2012 Comparison

SPECompG_base2012 relative performance as measured by The Portland Group during the weeks of January 28 and February 4, 2013. The number of OpenMP threads was set to match the number of cores on each system.

Additional features and enhancements in PGI 2013 include:

  • GNU 4.7 compatible C++ in an all-new compiler that comes complete with the full suite of PGI optimizations plus support for CUDA-x86, OpenMP and OpenACC.
  • Fortran 2003 features added include recursive I/O, parameterized derived types, deferred type parameters and deferred character length.
  • OpenMP 3.1 support including task yield and new atomic functions in C.
  • PGI Visual Fortran© is now integrated with Visual Studio 2012 and includes the Visual Studio 2012 shell.
  • PGDBG© parallel MPI/OpenMP graphical debugging tool has an improved interface for displaying source code including a new user configurable disassembly display.

PGI 2013 supports the latest operating system releases including Red Hat Enterprise Linux 6.3, Fedora 17, OpenSuSE 12, Ubuntu 12.10, Windows 8 and OS X Mountain Lion.

More information about the PGI Accelerator compilers is available online at www.pgroup.com/accelerate. PGI CUDA Fortran information is available separately at www.pgroup.com/cudafortran. CUDA-x86 information is also available at www.pgroup.com/cuda-x86. Evaluation copies of the new PGI 2013 compilers are available. Registration is required.

* SPEC OMP2012 is designed to provide a comparative measure of compute intensive performance across platforms. More information is available at www.spec.org. More information about PGI 2013 SPEC OMP2012 performance is available on the PGI website at www.pgroup.com/benchmark.

SPEC® is a registered trademark of the Standard Performance Evaluation Corporation (SPEC).

Media Contact
Patrick Brooks
The Portland Group
(503) 628-2806

PGI Accelerator Compilers to Support New NVIDIA Tesla K20 Family of GPU Accelerators

SC12 Salt Lake City, UT
November 13, 2012

PGI 2013 release gives developers access to Kepler features

The Portland Group® (PGI), a wholly-owned subsidiary of STMicroelectronics and the leading independent supplier of compilers and tools for high-performance computing, announced at SC12 today that thePGI 2013 release of its PGI Accelerator™ compilers due out in early December will add support for the new family of NVIDIA® Tesla® K20 GPU accelerators and the CUDA® 5 parallel computing platform and programming model.

The PGI compilers and tools are used by scientists and engineers to create and run high-performance computing applications for complex modeling and simulations in weather forecasting, geophysical processing, aerodynamic simulation, structural analysis, automotive crash-testing, computational chemistry, and related fields. PGI compilers and tools support the latest 64-bit processors from AMD and Intel, and the new NVIDIA Tesla K20X and K20 GPU accelerators based on the next-generation NVIDIA Kepler™ compute architecture running on Linux, Apple OS X and Microsoft Windows platforms.

"We are tracking closely with NVIDIA and CUDA 5.0 to deliver support for dynamic parallelism and general function calls on this latest generation of GPU accelerators," said Douglas Miles, director, The Portland Group. "We've also been tuning code generation to optimize performance on Tesla K20 accelerators, and early reports from customers indicate that this is paying significant dividends on their GPU-enabled codes."

Introduced in 2009, PGI Accelerator compilers were the industry's first standard-compliant Fortran and C compilers to automatically offload computations from an x64 host program to a GPU accelerator. Also introduced in 2009, PGI CUDA Fortran is a Fortran analog of NVIDIA CUDA C. CUDA Fortran, included in all PGI Accelerator Fortran products, is comprised of a Fortran 2003 compiler and tool chain for programming NVIDIA GPUs using Fortran. PGI Release 2013 coming in December 2012 will include CUDA Fortran extensions for developers interested in accessing the new dynamic parallelism capabilities in the latest Kepler-based Tesla GPU accelerators, and the separate compilation and GPU object file linking capabilities of CUDA 5. PGI will be demonstrating the latest PGI Accelerator compilers in booth 1321 during the SC12 exhibition.

"PGI's powerful Accelerator compilers enable developers to take full advantage of the advanced new performance and efficiency features of CUDA 5 and the Tesla K20 accelerators," said Sumit Gupta, general manager of the Tesla accelerated computing business at NVIDIA. "Innovative technologies such as dynamic parallelism to Hyper-Q make the Kepler GPUs high performing and highly energy efficient, and more applicable to a wider set of developers and applications."

Media Contact
Patrick Brooks
The Portland Group
(503) 628-2806

PGI to Deliver OpenACC for Intel Xeon Phi

SC12, Salt Lake City, UT
November 13, 2012

The Portland Group to take PGI Accelerator programming model for heterogeneous HPC systems beyond GPUs

The Portland Group® (PGI), a wholly-owned subsidiary of STMicroelectronics and the leading independent supplier of compilers and tools for high-performance computing, today announced plans to extend its PGI Accelerator™ compiler technology with OpenACC to Intel® Xeon Phi™ coprocessors, Intel's family of products based on the Intel Many Integrated Core (MIC) architecture. Currently, scientists and engineers are using PGI Accelerator Fortran and C compilers as a means to exploit the massively parallel throughput capacity of CUDA-enabled GPUs from NVIDIA. Using their existing code bases and with only minor modifications to their build scripts, software developers using PGI Accelerator compilers will soon be able to target Intel Xeon Phi coprocessors as well.

"Until now, industry CIOs and government and university lab directors have encountered difficult decisions when evaluating accelerator technologies. Do we wait to see which technologies win out? Which programming model or models do we choose? How do we retrain developers to use these programming models? Will our applications be future-proof and portable across the different types of host CPUs and coprocessors?" said Douglas Miles, director, The Portland Group. "PGI Accelerator compilers will make Xeon Phi coprocessors programmable using standard OpenACC directives that are fully compatible with the accelerator application development efforts already under way at most significant HPC centers and sites. "

First announced in mid-2009, PGI Accelerator compilers provide a high-level coprocessor programming model intended for scientists, engineers and other domain experts who aren't full-time programmers. PGI Accelerator Fortran 2003, C and C++ compilers enable programmers to offload compute-intensive portions of an application to a coprocessor by adding portable compiler directives, treated as comments by other compilers, to existing standard-compliant programs and recompiling with appropriate compiler options. In 2012, the PGI Accelerator compilers were enhanced to support the new de facto standard OpenACC directives.

Key advantages to programming coprocessors with OpenACC directives include:

  • >High-level ? No prior experience with OpenCL, CUDA or other low-level programming models required. Ideal for domain experts.
  • Efficient ? Well-designed algorithms that would perform well when written using a low-level programming model can perform just as well using directives.
  • Single source base ? No separate co-processor source code required. Compile the same program for multi-core CPUs using PGI or any other standard-compliant compiler.
  • Performance portable ? Supports GPU accelerators and co-processors from multiple vendors.
  • Incremental ? Developers can port and tune parts of their application as time permits. No wholesale rewrite required.
  • Quick ? Most developers see results with modest effort.

"PGI OpenACC will enable programmers to develop portable applications that maximize the performance and power efficiency benefits of the hybrid CPU/Accelerator architecture of Titan," said Buddy Bland, Titan Project Director, Oak Ridge National Laboratory. "We are pleased to see that PGI is adding support for Intel Xeon Phi, which ensures that accelerated applications developed for Titan will port readily to HPC systems that incorporate Intel co-processors."

"As part of our on-going assessments of future computing architectures Sandia is actively engaged in the evaluation of solutions from a spectrum of vendors including Intel, NVIDIA, AMD, ARM and IBM," said Simon Hammond, Sandia National Laboratories. "We are already using PGI's compilers for exploratory code development on accelerators from NVIDIA. PGI's extension of OpenACC to support Intel's Xeon Phi processors enables us to use an open standards based programming model across a wide range of accelerator hardware."

"I've been working with PGI off-and-on since 2008 to explore programming models that can address both today's multi-core and tomorrows accelerator-enabled compute nodes," said John Michalakes, HPC Scientist, National Renewable Energy Laboratory. "PGI's commitment to creating programming models and compilers that enable functional and performance portability across platforms is unquestioned, and I look forward to using their compilers on Xeon Phi."

"We have been studying the performance of the FIM and NIM Numerical Weather Prediction models on both NVIDIA GPUs and Intel Xeon Phi co-processors," said Tom Henderson, Global Systems Division, at NOAA's Earth System Research Laboratory in Boulder. "We have already used the PGI compilers for our NVIDIA work and are pleased to see PGI will be supporting OpenACC as a uniform and standard programming model across both types of accelerators. We are hopeful that PGI OpenACC will provide a performance-portable solution that allows maintenance of a single source code for GPU, CPU, and Xeon Phi."

Beta support for the OpenACC standard on Xeon Phi coprocessors is planned for a PGI release in the first half of 2013. It will be available free of charge to PGI Accelerator licensees with a current PGI subscription. PGI Accelerator Fortran, C and C++ compilers with OpenACC directives currently support x64+NVIDIA systems running under Linux, OS X and Windows; the compilers are supported on all Intel and AMD x64 processor-based systems with CUDA-enabled NVIDIA GPUs. More information on the PGI Accelerator compilers with OpenACC is available at www.pgroup.com/accelerate. More information on the OpenACC API and standard can be found at www.openacc.org. More information on the Intel Xeon Phi coprocessor is at www.intel.com.

Media Contact
Patrick Brooks
The Portland Group
(503) 628-2806

PGI and AMD Collaborate on APU Compilers to the Benefit of HPC Programmers

Portland, Oregon
October 31, 2012

The Portland Group® (PGI), a wholly-owned subsidiary of STMicroelectronics and the leading independent supplier of compilers and tools for high-performance computing, today announced that PGI Accelerator™ Fortran, C and C++ compilers will soon target the AMD line of accelerated processing units (APUs) as well as the AMD line of discrete GPU (Graphics Processing Unit) accelerators. PGI will work closely with AMD to extend its PGI Accelerator directive-based compilers to generate code directly for AMD GPU accelerators, and to generate heterogeneous x64+GPU executable files that automatically use both the CPU and GPU compute capabilities of AMD APUs.

Introduced in 2009, PGI Accelerator compilers were the industry's first standard-compliant Fortran and C compilers to automatically offload computations from an x64 host program to a GPU accelerator. The PGI Accelerator programming model enables parallel programmers to offload code from a host CPU to an attached accelerator by using hints, known as directives, to identify areas of code suitable for acceleration. In addition to exposing parallelization opportunities to the compiler, directives can also be used to specify the details of how to efficiently map loops to a particular accelerator and how to optimize data movement. Directives free the developer to focus on algorithms and application functionality while the compiler does the detailed work of offloading computations onto an accelerator. The principle benefit of using compiler directives is significant improvements to application performance through incremental and portable modifications to existing source code.

"The PGI Accelerator compilers will open up programming of AMD APUs and GPUs to the growing number of HPC developers using directives to accelerate science and engineering applications," said Douglas Miles, director, The Portland Group. "Together with AMD, we are working to make heterogeneous programming easily accessible to mainstream C and Fortran developers, and to unleash the power of these devices."

"We look forward to working with PGI to ensure that through the use of standard compiler directives the full computational power of AMD platforms with integrated APUs can be easily tapped," said Terri Hall, Corporate VP, Business Alliances, AMD. "Engagements like this are key to expanding the developer ecosystem and the opportunities for AMD platforms."

In June 2012, PGI announced support throughout the PGI Accelerator product line for the OpenACC Application Programming Interface (API) version 1.0, a specification for directive-based accelerator programming developed jointly by several HPC industry members including PGI. More information on the OpenACC API and standard can be found at www.openacc.org. In addition to their GPU capabilities, PGI Accelerator compilers include PGI's complete suite of x86 host-performance optimization technologies including OpenMP extensions, automatic SIMD vectorization, auto-parallelization, inter-procedural analysis, function inlining, memory hierarchy optimizations and more.

Price and Availability
Preliminary support for AMD accelerators from within PGI Accelerator compilers is planned for mid-2013, and a full production release is scheduled for later that year. PGI will be demonstrating a development release of its PGI Accelerator compilers for AMD in booth 1321 at SC12 in Salt Lake City, November 12–15, 2012. Support for AMD accelerators will be included free of charge to PGI Accelerator licensees with a current PGI subscription. More information on the PGI Accelerator compilers is available at www.pgroup.com/accelerate.

Media Contact
Patrick Brooks
The Portland Group
(503) 628-2806

SGI to Resell PGI High-performance Compilers

Portland, Oregon and Fremont, California
October 25, 2012

Customers can now purchase PGI Fortran C, C++ compilers and development tools directly from SGI

The Portland Group® (PGI), a wholly-owned subsidiary of STMicroelectronics and the leading independent supplier of compilers and tools for high-performance computing, today announced a new reseller agreement with Silicon Graphics® International Corp. (SGI®), the trusted leader in technical computing. Under this agreement, SGI will resell PGI Fortran C and C++ compilers and development tools with SGI's line of servers based on Intel® processors and NVIDIA® GPU accelerators.

The PGI compilers and tools are used by scientists and engineers to create and run high-performance computing (HPC) applications for complex modeling and simulations in weather forecasting, geophysical processing, aerodynamic simulation, structural analysis, automotive crash-testing, computational chemistry, and related fields. PGI compilers and tools support the latest 64-bit processors from Intel and NVIDIA® Tesla® GPU accelerators, including Tesla GPUs based on the next-generation NVIDIA® Kepler™ architecture.

"The process of procuring software packages separate from an HPC hardware system can often be difficult and time-consuming," said Douglas Miles, director of The Portland Group. "Now, customers who purchase x64 or x64+GPU processor-based systems from SGI can easily include a widely-used and integrated set of high-performance compilers and development tools that enable porting, debugging and tuning of MPI and OpenMP parallel science and engineering applications."

SGI develops, markets, sells and services technical computing solutions consisting of a broad line of low-cost, mid-range and high-end servers, including scale-out SGI® Rackable™ and SGI® ICE™, and scale-up SGI® UV™ servers, and data storage solutions, differentiating software, and infrastructure products that are designed-to-order for large-scale data center deployment. These solutions can enable enterprises to meet computing and storage requirements at a lower total cost of ownership with greater flexibility and scalability.

"SGI is pleased to offer The Portland Group compiler products to our customers. The integrated SGI UV systems with NVIDIA Tesla accelerators and PGI Accelerator Cluster Development Kit is a strong CUDA and OpenACC application development platform that can enable accelerating more innovative applications in the areas of life sciences, Government, research, and manufacturing," stated Christian Tanasescu, SGI Vice President, Software Engineering.

"The era of accelerated computing is here. SGI, with the help of PGI compilers, enables scientific and technical computing users to move GPU-accelerated computing more easily and quickly," said Sumit Gupta, senior director of the Tesla GPU computing business at NVIDIA.

The PGI compiler suite includes the PGFORTRAN™, PGCC®, and PGC++™ compilers for the Fortran 2003, C, and C++ programming languages, respectively. PGI compilers and tools feature full native support for OpenMP parallel programming extensions in Fortran, C, and C++, full support for 64-bit addressing and native integrated scalar and vector SSE and AVX code generation.

The PGI Accelerator™ Fortran 2003 and C99 compilers also include support for the OpenACC directive-based x64+GPU programming model, CUDA Fortran extensions for NVIDIA GPUs, and CUDA C/C++ extensions for 64-bit and 32-bit x86 CPUs. PGI products include a bundled version of the ACML library of highly optimized numeric functions for mathematical, engineering, scientific, and financial applications, the PGDBG® MPI and OpenMP parallel debugger and PGPROF® MPI, OpenMP and CUDA parallel performance profiler. Cluster configurations are also available.

Media Contact
Patrick Brooks
The Portland Group
(503) 628-2806

PGCL page on The Portland Group website.

Media Contact
Mike Markowitz
STMicroelectronics
[enable javascript]
(781) 591-0354

-->

PGI Ships PGI Accelerator Compilers with OpenACC

Hamburg, Germany
June 19, 2012

Release Supports the Version 1.0 Specification of the OpenACC Standard, a Directive-Based Programming Model for GPUs and Accelerators

The Portland Group® (PGI), a wholly-owned subsidiary of STMicroelectronics and the leading independent supplier of compilers and tools for high-performance computing, today announced immediate availability of its PGI Accelerator™ Fortran and C compilers with support for the new OpenACC® 1.0 specification for directive-based programming of GPUs and accelerators that allows programmers to provide hints or guidance in their programs.

"PGI continues to make accelerator programming more straight forward and productive for domain experts in science and engineering," said Douglas Miles, Director of The Portland Group. "The OpenACC standard dovetails perfectly into our existing compiler products as a subset of the PGI Accelerator programming model. The PGI Accelerator compilers for Fortran 2003 and C include support for both high-level directive-based x64+GPU programming and explicit GPU programming using CUDA C/C++/Fortran language extensions."

The OpenACC standard, unveiled at the SC11 conference in November, 2011, as founded by PGI, Cray, and NVIDIA, with support from CAPS, is fully compatible and interoperable with the NVIDIA® CUDA® parallel programming architecture. The OpenACC 1.0 specification was developed cooperatively by the founding members and is based in large part on the PGI Accelerator programming model. The OpenACC Application Program Interface (API) describes a collection of compiler directives to specify loops and regions of code in standard C, C++ and Fortran to be offloaded from a host CPU to an attached accelerator, providing portability across operating systems, host CPUs and accelerators. By using directives, developers can have a single code base that is multi-platform and multi-vendor compatible, a key advantage for cross-platform and multi-generation application development.

Using OpenACC, parallel programmers can offload code from a host CPU to an attached accelerator by using hints, known as directives, to identify areas of code suitable for acceleration. In addition to exposing parallelizing opportunities to the compiler, directives can also be used to specify the details of how to efficiently map loops to a particular accelerator and how to optimize data movement. Directives free the developer to focus on algorithms and application functionality while the compiler does the detailed work of offloading computations onto an accelerator. The principle benefit of directives is significant improvements to application performance without requiring modification to the underlying source code.

PGI Accelerator Compilers with OpenACC Support
First announced in 2008, the PGI Accelerator compilers augment the OpenACC standard in several areas to provide enhanced functionality and flexibility including:

  • Auto-generation of optimized loop schedules.
  • Automatic use of shared memory.
  • Automatic sum reductions.
  • Interoperability with CUDA Fortran and CUDA C/C++.
  • PGI Unified Binary™ technology executable files that work in the presence or absence of an accelerator.

In addition, PGI Accelerator compilers include PGI's complete suite of x86 host-performance optimization technologies including automatic SIMD vectorization, auto-parallelization, interprocedural analysis, function inlining and more.

Currently, PGI Accelerator Fortran and C99 compilers support x64+NVIDIA systems running under Linux, OS X and Windows; the compilers are supported on all Intel and AMD x64 processor-based systems with CUDA-enabled NVIDIA GPUs. OpenACC support will be included in PGI Release 2012 version 12.6 and later. It is available free of charge to PGI Accelerator licensees with a current PGI subscription. A free trial version is available from the PGI website at www.pgroup.com/support/trial.htm. More information on the PGI Accelerator compilers with OpenACC is available at www.pgroup.com/accelerate. More information on the OpenACC API and standard can be found at www.openacc.org.

Media Contact
Patrick Brooks
The Portland Group
[enable javascript]
(503) 682-2806

First PGI Compilers with Support for OpenACC Now Available

Portland, Ore.
March 22, 2012

Beta Release Includes Initial Support for Standardized Directive-Based Programming Model for GPUs and Accelerators

The Portland Group® (PGI), a wholly-owned subsidiary of STMicroelectronics and the leading independent supplier of compilers and tools for high-performance computing, today announced immediate availability of the initial release of its PGI Accelerator™ Fortran and C compilers with partial support for the new OpenACC® specification for directive-based programming of GPUs and accelerators. In the next few months, PGI plans to include additional OpenACC features with full support for the OpenACC 1.0 specifications scheduled for release in June.

"PGI continues to invest in technologies to make GPU programming more straight-forward and productive for domain experts in science and engineering," said Douglas Miles, Director of The Portland Group. "The OpenACC standard was developed in direct response to HPC community's interest for a vendor-neutral, platform-independent, directive-based accelerator programming model, and dovetails perfectly into our existing compiler products as a standardized subset of the PGI Accelerator programming model."

The OpenACC standard, unveiled at SC11 in November, 2011, as founded by PGI, Cray, and NVIDIA, with support from CAPS, is fully compatible and interoperable with the NVIDIA® CUDA® parallel programming architecture. The OpenACC 1.0 specification was developed in cooperation by the founding members and is based in large part on the PGI Accelerator programming model. The OpenACC Application Program Interface (API) describes a collection of compiler directives to specify loops and regions of code in standard C, C++ and Fortran to be offloaded from a host CPU to an attached accelerator, providing portability across operating systems, host CPUs and accelerators. By using directives, developers can have a single code base that is multi-platform and multi-vendor compatible, a key advantage for multi-platform and multi-generation application development.

Directive-based Programming
The use of OpenACC allows parallel programmers to provide simple hints, known as "directives," to the compiler, which identify areas of code to accelerate, without requiring programmers to modify or adapt the underlying code itself. By exposing parallelism to the compiler, directives allow the compiler to do the detailed work of mapping the computation onto the accelerator to deliver significant improvements to application performance.

"OpenACC is key to enabling the extensive community of new researchers and programmers to, for the first time, take advantage of the transformative benefits of parallel computing," said Ian Buck, general manager, GPU computing software at NVIDIA. "OpenACC makes it very easy for individuals with or without extensive parallel programming experience to accelerate new applications using a familiar programming model. And, it breathes new life into legacy applications by maintaining code portability during the transition to high-performance parallel computing architectures."

A trial version of the beta release of the PGI Accelerator compilers with support for the OpenACC standard is available for free from the PGI website at http://www.pgroup.com/support/downloads.php. The beta software includes a restricted use license. The license agreement is available at http://www.pgroup.com/support/BTLA. More information on the PGI Accelerator compilers with OpenACC support is available at http://www.pgroup.com/accelerate. More information on the OpenACC API and standard can be found at: http://www.openacc.org.

Media Contact
Patrick Brooks
The Portland Group
[enable javascript]
(503) 682-2806

New Release of HPC Compilers and Development Tools Now Available from The Portland Group

Portland, Ore.
March 5, 2012

PGI 2012 adds support for OpenACC GPU directives and native CUDA C/C++ optimized for multi-core x86

The Portland Group® (PGI), a wholly-owned subsidiary of STMicroelectronics and the leading independent supplier of compilers and tools for high-performance computing, today announced availability of the 2012 release of the PGI® line of high-performance parallelizing compilers and development tools for Linux, OS X and Windows. PGI 2012 is the first general release to include support for the OpenACC directive-based programming model for NVIDIA CUDA-enabled Graphics Processing Units (GPUs). This release is also the first to include the fully feature-enabled PGI CUDA C/C++ compiler for multi-core x64 CPUs from Intel and AMD. In addition, PGI 2012 includes a number of performance and feature enhancements for multi-core x64 processor-based HPC systems.

"GPU Accelerators are now a mainstay in HPC with NVIDIA's CUDA achieving the widest adoption so far", said Douglas, Miles, director of The Portland Group. "With Release 2012, PGI continues to supplement and refine its GPU programming tools so developers wishing to access the huge potential performance of GPUs can do so in a productive and portable way."

First announced in 2008, the PGI Accelerator Fortran and C compilers provide a directive-based high-level approach to GPU programming. Targeting scientists and engineers who are not full time programmers, the PGI Accelerator programming model frees developers from the tedious aspects of GPU programming and lets them focus instead on optimizing their algorithms. Other benefits compared to low-level GPU programming models include enhanced productivity (many developers see performance gains in just minutes) and improved source code portability achieved by retaining compatibility across GPUs and CPUs. PGI 2012 includes full support for the PGI Accelerator 1.3 specification including asynchronous data transfer and kernel launch directives, and enhanced data management capabilities.

Users reporting about their experiences with PGI Accelerator: "I have written micro-magnetic codes (written in Fortran 90) to study the properties of two and three dimensional magnetic systems. The directives approach enabled me to port my existing code with ease to perform my computations on the GPU which resulted in a significant speedup (more than 20 times) of the computation," said Professor M. Amin Kayali from University of Houston. And Panos Labropoulos from the University of Groningen and ASTRON reports, "My main interest is the calibration and imaging using data from radio arrays and in particular LOFAR (www.lofar.org). Such an array consists of tens to hundreds of thousands on relatively simple dipole antenna elements that collect low frequency astronomical signals. With directives and GPUs, we were able to accelerate the code by nearly 6x in 5 days. This speed-up is significant because it allows us to process this enormous amount of data (using many nodes, each with a couple GPUs), in a reasonable amount of time. It will save us years of work."

In addition to enhancements to the PGI Accelerator programming model, PGI 2012 also introduces support for the OpenACC Application Programming Interface (API). Announced by NVIDIA, Cray, CAPs and PGI in November 2011, the OpenACC API is an industry-wide effort to standardize accelerator programming based on compiler directives. Based in large part on the PGI Accelerator programming model, the OpenACC API provides a hardware independent methodology for programming accelerators, co-processors and related offload engines. Currently, PGI is conducting a closed evaluation of its first Fortran and C compilers to include support for a subset of the OpenACC 1.0 specification. A public evaluation will be available in March. PGI expects to ship fully OpenACC version 1.0 conformant compilers by mid-year 2012.

Other features and enhancements in PGI 2012 include expanded support for the new Advanced Vector Extensions to the x64 instruction set architecture (AVX) in the latest Intel Sandy Bridge and AMD Bulldozer CPUs, improved auto-parallelization support, support for recursive I/O (part of the Fortran 2003 language standard), a new GNU compatible C++ compiler with all PGI features and optimizations, OpenMP nested parallelism, remote cluster debugging support and Eclipse integration of the PGI C/C++ compilers on Linux. Additional GPU programming features new in PGI 2012 include support for the latest CUDA 4.1 release and support for several of the new features in the latest generation of NVIDIA CUDA-enabled GPUs including asynchronous data transfers and kernel launch. New in CUDA Fortran is the ability to specify a stream identifier in CUF kernels. Finally, PGI 2012 includes the fully featured, performance optimized PGI CUDA C/C++ compiler for x86. Developed in collaboration with NVIDIA and announced at the NVIDIA GPU Technology Conference in Beijing, China last December, CUDA-x86 processes CUDA C/C++ as a native parallel-programming language for general-purpose multi-core x86 microprocessors from AMD and Intel.

PGI 2012 supports the latest operating system releases including Red Hat Enterprise Linux 6.2, Fedora 16, SLES 11 SP1, Ubuntu 11.10 and Apple OS X Lion.

More information about the PGI Accelerator and OpenACC programming models are available online at www.pgroup.com/accelerate. PGI CUDA Fortran information is available separately at www.pgroup.com/cudafortran. CUDA-x86 information is also available at www.pgroup.com/cuda-x86. Evaluation copies of the new PGI 2012 compilers are available from The Portland Group web site at www.pgroup.com. Registration is required.

Media Contact
Patrick Brooks
The Portland Group
[enable javascript]
(503) 682-2806

The Portland Group Announces Availability of CUDA C/C++ Compiler for Multi-core x86

Portland, Ore.
December 14, 2011

New CUDA compiler enables parallel programmers to take full advantage of industry-standard CPUs from AMD and Intel

The Portland Group, a wholly-owned subsidiary of STMicroelectronics and a leading supplier of compilers for high-performance computing (HPC), today announced that a performance-optimized PGI CUDA C/C++ compiler for multi-core x86 platforms (CUDA-x86) will ship with its PGI 2012 release due out in January 2012.

CUDA is NVIDIA's parallel computing architecture that enables dramatic increases in computing performance by harnessing the power of NVIDIA GPUs. Originally announced at the 2010 NVIDIA GPU Technology Conference (GTC), CUDA-x86 extends CUDA beyond the GPU into a system-wide programming model. The release of CUDA-x86 is a key step towards making the x86+GPU architecture an integrated parallel platform.

"CUDA-x86 is another key milepost in PGI's roadmap for comprehensive support of programming of heterogeneous CPU+GPU systems," stated Douglas Miles, director, The Portland Group. "Directive-based GPU programming with the PGI Accelerator compilers makes it extremely easy to get started programming GPUs. CUDA extensions allow programmers to write explicit parallel algorithms that are inherently massively scalable, and to dive as deep as they want to go to optimize for a specific platform. We expect both approaches to be critically important as accelerator-based systems become more and more pervasive."

"CUDA is the most popular GPU parallel-programming model in the world today, and developers need the flexibility to target multiple architectures with the same code,." said Sanford Russell, director of CUDA marketing at NVIDIA. ."The release of CUDA-x86 delivers the benefits of CUDA as a general-purpose programming model for high-performance parallel applications running in heterogeneous computing environments."

The PGI compiler for CUDA-x86 processes CUDA C/C++ as a native parallel-programming language for general-purpose multi-core x86 microprocessors from AMD and Intel. CUDA-x86 includes full support for NVIDIA's CUDA C/C++ language for GPUs, so programmers can recompile CUDA application source code for execution on an x86 host. Using CUDA-x86 developers can compile and optimize their CUDA applications to run on x86-based workstations, servers and clusters with or without an NVIDIA GPU accelerator. CUDA C/C++ applications compiled for x86 targets use multiple cores and the streaming SIMD (Single Instruction Multiple Data) capabilities of Intel and AMD CPUs for parallel execution.

Highlights of PGI CUDA C/C++ compiler for multi-core x86 include:

  • Optimization and parallelization of native CUDA C/C++on x86 hosts.
  • Low-overhead native parallel execution of CUDA C/C++ on x86 hosts.
  • Executes each CUDA thread block using a single host core; automatically eliminates synchronization where possible.
  • Support for the latest processors from AMD and Intel including support for the new AVX instructions.
  • Automatically in-lines device kernel functions and translates chevron syntax to parallel/vector loops.
  • Full support for NVIDIA's CUDA C/C++language for GPUs on x86 hosts.
  • Full support for GPU texture memory.
  • NVIDIA CUBLAS library support.
  • Supports all PGI host optimizations for Intel/AMD.

In addition, PGI CUDA C/C++ for GPU devices is planned for release in mid 2012. At that time, using PGI Unified Binary™ technology, one binary will be able to use NVIDIA GPUs when present or default to using multi-core x86 if no GPU is present.

Performance Data

In a performance comparison of popular parallel programming models, PGI compiled the CUDA-x86 version of a LBM benchmark (part of the Parboil benchmark suite) and compared execution time to the same program parallelized using OpenMP. Program execution times for the different programming models are shown below:

Benchmark OpenMP Execution Time CUDA Execution Time
LBM 221 sec. 221 sec.

System: 4 core Intel i7 920 (2.67GHz), 12GB, Red Hat Enterprise Linux 5.3

Additional performance information is available on the PGI website at http://www.pgroup.com/cuda-x86

Price and Availability
The PGI CUDA C/C++compiler on multi-core x86 is part of the PGI 2012 release version 12.1 due out in January. It is available at no charge to PGI Accelerator C/C++ licensees with a current PGI Subscription service. PGI products are supported on the Linux, Apple Mac OS X and Microsoft Windows operating systems. A 15 day evaluation is available from The Portland Group web site at www.pgroup.com. Registration is required.

Media Contacts
Mike Markowitz
STMicroelectronics
[enable javascript]
(781) 591-0354

nCore and The Portland Group Team to Produce Training for Programming GPU Accelerators

Portland, Ore.
November 15, 2011

The Portland Group, a wholly-owned subsidiary of STMicroelectronics and a leading supplier of compilers for high-performance computing (HPC), and nCore Design, a leading training and professional services provider, today announced their collaboration to offer a new 2-day training course on programming GPUs using the PGI Accelerator™ programming model.

The new course, titled NCT-500 PGI Accelerator Programming, has been developed by nCore working with PGI. The two-day intensive training course employs in-depth hands-on lectures and laboratories to provide attendees with comprehensive training on all aspects of programming GPUs with PGI's directive-based programming model. The course is appropriate for engineers, scientists and other domain experts seeking to improve their GPU programming skills. An intermediate knowledge of C or Fortran programming, as well as knowledge of computer architectures and corresponding software development experience, are mandatory pre-requisites for this course.

"This course will provide attendees with the insights and skills necessary to have them up and running quickly porting their applications to GPUs," said Douglas Miles, Director of The Portland Group. "nCore brings tremendous expertise, along with a solid track record for providing quality training and professional service."

"GPU programming skills are in high demand and we are thrilled to be working closely with The Portland Group to produce this course based on PGI's Accelerator Programming Model," said Ian Lintault, Managing Director of nCore. "This instructor-led course will be offered at client locations for groups of eight to 15. Our hands-on learning style assures students will be able to reinforce theory and concepts with practical exercises—a successful training model nCore has perfected."

The two-day course, NCT-500 PGI Accelerator Programming, is available from nCore starting November 14, 2011 and is priced at $1,895.00 per student. For more information, contact info@ncoredesign.com or ncoredesign.com/pgi/ for a booking.

About nCore Design
nCore Design is a global provider of training and professional services focused on multicore and many-core computing platforms. As the world leader in hands-on, instructor-led training courses, nCore helps clients increase software performance, throughput, and efficiency while reducing time-to-market.

nCore develops high performance solutions based on state-of-the-art parallel technologies using our specialized knowledge of applied software optimization techniques on modern computing platforms.

nCore works with local partners to deliver state-of-the-art solutions to government agencies, high-technology organizations, defense, research, biomedical and financial companies. nCore is a privately held company with offices in the US and Japan.

To learn more about nCore Design, call 971-228-0700 or access http://ncoredesign.com/

Media Contacts
Mike Markowitz
STMicroelectronics
[enable javascript]
(781) 591-0354

The Portland Group Adds Support for AMD's "Bulldozer" Architecture

Portland, Ore.
October 26, 2011

PGI compilers now support Advanced Vector Instructions (AVX) for both AMD and Intel x64 CPUs

The Portland Group, a wholly-owned subsidiary of STMicroelectronics and a leading supplier of compilers for high-performance computing (HPC), today announced its products now include support for the upcoming microprocessors from AMD (NYSE: AMD) based on their &quo;Bulldozer&quo; architecture.

The "Bulldozer" core architecture uses a flexible floating point unit called the Flex FP that can simultaneously execute two 128-bit commands or one 128-bit and one 256-bit command. The powerful floating point unit is included in the upcoming 16-core processors codenamed "Interlagos" intended for high-performance servers. Additionally, "Interlagos" delivers a rich mix of features targeting complicated, multi-threaded HPC environments.

"PGI is the compiler-of-choice for performance critical HPC applications, science and engineering ISC applications, and research community applications on AMD Opteron™ processor-based systems," said Margaret Lewis, director of Commercial Solutions and Software Strategy, AMD. "The combination of AMD's Flex FP architecture with AVX and FMA4 together with PGI compilers will enable customers to fully exploit the power of AMD "Bulldozer" based processors."

Supercomputer maker Cray Inc., which includes PGI compilers with nearly all of its HPC systems worldwide, has indicated that some of its largest HPC customers will upgrade their existing systems to include 16-core "Interlagos" processors.

"PGI compilers, with their support and optimization for both the latest AMD Interlagos CPUs and many-core GPUs, are a key component of the Cray XE6 and Cray XK6 supercomputers,." said Barry Bolding, vice president of Cray's products division. ."PGI will play a critical role in helping Cray customers maximize the performance and value of these next-generation systems including AVX-enabled AMD CPUs."

PGI has been aggressively working to add support to its compilers for AMD's "Bulldozer" architecture. A key feature of the offering is the addition of support for AMD's variety of Advanced Vector Instructions (AVX), including AMD's extended FMA4 instruction set. PGI has experience with AVX instructions, having supported Intel's AVX instruction set, including FMA, since May 2011.

"This new processor technology coming to market has the potential to deliver a big performance boost and PGI is helping HPC programmers realize that potential by delivering today compilers and tools specifically designed to take advantage of this e latest generation of AMD processor-based systems," said Douglas Miles, director, The Portland Group. "AMD plays a critical role in the HPC community and brings important new performance capabilities with its upcoming 16-core AVX-enabled processors. After extensive development, we are pleased with the early performance results our compilers have demonstrated with this new micro-architecture."

PGI compilers and tools are widely used in HPC development environments as more organizations seek cross-platform compatibility and productivity. A key advantage of the compilers' is PGI Unified Binary™ technology—the ability to generate a single executable file containing code sequences optimized for multiple processors from AMD and Intel. PGI Unified Binary technology enables Independent Software Vendors (ISVs) and custom applications developers to take advantage of the latest processor innovations while treating x64 as a single platform, maximizing flexibility and eliminating the need to target and optimize for separate processors.

"The importance of the PGI Unified Binary is stronger than ever," added Miles. "AMD has taken a lead in including FMA4 instructions in its architecture that will affect binary compatibility and performance portability. PGI provides the easy-to-use bridge that enables programmers to efficiently and seamlessly move their code between AMD and Intel environments without having to worry about underlying instruction set differences."

Support for AMD's "Bulldozer" architecture is included in all PGI products effective immediately, and is available at no charge to licensees with a current PGI's Subscription service. PGI products are supported on the Linux, Apple Mac OS X and Microsoft Windows operating systems. Evaluation copies are available from The Portland Group web site at www.pgroup.com. Registration is required.

Media Contacts
Mike Markowitz
STMicroelectronics
[enable javascript]
(781) 591-0354

The Portland Group Updates Compilers to Deliver NVIDIA CUDA Architecture for x86 Platforms

Portland, Ore.
June 16, 2011

First release of new PGI C/C++ compilers lets CUDA developers target industry standard servers from Intel and AMD

The Portland Group®, a wholly-owned subsidiary of STMicroelectronics (NYSE: STM) and a leading supplier of compilers for high-performance computing (HPC), today announced that it is now shipping the PGI CUDA C and C++ compilers for systems based on the industry standard general-purpose 64-bit and 32-bit x86 architectures.

CUDA is NVIDIA's parallel computing architecture that enables dramatic increases in computing performance by harnessing the power of NVIDIA GPUs (graphics processing units). With PGI's new CUDA C/C++ compilers, significantly more developers can use the CUDA parallel programming model to optimize the performance of the critical parts of their code base, targeting servers and clusters with or without NVIDIA GPUs.

"With the addition of PGI CUDA C and C++ for x86, PGI further extends its comprehensive suite of tools for programming GPUs," said Douglas Miles, director, The Portland Group. "It's another important element in our ongoing strategy of providing HPC programmers with a full range of options for optimizing compute-intensive applications and leveraging the latest technical innovations from AMD, Intel and NVIDIA."

"CUDA is the world's preeminent parallel programming model, supporting a range of open standards, architectures and programming languages," said Sanford Russell, director of CUDA marketing at NVIDIA. "Now for the first time, developers can run their CUDA apps on any x86 clusters."

When run on x86-based systems, PGI CUDA C/C++ applications perform parallel execution by using the multiple processor cores, and by using Streaming SIMD (Single Instruction Multiple Data) Extensions (SSE), including the new AVX instructions available on the latest generation of x86 compatible CPUs from Intel and AMD.

PGI will roll out the x86 CUDA C/C++ compilers in three phases. Phase 1, available now, demonstrates the capabilities of the technology and allows developers to begin working with the compilers. Phase 2, scheduled for the fourth quarter of 2011, will include performance optimizations intended to extract maximum performance of CUDA programs running on the x86 target platform. Phase 3, planned for mid 2012, will include support for PGI Unified Binary(tm) technology-the ability to run one executable on both CPUs and GPUs. The PGI CUDA C/C++ compilers for x86 are included as part of the PGI Accelerator(tm) product line. All PGI Accelerator products support both CUDA programming and the PGI Accelerator high-level directive-based programming model targeting scientific and engineering-domain experts working in high-performance computing.

PGI Accelerator compilers are currently available for C99 and Fortran 2003. Both CUDA Fortran, a Fortran analog to NVIDIA CUDA C, and PGI CUDA C/C++ for x86 were developed by PGI in cooperation with NVIDIA. HPC programmers targeting applications for GPUs as well 64-bit x64 and 32-bit x86 processor based systems use PGI products widely. PGI products are supported on the Linux, Apple Mac OS X and Microsoft Windows operating systems.

More information about PGI CUDA C/C++ for x86 is available online at www.pgroup.com/cuda-x86. 15 day evaluation copies are available as well. Registration is required.

For more information on NVIDIA CUDA, please visit www.nvidia.com/cuda.

All registered trademarks and marks are the property of their respective owners.

Media Contacts
Mike Markowitz
STMicroelectronics
[enable javascript]
(781) 591-0354

The Portland Group Announces PGI 2011 HPC Compilers and Development Tools

Portland, Ore.
February 17, 2011

New release includes support for latest Intel and AMD microprocessors, enhancements for programming GPUs

The Portland Group®, a wholly-owned subsidiary of STMicroelectronics today announced availability of the 2011 release of the PGI® line of high-performance parallelizing compilers and development tools for Linux, Mac OS X and Windows. PGI 2011 is the first general release to include full support for the PGI Accelerator programming model 1.2 specification on x64 processor-based systems incorporating NVIDIA CUDA1-enabled Graphics Processing Units (GPUs). In addition, the new PGI release includes a number of enhancements for multi-core x64 processor-based HPC systems

New features and enhancements include support for the new Advanced Vector Extensions to the x64 instruction set architecture (AVX) in upcoming Intel Sandy Bridge and AMD Bulldozer CPUs, support for the Fortran 2003 language standard, enhancements in C++ performance through default fast exception handling and improved Boost C++ libraries support, OpenMP nested parallelism, new memory-hierarchy optimizations, debugger improvements including compact parallel register displays and tab-based sub-windows, and performance profiler enhancements to simplify browsing of multi-core profiles. The 2011 release also supports GPU performance profiling and benefits from revamped packaging for faster download and installation.

"PGI compilers are a key software component on Cray XE6 systems, providing our customers with tremendous value and performance," said Barry Bolding, vice president of Cray's products division. "PGI plays a pivotal role in the development of optimizing compiler technology for the latest HPC systems, and is an important collaborator with Cray in our efforts to develop and standardize high-level portable programming models applicable to current and future many-core and accelerator-enabled systems."

Major HPC system vendors including Cray, HP and IBM are all now delivering systems incorporating GPU accelerators. Fully utilizing the performance potential from these new systems requires most HPC developers to modify and re-compile their code. Several different models for programming GPUs have been proposed with NVIDIA's CUDA architecture and programming model achieving the widest adoption within the HPC community. In response to HPC market demand, NVIDIA and PGI cooperated to develop CUDA Fortran, a Fortran 2003 compiler and tool chain for programming of NVIDIA GPUs using Fortran in 2009. The two companies are now working together to develop the PGI CUDA C/C++ compiler that enables CUDA programs to port to any multi-core x64 processor-based system with or without NVIDIA GPU accelerators. This capability was demonstrated at SC10 last November, and will be rolled out to PGI users in phases as part of PGI 2011 software updates throughout the coming year.

"The continuing evolution of the PGI compilers to support the CUDA parallel architecture ensures that applications developed by more than 100,000 CUDA developers worldwide can be portable to all types of HPC systems," said Sanford Russell, director of CUDA marketing at NVIDIA. "This trend will clearly continue with the upcoming release of the CUDA-x86 compiler, enabling developers to compile and optimize their CUDA applications to run on x86-based systems."

PGI 2011 CUDA Fortran enhancements include CUDA Fortran module data sharing, a new capability to automate CUDA Fortran kernel generation, and optimized array slice assignments. New code generation and scheduling optimizations, including automatic loop unrolling, contribute to improved performance of both PGI Accelerator and CUDA Fortran GPU codes.

In addition to CUDA language extensions, PGI 2011 expands support for the high-level PGI Accelerator programming model, a set of compiler directives used to specify regions of code in standard Fortran and C programs that can be offloaded from a host CPU to an attached accelerator to enhance performance. PGI 2011 adds full PGI Accelerator 1.2 programming model support, including inter-procedural device resident data which enables larger portions of applications to be effectively accelerated, and the cache and unroll clauses that allow fine tuning of GPU kernels by the programmer. Concurrently, PGI has released the PGI Accelerator 1.3 programming model specification which includes a roadmap of planned future developments.

PGI 2011 supports the latest operating system releases including Red Hat Enterprise Linux 6, Fedora 13, SLES 11 PS1 and Ubuntu 10.10.

More information about the PGI Accelerator programming model is available online at www.pgroup.com/accelerate. PGI CUDA Fortran information is available separately at www.pgroup.com/cudafortran. Evaluation copies of the new PGI 2011 compilers are available from The Portland Group web site at www.pgroup.com. Registration is required.

All registered trademarks and marks are the property of their respective owners.

Media Contacts
Mike Markowitz
STMicroelectronics
[enable javascript]
(212) 821 8959

PGI to Develop Compiler Based on NVIDIA CUDA C Architecture for x86 Platforms

San Jose, California
September 21, 2010

PGI to Demonstrate New PGI CUDA C Compiler at SC10 Supercomputing Conference in November

The Portland Group®, a wholly-owned subsidiary of STMicroelectronics and the leading independent supplier of compilers for high-performance computing, today announced it is developing a CUDA C compiler targeting systems based on the industry-standard general-purpose 64- and 32-bit x86 architectures. The new PGI CUDA C compiler for x86 platforms will be demonstrated at the SC10 Supercomputing conference taking place in New Orleans, LA, November 13-15, 2010.

The NVIDIA CUDA architecture was developed to enable offloading computationally intensive kernels to massively parallel GPUs. Through function calls and language extensions, CUDA gives developers explicit control over the mapping of general-purpose computational kernels to GPUs, as well as the placement and movement of data between an x86 processor and the GPU.

The PGI CUDA C compiler for x86 platforms will allow developers using CUDA to compile and optimize CUDA applications to run on x86-based workstations, servers and clusters with or without an NVIDIA GPU accelerator. When run on x86-based systems without a GPU, PGI CUDA C applications will use multiple cores and the streaming SIMD (Single Instruction Multiple Data) capabilities of Intel and AMD CPUs for parallel execution.

"CUDA C for x86 is a perfect complement to CUDA Fortran and PGI's optimizing parallel Fortran and C compilers for multi-core x86," said Douglas Miles, director, The Portland Group. "It's another important element in our on-going strategy of providing HPC programmers with development tools that give PGI users a full range of options for optimizing compute-intensive applications, while allowing them to leverage the latest technical innovations from AMD, Intel and NVIDIA."

"In less than three years, CUDA has become the most widely used massively parallel programming model," said Sanford Russell, general manager of GPU Computing software at NVIDIA. "With the CUDA for x86 CPU compiler, PGI is responding to the need of developers who want to use a single parallel programming model to target many core GPUs and multi-core CPUs."

PGI offers two programming models for GPU accelerators. PGI Accelerator™ is a high-level directive-based programming model targeting scientific and engineering-domain experts working in high-performance computing. PGI Accelerator compilers are currently available for C99 and Fortran 95/2003. CUDA Fortran, a Fortran 95/2003 analog to NVIDIA CUDA C, was developed by PGI in cooperation with NVIDIA in 2009. CUDA Fortran allows expert programmers to control all aspect of GPU programming. In addition to programming GPU accelerators, PGI products are used widely by HPC programmers targeting applications for 64-bit x64 and 32-bit x86 processor based systems.

For more information on NVIDIA CUDA, please visit www.nvidia.com/cuda.

All registered trademarks and marks are the property of their respective owners.

Media Contacts
Mike Markowitz
STMicroelectronics
[enable javascript]
(212) 821 8959

The Portland Group Releases PGI Visual Fortran for Visual Studio 2010

Portland, Ore
July 13, 2010

PGI optimizing Fortran compiler for multi-core x64+GPU platforms includes bundled version of latest Windows IDE

The Portland Group®, a wholly-owned subsidiary of STMicroelectronics and the leading independent supplier of compilers for high-performance computing, today announced the general availability of PGI Visual Fortran® (PVF®) for Visual Studio 2010. PVF integrates PGI high-performance parallel Fortran compilers and tools with Microsoft Visual Studio to offer a high-productivity development solution to scientists and engineers upgrading to the latest 64-bit multi-core platforms running Microsoft Windows.

PGI compilers and tools are used widely by performance-oriented programmers on Linux, Mac OS X and Windows systems based on multi-core CPUs from Intel and AMD and incorporating GPU accelerators from NVIDIA. The new 10.6 version of the PGI 2010 release adds support for building Windows Fortran applications using the latest version of the popular Microsoft Visual Studio Integrated Development Environment (IDE)-Visual Studio 2010. Visual Studio is the most widely used IDE in the world. PVF tools and technologies, including an MPI/OpenMP parallel debugger, enable Visual Studio developers to efficiently develop High Performance Computing (HPC) applications for multi-core workstations and Windows HPC Server 2008 clusters. In addition, PGI Visual Fortran is available with support for programming NIVDIA GPU accelerators using directive-based PGI Accelerator Fortran or CUDA Fortran language extensions.

"With this latest release of PVF, PGI Fortran compilers and tools for multi-core processors and GPUs are available through Visual Studio 2010 to the large base of scientists and engineers developing for Windows," said Douglas Miles, director, The Portland Group. "PVF's world-class performance and state-of-the art compiler technologies allow developers to leverage the wide array of new microprocessor and accelerator innovations coming out of Intel, AMD and NVIDIA together with the productivity advantages of Microsoft HPC technologies.""

PGI Visual Fortran is based on PGI's native OpenMP and auto-parallelizing compiler for the Fortran 95/2003 programming languages.

Key features include:

  • Native Fortran 95/2003 Visual Studio project system;
  • Fortran text editor extensions, intrinsic function tips and keyword completion;
  • Integrated PGI-custom Fortran-capable debugger for debugging of single-thread, multi-thread and OpenMP parallel applications and MSMPI parallel applications running locally and on clusters;
  • Microsoft Visual C++ interoperability and full support for Visual Studio 2010, Visual Studio 2008 and Visual Studio 2005; and
  • Optional support for the Visual Numerics IMSL Fortran Numerical Library and the Intel MKL Math Kernel Library.

In addition to comprehensive support for multi-core CPUs, PGI Visual Fortran also supports GPU accelerators. The PGI Accelerator programming model is a high-level implicit model similar to OpenMP for multi-core x64 systems. Using compiler directives, programmers can offload compute-intensive code regions from a host CPU to a GPU accelerator. Programs containing PGI Accelerator directives remain 100% standard-compliant and portable. PVF also supports CUDA Fortran language extensions. Defined in cooperation with NVIDIA, CUDA Fortran is an analog to the NVIDIA CUDA C compiler and gives expert programmers direct control of all aspects of GPGPU programming.

PVF includes PGI Unified Binary™ technology-the ability to generate a single executable file containing code sequences optimized for multiple processors from AMD, Intel and NVIDIA. PGI Unified Binary technology enables Independent Software Vendors (ISVs) and custom applications developers to take advantage of the latest processor innovations while treating x64 and x64+GPU as a single platform, maximizing flexibility and eliminating the need to target and optimize for separate processors.

Price and Availability
PVF is distributed by electronic download and is available today. A single user academic license with GPU support is $249 USD. Commercial licenses start at $599 and $899 with GPU support. PVF is also available in multi-user network floating license configurations. All PGI products include a 30 day money-back guarantee. A 15 day free trial of PVF along with complete product and pricing information is available from The Portland Group web site at www.pgroup.com/pvf.

All registered trademarks and marks are the property of their respective owners.

Media Contacts
Mike Markowitz
STMicroelectronics
[enable javascript]
(212) 821 8959

Gaussian 09 Software Ported to 64-bit Mac OS X Using PGI Compilers

Portland, Ore
May 20, 2010

Cross-platform development tools from The Portland Group speed migration of popular computational chemistry application

The Portland Group®, a wholly-owned subsidiary of STMicroelectronics and a leading supplier of compilers for high-performance computing (HPC), today announced that Gaussian, Inc, an ongoing collaboration of scientists and academic research groups throughout the world, used PGI compilers to port a version of its flagship Gaussian 09 product to Intel processor-based Macintosh computers running 64-bit versions of Mac OS X.

PGI develops and markets high-performance C/C++ and Fortran compilers and development tools that are widely used by engineers and scientists, and are designed to extract maximum performance from the latest multi-core processors from Intel and AMD. PGI Workstation™ compilers and tools enable building, debugging and profiling of 64-bit multi-core applications on the latest generation of Macs based on processors from Intel and running the Mac OS X operating system.

"A large percentage of academic and government HPC users rely on Macbooks as their mobile computing solution," said Douglas Miles, director, The Portland Group. "Now that Gaussian 09 is available on Mac OS X as well as on Linux- and Windows-based systems, these HPC developers can use and work on Gaussian 09 anywhere, any time using optimizing PGI Fortran and C/C++ compilers & tools. The potential jump in productivity for these users is huge."

Gaussian 09 is the latest in the Gaussian series of electronic structure programs. It is used by chemists, chemical engineers, biochemists, physicists and others for research in established and emerging areas of chemical interest. It can be used to study molecules and reactions under a wide range of conditions, including those that produce both stable species and compounds which are difficult or impossible to observe experimentally, such as short-lived intermediates and transition structures.

"Because of Gaussian's position as one of the preeminent electronic-structures software programs, over the years we have ported it to nearly every HPC platform using nearly every compiler and tool chain available," said Dr. Michael Frisch, president of Gaussian, Inc. "When we have a choice, we prefer PGI. We have found their dependability and performance to be consistently first rate."

PGI® compilers are recognized in the HPC community for delivering world-class performance across a wide spectrum of applications and benchmarks, and they are referenced regularly as the industry standard for performance and reliability. PGI compilers and tools are used widely in high-performance computing, the field of technical computing engaged in the modeling and simulation of complex processes, such as ocean modeling, weather forecasting, seismic analysis, bioinformatics and other areas.

A free 15-day evaluation of PGI Workstation compilers & tools for Mac OS X is available for download at www.pgroup.com/support/downloads.php. Registration is required.

All registered trademarks and marks are the property of their respective owners.

Media Contacts
Mike Markowitz
STMicroelectronics
[enable javascript]
(212) 821 8959

PGI Compilers Now Available on the Cray CX Line of Supercomputers

Portland, Ore
May 11, 2010

PGI optimizing compilers deliver uniform high-performance computing development environment across entire Cray product line

The Portland Group®, a wholly-owned subsidiary of STMicroelectronics and a leading supplier of compilers for high-performance computing (HPC), today announced an agreement under which Cray Inc will resell PGI® optimizing Fortran, C, and C++ compilers and development tools with the Cray CX1 line of deskside supercomputers and the recently announced Cray CX1000 line of rack mount supercomputers.

The PGI compilers and tools are used by scientists and engineers to create and run high-performance computing applications for complex modeling and simulations in weather forecasting, geophysical processing, aerodynamic simulation, structural analysis, automotive crash-testing, computational chemistry, and related fields. PGI compilers and tools support the latest 64-bit processors from AMD and Intel as well as CUDA-enabled GPU accelerators from NVIDIA running under both Linux and Microsoft Windows.

Cray has offered PGI compilers with its scalable supercomputers since 2004, and PGI compilers are installed on nearly all Cray XT systems worldwide. These same PGI compilers will now be available for purchase directly from Cray for use on its Intel® Xeon® processor-based Cray CX1 and CX1000 systems running either Linux or Windows HPC Server 2008. In addition, the PGI compilers for the Cray CX line will include the new PGI Accelerator features for NVIDIA GPUs, as well as the PGDBG® OpenMP/MPI graphical debugger and PGPROF® OpenMP/MPI graphical parallel performance profiling tools.

"Giving Cray's customers the flexibility and simplicity of purchasing our high-performance compilers directly from Cray for the Cray CX1 and Cray CX1000 systems is a big win for everyone," said Douglas Miles, director, The Portland Group. "Together we can offer a uniform PGI compiler environment on the Cray CX machines running Linux or Windows up through the high-end scalable Cray XT machines and provide significant ease of migration and application upscaling benefits for existing and future Cray users."

First introduced in September 2008, the Cray CX1 line of deskside supercomputers are ideal for individuals and departments who want to harness the compute power of an integrated HPC cluster. Equipped with state-of-the-art visualization and storage capabilities and optionally configured with NVIDIA GPUs, the Cray CX1 system delivers performance leadership across a broad range of applications in a compact, deskside system. The Cray CX1000 system is the latest addition to Cray's CX line of supercomputing systems, scaling up to 128 nodes in a rack-mount configuration and offering HPC users the same integrated software stack as the Cray CX1 system with a simplified, common operating environment for either Windows or Linux along with several system management packages.

"We are excited to offer our Cray CX1 and Cray CX1000 customers access to the same compiler technology enjoyed by our Cray XT customers," said Ian Miller, senior vice president of the productivity solutions group and marketing at Cray. "In addition, PGI's ability to deliver technologies that leverage the GPU capabilities in the Cray CX line make it extremely relevant as more customers explore the acceleration capabilities in NVIDIA Tesla-based systems."

The PGI compiler suite includes the PGFORTRAN™, PGCC®, and PGC++™ compilers for the Fortran 95/03, C, and C++ programming languages, respectively. PGI compilers and tools feature full native support for OpenMP parallel programming extensions in Fortran, C, and C++, full support for 64-bit addressing, native integrated scalar and vector SSE and AVX code generation, directive-based x64+GPU programming, CUDA Fortran extensions for NVIDIA GPUs, and a bundled version of the ACML 4.4 library of highly optimized numeric functions for mathematical, engineering, scientific, and financial applications. PGI compilers are highly optimized for AMD and Intel CPUs, and NVIDIA CUDA-enabled GPUs running Linux, Mac OS X and Windows. The PGDBG debugger and PGPROF performance profiler are MPI- and OpenMP-enabled, and available for cluster configurations.

All registered trademarks and marks are the property of their respective owners.

Media Contacts
Mike Markowitz
STMicroelectronics
[enable javascript]
(212) 821 8959

PGI Accelerator Compilers Enhanced for
NVIDIA GPUs Based on New Fermi Architecture

Portland, Ore
May 6, 2010

The Portland Group's CUDA Fortran and directive-based Fortran and C99 x64+GPU compilers now support the latest NVIDIA GPUs

The Portland Group®, a wholly-owned subsidiary of STMicroelectronics and a leading supplier of compilers for high-performance computing (HPC), today announced that its entire line of PGI Accelerator™ compiler products, including its new PGI 10.4 release, now support the latest NVIDIA graphics processing units (GPU) based on the Fermi architecture. The NVIDIA Tesla 20-series supports many new features for the HPC space as well as support for version 3.0 of the NVIDIA CUDA toolkit. NVIDIA CUDA-enabled GPUs are used to accelerate the performance of appropriate HPC applications beyond what is possible with the latest multi-core x64 host CPUs from Intel and AMD.

The latest version of PGI Accelerator compilers provide full support for CUDA Fortran on the latest NVIDIA GPU platforms and add support for allocatable device arrays within Fortran modules along with several API enhancements. CUDA Fortran, co-defined by NVIDIA and The Portland Group, is an extended version of the Fortran 2003 programming language that gives software developers direct control over all aspects of GPU programming. The PGI 10.4 release also enhances support for the PGI Accelerator directives-based programming model on Fermi platforms. The PGI Accelerator directives make GPU-software development easily approachable by application domain experts. Rather than porting and parallelizing entire programs or functions for the GPU, the PGI Accelerator directives allow incremental porting and parallelization of individual compute-intensive loops and code segments using standard-compliant and portable Fortran or C.

The PGI 10.4 release adds several ease-of-use features, including the use of PGI Unified Binary technology to build one version of an application that will run on any CUDA-enabled GPU. With PGI 10.4 compilers, programmers can automatically generate code that works and is optimized for both a Tesla C1060 GPU or the new Tesla C2050 GPU. In addition, they can take advantage of new GPU features including faster double-precision arithmetic, larger and configurable fast shared memory, and increased number of cores. Support for new NVIDIA GPU platforms in PGI 10.4 extends across Linux, Windows and MacOS, and within Microsoft Visual Studio via PGI Visual Fortran.

"With PGI 10.4, HPC users can create highly optimized heterogeneous multi-core applications for the latest CPUs from Intel and AMD in combination with the latest generation of GPUs from NVIDIA," said Douglas Miles, director The Portland Group. "Efficiently using all available host cores for certain parts of an application while accelerating other portions on GPUs is the key to squeezing maximum performance out of today's GPU-enabled workstations and cluster nodes. With Fermi's improvement in double-precision performance, we expect a big increase in the number and type of applications that benefit from GPU acceleration."

"A large part of the success of Tesla GPUs in the HPC space can be attributed to the quality of the development tools from NVIDIA and its partners," said Sanford Russell, general manager, GPU Computing at NVIDIA. "This announcement from PGI, building on the tools already in the market, is more evidence of the increasing momentum behind GPU computing in general and our CUDA architecture in particular."

The Portland Group compilers and tools for Fermi GPUs are part of the PGI 2010 release version 10.4 and are available now. A 15-day free trial is available from the PGI website. More detailed information about PGI compilers and tools is available online at www.pgroup.com

For more information on NVIDIA CUDA, please visit www.nvidia.com/cuda.

All registered trademarks and marks are the property of their respective owners.

Media Contacts
Mike Markowitz
STMicroelectronics
[enable javascript]
(212) 821 8959

PGI CUDA Fortran Now Available from The Portland Group

Portland, Ore
November 17, 2009

NVIDIA CUDA architecture now supported directly in the HPC industry-leading PGI Fortran compiler

The Portland Group®, a wholly-owned subsidiary of STMicroelectronics, today announced the general availability of its CUDA Fortran compiler for x64 and x86 processor-based systems running Linux, Mac OS X and Windows. Developed in collaboration with NVIDIA Corporation (Nasdaq: NVDA), the inventor of the graphics processing unit (GPU), PGI Release 2010 includes the first Fortran compiler compatible with the NVIDIA line of CUDA-enabled GPUs. A compiler is a software tool that translates applications from the high-level programming languages in which they are written by software developers into a binary form a computer can execute.

With developers taking advantage of the hundreds of cores and the relatively low cost of NVIDIA GPUs, programming to take advantage of the CUDA C compiler has become a popular means for accelerating the solution of complex computing problems. The PGI CUDA Fortran compiler is expected to accelerate GPU adoption even further in the High-Performance Computing (HPC) industry, where many important applications are written in Fortran. HPC is the field of technical computing engaged in the modeling and simulation of complex processes, such as ocean modeling, weather forecasting, environmental modeling, seismic analysis, bioinformatics and other areas.

"The GPU is ideally suited for the computationally intensive applications that define the HPC industry," said Andy Keane, general manager, Tesla business at NVIDIA. "The industry has been vocal in expressing its need for a CUDA-compatible Fortran option. NVIDIA customers can now build native CUDA Fortran applications using the widely-used and long-proven PGI Fortran compiler."

The CUDA Fortran compiler is compatible with all NVIDIA GPUs that include Compute Capability 1.3 or higher, which includes most NVIDIA Quadro Professional Graphics solutions and all NVIDIA Tesla GPU Computing solutions. Developers are invited to download the PGI CUDA Fortran compiler from The Portland Group website at www.pgroup.com/support/downloads.php. A 15-day trial license is available at no charge. In an effort to simplify adoption, NVIDIA has granted PGI rights to redistribute the relevant components of the CUDA Software Development Kit (SDK) as part of the PGI CUDA Fortran installation package.

PGI® products are used widely in high-performance computing (HPC). PGI compilers are recognized in the HPC community for delivering world-class performance across a wide spectrum of applications and benchmarks, and they are referenced regularly as the industry standard for performance and reliability. More information on the CUDA Fortran compiler is available directly from The Portland Group web site at www.pgroup.com/cudafortran.

GPU computing forums for news, discussion and programming tips are available at forums.nvidia.com and www.pgroup.com/userforum.

All registered trademarks and marks are the property of their respective owners.

Media Contacts
Mike Markowitz
STMicroelectronics
[enable javascript]
(212) 821 8959

PGI Release 2010 Enables Scientists and Engineers
to Harness the Computational Power of GPUs

Portland, Ore
November 12, 2009

Latest compilers from The Portland Group extend support for the PGI Accelerator Programming Model and PGI CUDA Fortran

The Portland Group®, a wholly-owned subsidiary of STMicroelectronics, today announced that release 2010 of the PGI® line of high-performance parallelizing compilers and development tools for Linux, Mac OS X and Windows will be available on Tuesday, November 17. PGI 2010 is the first general release to include full support for the PGI Accelerator Programming model v1.0 standard on x64 processor-based systems incorporating NVIDIA CUDA-enabled Graphical Processing Units (GPUs). In addition to supporting high-level programming of accelerators using the PGI Accelerator programming model, the PGI Release 2010 also includes PGI CUDA Fortran, an explicit GPU programming model and application programming interface (API) that gives expert programmers direct control of all aspects of programming NVIDIA GPUs.

The PGI Accelerator programming model is a collection of compiler directives used to specify regions of code in Fortran and C programs that can be offloaded from a host CPU to an attached accelerator to enhance performance. Applications optimized using the PGI Accelerator directives remain 100% portable to other compilers and platforms, and execute on systems with or without a GPU accelerator.

PGI 2010 offers full support for the PGI Accelerator programming model including the following new features:

  • GPU device-resident data—the ability to define and leave data on the GPU across accelerator regions and subroutine boundaries
  • Support for COMPLEX and DOUBLE COMPLEX data types in Fortran
  • Support for C structs and Fortran derived types
  • Automatic GPU-side loop unrolling for improved performance
  • Support for Accelerator regions nested within OpenMP parallel regions
  • Support for Linux, Mac OS X (including Snow Leopard) and Windows (including Windows 7).

"Within five years, most HPC systems will include both x86 CPUs and accelerators in some form," said Douglas Miles, director, The Portland Group. "The PGI 2010 compilers will play a role in establishing accelerated computing as the mainstream HPC architecture."

PGI CUDA Fortran includes a Fortran 95/03 compiler and tool chain for native programming of NVIDIA GPUs using Fortran. CUDA Fortran subroutines can launch and execute in parallel on the hundreds of cores in an NVIDIA GPU under control of an x64 host CPU. Developed in collaboration with NVIDIA, PGI CUDA Fortran extensions supported in the PGI 2010 Fortran 95/03 compiler enable HPC developers to explicitly control all aspects of data movement, memory utilization and computation on CUDA GPUs.

Additional new features in the PGI 2010 compilers and tools include support for more Fortran 2003 incremental features, the latest EDG 4.1 C++ front-end with enhanced GNU and Microsoft compatibility, OpenMP parallel programming support for up to 256 cores, and AVX code generation. PGI 2010 also includes a major update to the PGPROF performance profiler, which now supports performance profiling of binary executables without re-compiling or any special software privileges, uniform operation and features on Linux, Mac OS X and Windows, support for PGI Accelerator and PGI CUDA Fortran GPU-side performance statistics, and an updated graphical user interface. Finally, PGI 2010 supports the latest operating system releases including Red Hat Fedora 10/11, SuSE 11.1 and Ubuntu 9, Mac OS X Snow Leopard and Windows 7.

Enhancements in the PGI 2010 release of PGI Visual Fortran for Microsoft Visual Studio include full support for the PGI Accelerator Programming model and PGI CUDA Fortran on NVIDIA CUDA-enabled GPUs, and the addition of a new standalone version of the PGPROF performance profiler for x64 and GPUs with support for the Common Compiler Feedback Format (CCFF). CCFF is a draft standard published by PGI that defines what compiler information is stored and how the information is formatted. CCFF enables HPC tools providers to offer more and better information about optimizing performance.

More information about the PGI Accelerator Programming model is available online at www.pgroup.com/accelerate. PGI CUDA Fortran information is available separately at www.pgroup.com/cudafortran. Evaluation copies of the new PGI 2010 compilers are available from The Portland Group web site at www.pgroup.com. Registration is required.

All registered trademarks and marks are the property of their respective owners.

Media Contacts
Mike Markowitz
STMicroelectronics
[enable javascript]
(212) 821 8959

PGI Visual Fortran 9.0 Adds Support for MPI Debugging on
Windows HPC Server 2008 Clusters

Portland, Ore
August 8, 2009

Latest version allows users to build, launch and debug from within Microsoft Visual Studio

The Portland Group®, a wholly-owned subsidiary of STMicroelectronics and leading supplier of compilers for high-performance computing (HPC), today announced the general availability of PGI Visual Fortran® (PVF®) Release 9.0 for Windows workstations, servers and clusters. PVF 9.0 is the first general release to include support for the building, launching and debugging of Microsoft MPI (MSMPI) Fortran applications from within the Microsoft Visual Studio integrated development environment.

PVF augments the Visual Studio debugger by adding a Fortran language specific custom debug engine. The PVF debug engine supports debugging of single and multi-thread, OpenMP, multi-thread MSMPI and hybrid MSMPI+OpenMP Fortran applications. It enables debugging of 64-bit or 32-bit applications using source code or assembly code, and provides full access to the registers and hardware state of the processors. Other new multi-process MSMPI capabilities in PVF 9.0 include Visual Studio property pages for configuring compile-time options, launching applications either locally on a workstation or on a distributed-memory Windows HPC Server 2008 cluster system, and debugging of programs running either locally or on a cluster.

"PVF 9.0 is a big step forward in ease-of-use for HPC Fortran programmers porting applications to or developing applications for Windows workstations, servers and clusters," said Douglas Miles, director, The Portland Group. "For Windows Fortran users looking to leverage the power of Windows HPC Server 2008 clusters, the ability to cover all aspects of MPI and parallel Fortran application development from within the Microsoft Visual Studio IDE can simplify their work considerably."

"The majority of HPC applications are still written in Fortran and parallelized using MPI and OpenMP," said Vince Mendillo, Director, Technical Computing Marketing, Microsoft. "By including MSMPI job launch and debugging support within PGI Visual Fortran, PGI has further enhanced the Windows HPC Server 2008 ecosystem and simplified porting of HPC applications to Windows clusters."

Additional new features in PVF 9.0 include support for Intel Core i7 (Nehalem) and six-core AMD Opteron (Istanbul) processors, several incremental Fortran 2003 features, improvements in serial debugging and disassembly speed, and completely updated documentation and online help. PGI Visual Fortran is compatible with both the current version of Visual Studio, Visual Studio 2008, and the previous version, Visual Studio 2005.

PGI Release 9.0 is the first general release to include support for the high-level PGI Accelerator™ programming model on x64 processor-based Linux systems incorporating NVIDIA CUDA-enabled GPUs. Announced last June, the PGI Release 9.0 line of high-performance parallelizing compilers and development tools for Linux, Mac OS X and Windows is now available.

Further information on The Portland Group products, including PGI® Release 9.0, can be found at www.pgroup.com, by calling Sales at (503) 682-2806, or by email to sales@pgroup.com.

All registered trademarks and marks are the property of their respective owners.

Media Contacts
Mike Markowitz
STMicroelectronics
[enable javascript]
(212) 821 8959

PGI and NVIDIA Team To Deliver CUDA Fortran Compiler

Portland, Ore
June 23, 2009

Fortran support for NVIDIA CUDA GPUs to be incorporated into a new version of the PGI Fortran compiler

The Portland Group®, a wholly-owned subsidiary of STMicroelectronics and leading supplier of compilers for high-performance computing (HPC), today announced an agreement with NVIDIA under which the two companies plan to develop new Fortran language support for CUDA GPUs.

The NVIDIA® CUDA™ architecture allows developers to offload computationally intensive kernels to the massively parallel GPU. Through function calls and language extensions, CUDA gives developers explicit control over the mapping of general-purpose computational kernels to GPUs as well as placement and movement of data between the x64 processor and the GPU. The NVIDIA CUDA C compiler already provides this capability to C programmers. The CUDA Fortran compiler will provide this same level of control and optimization in a native Fortran environment from PGI.

"Fortran support for CUDA GPUs is a perfect complement to our existing roadmap for the PGI Accelerator Fortran and C compilers," said Douglas Miles, director, The Portland Group. "It enables interoperability of PGI Fortran and CUDA C and gives PGI users a full range of options in porting and optimizing Fortran applications to leverage the power of CUDA-enabled NVIDIA GPUs."

"The GPU computing developer community has made it clear there is a need and demand for a production-quality Fortran solution on the GPU," said Andy Keane, general manager, Tesla GPU Computing Solutions, NVIDIA. "With their large base of Fortran developers for x64 processor-based HPC systems, PGI provides a perfect bridge for migration of production science and engineering codes from existing platforms to NVIDIA Tesla GPUs."

The Portland Group and NVIDIA will release the Fortran language specification for CUDA GPUs at the International Conference on Supercomputing in Hamburg, Germany this week. The CUDA Fortran compiler will be added to a production release of the PGI Fortran compilers scheduled for availability in November 2009. More detailed information about PGI compilers and tools is available online at www.pgroup.com.

For more information on NVIDIA CUDA, please visit www.nvidia.com/cuda.

All registered trademarks and marks are the property of their respective owners.

Media Contacts
Mike Markowitz
STMicroelectronics
[enable javascript]
(212) 821 8959

Click me