| 000 | 04625nam a2200421 i 4500 | ||
|---|---|---|---|
| 999 |
_c10170 _d10170 |
||
| 001 | 899003685 | ||
| 005 | 20190930112847.0 | ||
| 006 | m o d | ||
| 007 | cr mn||||||||| | ||
| 008 | 141201t20152015maua o 001 0 eng d | ||
| 020 | _a9780128021187 | ||
| 020 | _a9780128021996 (ebook) | ||
| 040 |
_aCaPaEBR _beng _erda _epn _cCaPaEBR _dE7B |
||
| 049 | _aOSUU | ||
| 050 | 1 | 4 |
_aQA76.642 _b.R456 2015eb |
| 082 | 0 | 4 |
_222 _a004.35 _bR.J.H |
| 100 | 1 |
_aReinders, James, _eauthor. _933254 |
|
| 245 | 1 | 0 |
_aHigh performance parallelism pearls : _bmulticore and many-core programming approaches / _cJames Reinders, Jim Jeffers. |
| 264 | 1 |
_aWaltham, Massachusetts : _bMorgan Kaufmann, _c2015. |
|
| 264 | 4 | _c©2015. | |
| 300 |
_axlv, 502 pages : _billustrations ; _c24 cm. |
||
| 336 |
_atext _btxt _2rdacontent. |
||
| 337 |
_acomputer _bc _2rdamedia. |
||
| 338 |
_aonline resource _bcr _2rdacarrier. |
||
| 500 | _aElectronic reproduction. Palo Alto, Calif. : ebrary, 2014. Available via World Wide Web. Access may be limited to ebrary affiliated libraries. | ||
| 500 | _acomputer bookfair2016 | ||
| 500 | _aIncludes indexes. | ||
| 500 | _aPreviously issued in print: 2014. | ||
| 505 | 8 | _aFront Cover; High Performance Parallelism Pearls: Multicore and Many-core Programming Approaches; Copyright; Contents; Contributors; Acknowledgments; Foreword; Humongous computing needs: Science years in the making; Open standards; Keen on many-core architecture; Xeon Phi is born: Many cores, excellent vector ISA; Learn highly scalable parallel programming; Future demands grow: Programming models matter; Preface; Inspired by 61 cores: A new era in programming; Chapter 1: Introduction; Learning from successful experiences; Code modernization; Modernize with concurrent algorithms. Modernize with vectorization and data locality; Understanding power usage; ISPC and OpenCL anyone?; Intel Xeon Phi coprocessor specific; Many-core, neo-heterogeneous; No "Xeon Phi" in the title, neo-heterogeneous programming; The future of many-core; Downloads; Chapter 2: From "Correct" to "Correct & Efficient": A Hydro2D Case Study with Godunov's Scheme; Scientific computing on contemporary computers; Modern computing environments; CEA's Hydro2D; A numerical method for shock hydrodynamics; Euler's equation; Godunov's method; Where it fits; Features of modern architectures. Performance-oriented architecture; Programming tools and runtimes; Our computing environments; Paths to performance; Running Hydro2D; Hydro2D's structure; Computation scheme; Data structures; Measuring performance; Optimizations; Memory usage; Thread-level parallelism; Arithmetic efficiency and instruction-level parallelism; Data-level parallelism; Summary; The coprocessor vs the processor; A rising tide lifts all boats; Performance strategies; Chapter 3: Better Concurrency and SIMD on HBM; The application: HIROMB-BOOS-Model; Key usage: DMI; HBM execution profile.
Overview for the optimization of HBM; Data structures: Locality done right; Thread parallelism in HBM; Data parallelism: SIMD vectorization; Trivial obstacles; Premature abstraction is the root of all evil; Results; Profiling details; Scaling on processor vs. coprocessor; Contiguous attribute; Summary; References; Chapter 4: Optimizing for Reacting Navier-Stokes Equations; Getting started; Version 1.0: Baseline; Version 2.0: ThreadBox; Version 3.0: Stack memory; Version 4.0: Blocking; Version 5.0: Vectorization; Intel Xeon Phi coprocessor results; Summary. Chapter 5: Plesiochronous Phasing Barriers; What can be done to improve the code?; What more can be done to improve the code?; Hyper-Thread Phalanx; What is nonoptimal about this strategy?; Coding the Hyper-Thread Phalanx; How to determine thread binding to core and HT within core?; The Hyper-Thread Phalanx hand-partitioning technique; A lesson learned; Back to work; Data alignment; Use aligned data when possible; Redundancy can be good for you; The plesiochronous phasing barrier; Let us do something to recover this wasted time; A few "left to the reader" possibilities. | |
| 650 | 0 | _aCoprocessors. | |
| 650 | 0 |
_aParallel programming (Computer science) _xData processing. |
|
| 655 | 0 | _aElectronic books. | |
| 700 |
_aJeffers, Jim, _eauthor. _933559 |
||
| 710 | 2 | _aebrary, Inc. | |
| 776 | 0 | 8 |
_iPrint version: _aReinders, James. _tHigh performance parallelism pearls : multicore and many-core programming approaches. _dWaltham, Massachusetts : Morgan Kaufmann, c2015 _hxlv, 502 pages _z9780128021187 _w2014951989. |
| 856 |
_3Abstract _uhttp://repository.fue.edu.eg/xmlui/handle/123456789/3648 |
||
| 942 |
_2ddc _cBK |
||