Results 1 to 2 of 2
  1. #1
    ROG Guru: Green Belt Array restsugavan PC Specs
    restsugavan PC Specs
    MotherboardRAMPAGE VI EXTREME
    ProcessorIntel Core i9 7980XE 2.6GHz
    Memory (part number)64GB GSKILL 3.2 GHz 16-16-18-36 2T
    Graphics Card #1ASUS ROG POSEIDON 1080Ti
    MonitorSAMSUNG UD590D
    Storage #1SAMSUNG 970 EVO PLUS 1TB x 3
    Storage #2WD 4TB External HDD
    CPU CoolerNZXT KRAKEN 62
    CaseCorsair Carbine 540
    Power SupplyCorsair AX1200i
    Keyboard Microsoft
    Mouse Microsoft
    Mouse Pad ASUS
    Headset/Speakers Sony
    OS Windows 10 2021 Insider Preview 20257.1005
    Network RouterD-LINK
    restsugavan's Avatar
    Join Date
    Sep 2017
    Reputation
    51
    Posts
    598

    Lightbulb 3 Video Link Unlock Skylake X full performace via AVX512

    There are 3 years after Skylake X SKUs release but look like people couldn't use its microarchitect full potential.
    I'd see the most interesting video like about AVX512 below.

    https://www.youtube.com/watch?v=D-mM6X5xnTY
    https://www.youtube.com/watch?v=I3efQKLgsjM
    https://www.youtube.com/watch?v=543a1b-cPmU

    Hope everyone find the path to full Skylake X SKUs unlock performance. It's also can be apply with Icelake X Tigerlake Rocket lake Alderlake and
    new wave of Intel AVX512 support CPUs.
    Core i9 7980XE / R6E 3201 BIOS / Intel 02006A08 MCU 2020-06-16 / VROC VMD 7.5.0.1030 / RST 18.0.0.4897 / ME FW 11.12.80.1734 DV 2040.100.0.1029 / 64GB GSKILL @3.2GHz 16-18-18-38 2T / POSEIDON 1080i GTX + NVIDIA Driver 465.12 WHQL / / 3 x 1TB SAMSUNG 970 EVO PLUS + OMS Driver 18.30.3.1148 WHQL / Microsoft Windows 10 Version 2004 Build 20262.1010 / i219 V PHY UNDI OROM / Boot Agent 0.0.29 / 0.1.16 / 12.19.0.16 WHQL AQC 107n FW 4.2.32 + AQC 2.2.1.0 WHQL. CAM 4.16.0 Realtek HD 6.0.9066.1 WHQL

  2. #2
    New ROGer Array
    Join Date
    Feb 2019
    Reputation
    16
    Posts
    87

    AVX-512 is the key to performance

    Quote Originally Posted by restsugavan View Post
    There are 3 years after Skylake X SKUs release but look like people couldn't use its microarchitect full potential.
    I'd see the most interesting video like about AVX512 below.

    https://www.youtube.com/watch?v=D-mM6X5xnTY
    https://www.youtube.com/watch?v=I3efQKLgsjM
    https://www.youtube.com/watch?v=543a1b-cPmU

    Hope everyone find the path to full Skylake X SKUs unlock performance. It's also can be apply with Icelake X Tigerlake Rocket lake Alderlake and
    new wave of Intel AVX512 support CPUs.
    There are a lot of misconceptions about AVX-512 and it has an undeservedly bad reputation (Linus Torvald's comments did not help either).
    Some, so call experts, are even saying that there is only a dozen or so people in the world that can use AVX-512 properly to extract performance out of thee intel CPUs.

    The truth is that you actually need real programming skills to vectorize you code but the rewards are amazing!
    Speed ups of 10x is not uncommon.
    BUT it requires you to actually know how to design and build your systems vectorized!
    You cannot just take your normal sequential code that you write in some high-level language and hope that somehow the compiler will find a way to compile into efficient AVX-512 binary. This is not going to happen!

    You have to design you system for a SIMD target platform in mind from the very beginning and vectorize algorithms.
    This requires linear algebra and multivariable calculus understanding, making the threshold high.

    BUT again the payback is enormous if you find a away to write efficient code that use AVX-512. I've seen speed ups in 100x for certain workloads, especially when you can organize the problem and the data into large matrices and vectors.

    Matrix to matrix multiplication is one example of this. The bottle next is no longer the CPU (AVX-512 just plows through these workloads and a 10980XE have 2! FMAs per core that's right 2 effectively you have 36 FMAs cores in a 10980XE not 18!) but the memory subsystem. Basically the memory subsystem cannot feed the CPU fast enough, requiring you to align data for optimal reading and writing (with optimal cache access etc.)


    In my view, AVX-512 is an absolute pleasure to work with! It is so incredible powerful and rewarding! It's not an obscurity BUT it requires the Developer to design and build vectorized code not hope the complier will somehow do that for you!

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •