Page 1 of 2 1 2 LastLast
Results 1 to 10 of 17
  1. #1
    ROG Member Array
    Join Date
    Apr 2017
    Reputation
    10
    Posts
    16

    Unhappy Help! System hangs/freezes when overclocked

    Hi,

    I recently got the following configuration setup:

    CPU - i9-7980XE
    RAM - 8x16GB = 128GB of RAM - G.Skill F4-3200C14Q2-128GTZS (https://www.gskill.com/en/product/f4-3200c14q2-128gtzsw)
    MOTHERBOARD - ASUS Rog Rampage VI Extreme
    BIOS 1004


    I am currently trying to diagnose system freezes/hangs/crash that seems to occur after several hours of uptime (tested on Windows 10 or other OS). I am not able to diagnose exactly where the issue is coming from, but from the BSOD dumps it appears this is memory related. I've tried both the CPU settings set to "Auto" and to Manually set a Multiplier and Core Voltage in the BIOS, but still the same issue occurs. The only thing I can think of now is XMP.

    I am trying to narrow down where the issue could be coming from, so any suggestions would be greatly appreciated.

    After running 24h of memtest86+ 5.0.1 (which did one and a half pass), with XMP enabled, no issues were detected.
    After running 5-13 hours of Prime95 the system would eventually freeze or crash at some point, and the BSOD dumps are of type MEMORY_CORRUPTION_LARGE and MEMORY_CORRUPTION_ONE_BIT_LARGE.


    The BSOD information states that the issue is memory-related, which makes me think that the Voltage set by default when XMP is enabled is incorrect. When enabling XMP in the BIOS, it sets the Voltage to 1.350V which is indeed the correct value that the RAM is supposed to get at 3200Mhz. However the following happens:



    As you can see there are two issues:

    1. The Voltage reading is incorrect and too high: it reads 1.360V instead of 1.350 --> Why? Is this normal?
    2. The second DRAM Voltage value flickers between 1.344V and 1.360V --> Any clue about what's happening there?


    I have tried to swap all the RAM from CHC&CHD DIMM slots to CHA&CHB DIMM slots, but the same happens, the CHC&CHD DRAM Voltage fluctuates... Could it be related to my issue?

    As for now, I have decided to bring the XMP settings back to disabled and run Prime95 again for the next 24h (unless the system freezes again) to double check that it indeed comes from the XMP settings and no anything else. I will report back here.

    If anyone has any idea about what I should check/set to resolve the issue, I would really appreciate it. Been trying to resolve this for a week now.

    Thank you!
    Last edited by Thireus; 01-16-2018 at 03:40 AM.

  2. #2
    Tech Marketing Manager HQ Array Raja@ASUS's Avatar
    Join Date
    Apr 2011
    Reputation
    161
    Posts
    7,454

    What you are seeing is normal. The reported value is tapped off the power plane, so isn't going to be exactly indicative of the voltage at the dram module pads. And the SIO reading does fluctuate due to the way it samples. This is not the cause of instability.

    Establish whether your system is stable at stock settings. Load defaults, do not overclock the cpu or memory, and check if it still freezes.

  3. #3
    ROG Guru: Yellow Belt Array xarot PC Specs
    xarot PC Specs
    Laptop (Model)Asus GL504GM
    MotherboardAsus Rampage VI Extreme
    Processori9-7980XE
    Memory (part number)Corsair Dominator Platinum 64GB CL16-18-18
    Graphics Card #1Palit RTX2080TI
    Sound CardIntegrated
    MonitorAsus PG279Q
    Storage #1Intel 750 1.2 TB PCIe
    Storage #2Samsung 960 Pro 1 TB
    CPU CoolerEK Supremacy EVO Full Nickel
    CasePhanteks Enthoo Primo
    Power SupplyCorsair AX1500i
    Keyboard Blackstorm Mech RGB
    Mouse Steelseries Rival 310
    Headset Sennheiser HD555
    Mouse Pad Razer Goliathus Extended
    OS Windows 10 x64
    xarot's Avatar
    Join Date
    Oct 2013
    Reputation
    44
    Posts
    181

    If you want to test RAM, try this new RAM test software instead of Memtest86+. It can detect memory-related errors very fast. Try to reach at least 5000 % coverage. It detected errors in 2 minutes when I tried to get my Corsair Dominator 4000 kit to run on the Rampage VI Extreme using XMP (it couldn't run). :P

    https://www.karhusoftware.com/ramtest/

    What version of Prime95 and are you overclocking? Using AVX on or off? Also, tried to set VCCSA a tad higher? Say 0.95 - 1.0 V?
    Gaming: i9-7980XE H2O - Rampage VI Extreme - 64 GB Corsair Dominator Platinum DDR4-3200 - Palit RTX2080TI H2O - Phanteks Enthoo Primo - Corsair AX1500i - Intel 750 1.2 TB - Samsung 960 PRO 1 TB
    HTPC: i7-6950X - X99-M WS - 32 GB G.Skill RipjawsV DDR4-2400 - GTX1070 - Bitfenix Pandora - Corsair AX860 - Intel 750 400 GB + Samsung 1 TB 850 EVO
    All around: i7-9900X H2O - Rampage VI Extreme - 64 GB G.Skill 4000 CL18-19-19-39 - Strix RTX2060 - In Win 101C - Samsung 960 EVO 512 GB

  4. #4
    ROG Member Array
    Join Date
    Apr 2017
    Reputation
    10
    Posts
    16

    Quote Originally Posted by Raja@ASUS View Post
    What you are seeing is normal. The reported value is tapped off the power plane, so isn't going to be exactly indicative of the voltage at the dram module pads. And the SIO reading does fluctuate due to the way it samples. This is not the cause of instability.

    Establish whether your system is stable at stock settings. Load defaults, do not overclock the cpu or memory, and check if it still freezes.
    Thank you Raja for your quick reply. So far I have now conducted the following testing:

    - Various CPU tweaking with XMP enabled:

    Prime95 crash after 4 hours, system was never stable past 13h of uptime (that's been happening for a week now, when I first enabled XMP)

    - Everything back to Auto with XMP enabled:

    Prime95 crash after 4 hours

    - Everything Auto without XMP enabled:

    Prime95 still running as I write this message, for more than 12 hours now... So it seems to be all good without XMP.

    Which seems to confirm that the issue is XMP related. I'm also going to follow xarot's advise and try to reproduce crashes with specific RAM testing tools when XMP is enabled and compare to when XMP is disabled. The currently issue I'm facing for troubleshooting this is that it takes several hours for the system to either crash or freeze with the Prime95 testing methodology.

    I will also attempt to reproduce the issue with CMOS cleared and everything back on stock speed. But that will require another 2 days of testing at least.
    Last edited by Thireus; 01-16-2018 at 02:29 PM.

  5. #5
    Tech Marketing Manager HQ Array Raja@ASUS's Avatar
    Join Date
    Apr 2011
    Reputation
    161
    Posts
    7,454

    Yep, isolate the memory, and tune things from there.

  6. #6
    ROG Member Array
    Join Date
    Apr 2017
    Reputation
    10
    Posts
    16

    Quote Originally Posted by Raja@ASUS View Post
    Yep, isolate the memory, and tune things from there.
    Hi Raja,

    I would like to give a quick update about my troubleshooting progress...

    I think I was able to make the system freeze occur much sooner than before by using a combination of fast Prime95 blend test and memtest, here is what I did:

    - Customise Prime95 26.6 with "Time to run each FFT size (in minutes):" to a value of "1" instead of "15", so that all tests run much faster
    - Run one instance of memtest


    The following happened:

    - With XMP enabled I had a system freeze after about 1h30 running
    - Without XMP enabled, I had no system freeze at all after more than 15h as previously discussed


    I thus decided it was time to reset my BIOS back to a clean 1004. So I re-flashed my 1004 version with the same version, which cleared all the settings for me. Before I did that, I made sure to save my XMP-enabled settings that were causing freezes. I then ONLY enabled XMP with this new cleared BIOS (everything else to stock).

    These are the non-stock settings when XMP enabled would cause the system to freeze after ~1h30 of custom Prime95 in combination with memtest:

    AVX Instruction Core Ratio Negative Offset [15]
    AVX-512 Instruction Core Ratio Negative Offset [15]
    CPU SVID Support [Disabled]
    CPU Load-line Calibration [Level 3]
    CPU Current Capability [140%]
    CPU Power Phase Control [Extreme]
    Autonomous Core C-State [Enabled]
    Enhanced Halt State (C1E) [Enabled]
    CPU C6 report [Enabled]
    Package C State [C6(non Retention) state]
    Intel(R) Speed Shift Technology [Enabled]
    MFC Mode Override [OS Native Support]
    Fast Boot [Disabled]
    Boot from Network Devices [Ignore]


    After re-flashing the BIOS and ONLY enabling XMP, the above previous settings are the following:

    AVX Instruction Core Ratio Negative Offset [Auto]
    AVX-512 Instruction Core Ratio Negative Offset [Auto]
    CPU SVID Support [Auto]
    CPU Load-line Calibration [Auto]
    CPU Current Capability [Auto]
    CPU Power Phase Control [Auto]
    Autonomous Core C-State [Auto]
    Intel(R) Speed Shift Technology [Auto]
    MFC Mode Override [MFC Driver Override]
    Fast Boot [Enabled]
    Next Boot after AC Power Loss [Normal Boot]
    Boot from Network Devices [Legacy only]


    Now on these stock+XMP-enabled settings my stress tests have been running for about 2 hours (with the Prime95 settings I mention above and 2 instances of memtest running) and so far no freezes or crashes…

    I will let things run like this for several hours and see if the issue is resolved. But I would still like to be able to reproduce the behaviour I noticed on the non-default BIOS options I initially had that were causing XMP not to function properly.

    Could you please have a look at the above setting differences and tell me if you think there is something incompatible with XMP that I had enabled that could be the cause of the freezes? So I can try to reproduce it and will know that I should avoid this setting for future reference.

    Thank you!

    P.S.: I also found a bug, when you re-flash the BIOS on the same 1004 version, after the first boot the system will ask to press F1, once the settings are saved the system will reboot and when selecting (F8) a UEFI drive for the first time on these new settings the system would immediately crash and reboot. This happens on the first boot only, then on next reboots UEFI is working properly. I think I've seen another member on the forum reporting the same behaviour. Also, do you know where I could post this kind of bug report?

  7. #7
    ROG Enthusiast Array Nixon2992 PC Specs
    Nixon2992 PC Specs
    MotherboardASUS ROG STRIX X99 GAMING(UEFI:1504)
    ProcessorIntel Core i7 5820K
    Memory (part number)Corsair Vengeance LPX [CMK16GX4M4A2133C15]
    Graphics Card #1ASUS Strix 1070 OC
    Graphics Card #2ASUS GeForce GT 710 Silent LP [710-2-SL]
    MonitorAcer CB290C
    Storage #1Seagate BarraCuda ST2000DM006
    Storage #2WD Black [WDS512G1X0C]
    CPU CoolerNoctua NH-D15S
    CaseFractal Design Define R5 Blackout Edition
    Power SupplyCorsair RMi 1000W [CP-9020084-EU]
    OS Windows 10 Home for single language 1704
    Network RouterASUS RT N66U
    Nixon2992's Avatar
    Join Date
    Dec 2016
    Reputation
    10
    Posts
    69

    You try change DRAM Phase control to EXTREME for A/B & C/D chanels and reduce DRAM switching frequncy to minimum value in Digi+ VRM.This action reduce fluctuations votage.
    You system may freeze if undevoltage VCCIO rail.
    Miniatura de Adjuntos Miniatura de Adjuntos Digi%2B Power.jpg  


  8. #8
    ROG Member Array
    Join Date
    Apr 2017
    Reputation
    10
    Posts
    16

    Thank you Nixon2992, I think you're right, there is something about the voltage that isn't set properly. Once I isolate the setting that causes instability I will try what you suggest if it's still relevant.

    After countless hours of troubleshooting the issue to isolate which settings are incompatible with XMP, I have successfully isolated a bunch of settings that are responsible for the system freezes:

    STOCK +
    XMP Enabled
    Autonomous Core C-State [Enabled]


    These settings will cause the machine to freeze after several hours. With my previous Prime95+memtest stress test methodology I am able to make the system freeze after about 1h and 30 minutes.

    I have confirmed that with Autonomous Core C-State set to STOCK default, there is no freeze happening when XMP is enabled.

    I'm still trying to narrow it down even further, but if anyone has an idea why these settings are incompatible please let me know.

    Is anyone at Asus trying to reproduce the above? Or am I doing all the debugging work?

    Edit: Was able to narrow it even further. LLC set to Auto (default) doesn't resolve freezes.

    Edit2: CPU Power Phase Control [Extreme] is not what's causing the system freezes, I have set that one to default.
    Last edited by Thireus; 01-17-2018 at 07:54 PM.

  9. #9
    ROG Member Array
    Join Date
    Apr 2017
    Reputation
    10
    Posts
    16

    I finally have the answer to the freezing issues I've been having! And I’m quite confident this is an issue with the Asus Rampage VI Extreme motherboard or maybe with the BIOS!

    On the Asus Rampage VI Extreme, enabling both XMP and Core C-State will cause the system to freeze or crash after a few hours. When on stock settings, these two options are incompatible, at least on BIOS 1004. Therefore, I would like to request Asus to have a look at this technical issue, which clearly appears to me to be a board instability.

    The following BIOS setting is responsible for the system freezes when XMP is enabled:

    Autonomous Core C-State [Enabled]

    All other settings (except XMP and the above) should be on stock!

    To reproduce the issue and cause system freezes that occur as soon as possible after system boot on Windows 10, one can run Prime95 v26.6 with the following Custom Torture Test settings:

    Min FFT: 8
    Max FFTP: 4096
    Run FFTs in-place: Checked
    Time to run each FFT size (in minutes): 1


    Also, running 2 instances of memtest from HCI Design: http://hcidesign.com/memtest/, with the maximum available memory might help trigger the freeze when running Prime95 in parallel. Although, when freezes occur memtest should not find any memory errors! (unless you have bad memory, which is not my case as I've already tested my memory)

    Anyway, with the above C-State enabled, XMP enabled, and everything else on stock, the system should freeze after several hours of stress test, which should freezes after a maximum of 2h elapsed on the stress test which should be equivalent to 230% of memtest coverage. However, I have noticed that with other options enabled, such as the ones I mention in my first post, these freezzes can instead be system crashes (BSOD) in which case the Windows minidump after BSOD will be mentioned that this is memory related, sometimes it will be MEMORY_CORRUPTION_LARGE, sometimes MEMORY_CORRUPTION_ONE_BIT_LARGE. Crashes/Freezes can also occur much sooner too! If you wish to test your system stability with XMP and C-States enabled, I would recommend to please make sure to have everything on stock settings before enabling both XMP and Autonomous Core C-State only in the BIOS, that way you’ll only get system freezes!

    As for now, I will leave "Autonomous Core C-State" on Auto (which I assume is in reality disabled). I will also write Asus an email about the issue, and hopefully their technical team will be able to fix this bug in a future BIOS release.

    I was also curious to know if others had similar issues and decided to Google about it, and I found people who reported in the past that enabling C-States was incompatible with XMP and would cause their system to crash after several hours, which happens to be the issue I have with the Rog Rampage VI Extreme: https://vip.asus.com/forum/view.aspx...Language=en-us

    Does anyone understand why, when enabling C-States on my config and with XMP enabled, the system would freeze after several hours? (When XMP is disabled C-States are perfectly working fine)

    ---------

    @Raja, do you know if this is something that can be looked at by the technical team? That appears to be a bug to me, more than just incompatibility. Or at least the BIOS should prevent the use of C-States (or maybe these specific settings) when XMP is enabled.

    Also, I would like to add that from what I can remember from my previous build which involved an x99 Prime Deluxe II with an i7-6950X processor I didn’t have any problems having C-States and XMP work together!

  10. #10
    ROG Member Array
    Join Date
    Apr 2017
    Reputation
    10
    Posts
    16

    I would like to add that people are having similar freezing issues on the following threads:

    https://rog.asus.com/forum/showthrea...eeze-issue-fix
    https://rog.asus.com/forum/showthrea...omputer-freeze!

    They might not all be related, but there is a good chance some of their issues are linked to what I found and detail in my previous posts.
    Last edited by Thireus; 01-18-2018 at 12:15 PM.

Page 1 of 2 1 2 LastLast

Tags for this Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •