Skocz do zawartości
j1gg

Jak zdiagnozować czy z dedykiem jest wszystko ok

Polecane posty

Witam, od dwóch dni mam problem z dedykiem, moje serwery csgo się crashuja. Sprawdzałem na wszystkie możliwe sposoby, nawet czysty serwer pada jak mucha. Jak mogę sprawdzić czy ze sprzętem jest wszystko ok? Robiłem coś takiego memtester 1024 5 i badblocks -sv /dev/sda niby 0 błędów.

Przed aktualizacją kernela miałem taki błąd linux-gate.so + 0x435

Teraz coś takiego linux-gate.so + 0xd59

Udostępnij ten post


Link to postu
Udostępnij na innych stronach

smartem wyciągnij dane to coś się może dowiesz

odpal atop albo sysstat z interwałem 1 minuta to zobaczysz co się dzieje przed crashem

 

 

Udostępnij ten post


Link to postu
Udostępnij na innych stronach

smartctl 6.4 2014-10-07 r4002 [x86_64-linux-4.6.0-0.bpo.1-amd64] (local build)

Copyright © 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org

 

=== START OF INFORMATION SECTION ===

Model Family: Seagate Barracuda 7200.12

Device Model: ST31000524AS

Serial Number: 5VPCTDPZ

LU WWN Device Id: 5 000c50 05c7e45e3

Firmware Version: JC45

User Capacity: 1,000,204,886,016 bytes [1,00 TB]

Sector Size: 512 bytes logical/physical

Rotation Rate: 7200 rpm

Device is: In smartctl database [for details use: -P show]

ATA Version is: ATA8-ACS T13/1699-D revision 4

SATA Version is: SATA 3.0, 6.0 Gb/s (current: 3.0 Gb/s)

Local Time is: Sun Aug 14 14:10:05 2016 CEST

SMART support is: Available - device has SMART capability.

SMART support is: Enabled

 

=== START OF READ SMART DATA SECTION ===

SMART overall-health self-assessment test result: PASSED

See vendor-specific Attribute list for marginal Attributes.

 

General SMART Values:

Offline data collection status: (0x82) Offline data collection activity

was completed without error.

Auto Offline Data Collection: Enabled.

Self-test execution status: ( 41) The self-test routine was interrupted

by the host with a hard or soft reset.

Total time to complete Offline

data collection: ( 609) seconds.

Offline data collection

capabilities: (0x7b) SMART execute Offline immediate.

Auto Offline data collection on/off support.

Suspend Offline collection upon new

command.

Offline surface scan supported.

Self-test supported.

Conveyance Self-test supported.

Selective Self-test supported.

SMART capabilities: (0x0003) Saves SMART data before entering

power-saving mode.

Supports SMART auto save timer.

Error logging capability: (0x01) Error logging supported.

General Purpose Logging supported.

Short self-test routine

recommended polling time: ( 1) minutes.

Extended self-test routine

recommended polling time: ( 176) minutes.

Conveyance self-test routine

recommended polling time: ( 2) minutes.

SCT capabilities: (0x103f) SCT Status supported.

SCT Error Recovery Control supported.

SCT Feature Control supported.

SCT Data Table supported.

 

SMART Attributes Data Structure revision number: 10

Vendor Specific SMART Attributes with Thresholds:

ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE

1 Raw_Read_Error_Rate 0x000f 118 094 006 Pre-fail Always - 188037260

3 Spin_Up_Time 0x0003 100 100 000 Pre-fail Always - 0

4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 18

5 Reallocated_Sector_Ct 0x0033 095 095 036 Pre-fail Always - 240

7 Seek_Error_Rate 0x000f 087 060 030 Pre-fail Always - 484336953

9 Power_On_Hours 0x0032 066 066 000 Old_age Always - 30032

10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0

12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 18

183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0

184 End-to-End_Error 0x0032 100 100 099 Old_age Always - 0

187 Reported_Uncorrect 0x0032 001 001 000 Old_age Always - 121

188 Command_Timeout 0x0032 100 098 000 Old_age Always - 4295098401

189 High_Fly_Writes 0x003a 099 099 000 Old_age Always - 1

190 Airflow_Temperature_Cel 0x0022 065 042 045 Old_age Always In_the_past 35 (Min/Max 17/47 #1671)

194 Temperature_Celsius 0x0022 035 058 000 Old_age Always - 35 (0 17 0 0 0)

195 Hardware_ECC_Recovered 0x001a 040 012 000 Old_age Always - 188037260

197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0

198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0

199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0

240 Head_Flying_Hours 0x0000 100 253 000 Old_age Offline - 30087 (163 46 0)

241 Total_LBAs_Written 0x0000 100 253 000 Old_age Offline - 1484163697

242 Total_LBAs_Read 0x0000 100 253 000 Old_age Offline - 1786202825

 

SMART Error Log Version: 1

ATA Error Count: 24 (device log contains only the most recent five errors)

CR = Command Register [HEX]

FR = Features Register [HEX]

SC = Sector Count Register [HEX]

SN = Sector Number Register [HEX]

CL = Cylinder Low Register [HEX]

CH = Cylinder High Register [HEX]

DH = Device/Head Register [HEX]

DC = Device Command Register [HEX]

ER = Error register [HEX]

ST = Status register [HEX]

Powered_Up_Time is measured from power on, and printed as

DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,

SS=sec, and sss=millisec. It "wraps" after 49.710 days.

 

Error 24 occurred at disk power-on lifetime: 12956 hours (539 days + 20 hours)

When the command that caused the error occurred, the device was active or idle.

 

After command completion occurred, registers were:

ER ST SC SN CL CH DH

-- -- -- -- -- -- --

40 51 00 ff ff ff 0f Error: UNC at LBA = 0x0fffffff = 268435455

 

Commands leading to the command that caused the error were:

CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name

-- -- -- -- -- -- -- -- ---------------- --------------------

60 00 c8 ff ff ff 4f 00 39d+13:34:53.009 READ FPDMA QUEUED

27 00 00 00 00 00 e0 00 39d+13:34:53.008 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]

ec 00 00 00 00 00 a0 00 39d+13:34:53.007 IDENTIFY DEVICE

ef 03 46 00 00 00 a0 00 39d+13:34:53.007 SET FEATURES [set transfer mode]

27 00 00 00 00 00 e0 00 39d+13:34:53.007 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]

 

Error 23 occurred at disk power-on lifetime: 12956 hours (539 days + 20 hours)

When the command that caused the error occurred, the device was active or idle.

 

After command completion occurred, registers were:

ER ST SC SN CL CH DH

-- -- -- -- -- -- --

40 51 00 ff ff ff 0f Error: UNC at LBA = 0x0fffffff = 268435455

 

Commands leading to the command that caused the error were:

CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name

-- -- -- -- -- -- -- -- ---------------- --------------------

60 00 c8 ff ff ff 4f 00 39d+13:34:50.170 READ FPDMA QUEUED

27 00 00 00 00 00 e0 00 39d+13:34:50.169 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]

ec 00 00 00 00 00 a0 00 39d+13:34:50.168 IDENTIFY DEVICE

ef 03 46 00 00 00 a0 00 39d+13:34:50.168 SET FEATURES [set transfer mode]

27 00 00 00 00 00 e0 00 39d+13:34:50.168 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]

 

Error 22 occurred at disk power-on lifetime: 12956 hours (539 days + 20 hours)

When the command that caused the error occurred, the device was active or idle.

 

After command completion occurred, registers were:

ER ST SC SN CL CH DH

-- -- -- -- -- -- --

40 51 00 ff ff ff 0f Error: UNC at LBA = 0x0fffffff = 268435455

 

Commands leading to the command that caused the error were:

CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name

-- -- -- -- -- -- -- -- ---------------- --------------------

60 00 c8 ff ff ff 4f 00 39d+13:34:47.331 READ FPDMA QUEUED

27 00 00 00 00 00 e0 00 39d+13:34:47.331 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]

ec 00 00 00 00 00 a0 00 39d+13:34:47.330 IDENTIFY DEVICE

ef 03 46 00 00 00 a0 00 39d+13:34:47.330 SET FEATURES [set transfer mode]

27 00 00 00 00 00 e0 00 39d+13:34:47.329 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]

 

Error 21 occurred at disk power-on lifetime: 12956 hours (539 days + 20 hours)

When the command that caused the error occurred, the device was active or idle.

 

After command completion occurred, registers were:

ER ST SC SN CL CH DH

-- -- -- -- -- -- --

40 51 00 ff ff ff 0f Error: UNC at LBA = 0x0fffffff = 268435455

 

Commands leading to the command that caused the error were:

CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name

-- -- -- -- -- -- -- -- ---------------- --------------------

60 00 c8 ff ff ff 4f 00 39d+13:34:44.483 READ FPDMA QUEUED

27 00 00 00 00 00 e0 00 39d+13:34:44.483 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]

ec 00 00 00 00 00 a0 00 39d+13:34:44.482 IDENTIFY DEVICE

ef 03 46 00 00 00 a0 00 39d+13:34:44.482 SET FEATURES [set transfer mode]

27 00 00 00 00 00 e0 00 39d+13:34:44.482 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]

 

Error 20 occurred at disk power-on lifetime: 12956 hours (539 days + 20 hours)

When the command that caused the error occurred, the device was active or idle.

 

After command completion occurred, registers were:

ER ST SC SN CL CH DH

-- -- -- -- -- -- --

40 51 00 ff ff ff 0f Error: UNC at LBA = 0x0fffffff = 268435455

 

Commands leading to the command that caused the error were:

CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name

-- -- -- -- -- -- -- -- ---------------- --------------------

60 00 c8 ff ff ff 4f 00 39d+13:34:41.644 READ FPDMA QUEUED

27 00 00 00 00 00 e0 00 39d+13:34:41.644 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]

ec 00 00 00 00 00 a0 00 39d+13:34:41.643 IDENTIFY DEVICE

ef 03 46 00 00 00 a0 00 39d+13:34:41.643 SET FEATURES [set transfer mode]

27 00 00 00 00 00 e0 00 39d+13:34:41.643 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]

 

SMART Self-test log structure revision number 1

Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error

# 1 Extended offline Interrupted (host reset) 90% 30032 -

 

SMART Selective self-test log data structure revision number 1

SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS

1 0 0 Not_testing

2 0 0 Not_testing

3 0 0 Not_testing

4 0 0 Not_testing

5 0 0 Not_testing

Selective self-test flags (0x0):

After scanning selected spans, do NOT read-scan remainder of disk.

If Selective self-test is pending on power-up, resume after 0 minute delay.

 

root@mk259:~#

Udostępnij ten post


Link to postu
Udostępnij na innych stronach

Zaloguj się, aby skomentować

Będziesz mógł dodać komentarz po zalogowaniu się



Zaloguj się

×