On my 64-bit aptosid system running kernel 3.1-5.slh.3-aptosid-686, I get a black screen reading "KERNEL BUG" every day.
/var/log/messages is quoted below, but it doesn't mention a kernel bug any more.
This happens only when I run video-processing jobs, namely a script that sequentially runs mplex, spumux, dvdauthor and HandBrakeCLI. The system hangs at random in any of these processes. It once happened half a second after the script started, so I don't think this is a temperature problem. The same happens when I boot the PC with the previously installed kernel 3.0-6.slh.3-aptosid-686.
When I boot the older kernel 2.6.39-3.slh.1-aptosid-686, the script runs smoothly until I attempt an additional action, such as accessing the machine from a remote PC via NFS and opening a directory. Then the system just freezes without showing the black screen, but /var/log/messages looks similar.
Below are excerpts from /var/log/messages showing different events under different kernels; the kernel version is stated in each excerpt.
I already ran memtest; it found no errors.
Can anyone tell me what might be wrong?
LeTuX
-----------
Code:
Dec 20 18:41:04 alexpc kernel: [ 6717.223367] ------------[ cut here ]------------
Dec 20 18:41:04 alexpc kernel: [ 6717.223380] WARNING: at /tmp/buildd/linux-aptosid-3.1/debian/build/source_i386_none/mm/truncate.c:286 truncate_inode_pages_range+0x234/0x27b()
Dec 20 18:41:04 alexpc kernel: [ 6717.223383] Hardware name: GA-MA74GM-S2H
Dec 20 18:41:04 alexpc kernel: [ 6717.223385] Modules linked in: powernow_k8 mperf cpufreq_stats cpufreq_powersave cpufreq_conservative bnep rfcomm ppdev bluetooth rfkill lp fuse nfsd nfs lockd fscache auth_rpcgss nfs_acl sunrpc ext3 jbd dm_crypt snd_hda_codec_hdmi snd_seq snd_hda_codec_realtek radeon ttm snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_seq_device snd_timer drm_kms_helper drm i2c_algo_bit shpchp snd soundcore snd_page_alloc parport_pc parport ati_agp sp5100_tco i2c_piix4 k8temp pci_hotplug evdev button pcspkr processor ext4 mbcache jbd2 crc16 dm_mod raid10 raid456 async_raid6_recov async_pq raid6_pq async_xor xor async_memcpy async_tx raid1 raid0 multipath linear md_mod sr_mod cdrom sd_mod crc_t10dif usbhid ata_generic pata_acpi hid pata_atiixp ahci libahci libata ohci_hcd ehci_hcd r8169 mii usbcore scsi_mod ssb mmc_core pcmcia pcmcia_core [last unloaded: scsi_wait_scan]
Dec 20 18:41:04 alexpc kernel: [ 6717.223450] Pid: 2463, comm: mv Not tainted 3.1-5.slh.3-aptosid-686 #1
Dec 20 18:41:04 alexpc kernel: [ 6717.223452] Call Trace:
Dec 20 18:41:04 alexpc kernel: [ 6717.223459] [<c012cdb4>] ? warn_slowpath_common+0x7c/0x8f
Dec 20 18:41:04 alexpc kernel: [ 6717.223463] [<c0183408>] ? truncate_inode_pages_range+0x234/0x27b
Dec 20 18:41:04 alexpc kernel: [ 6717.223466] [<c0183408>] ? truncate_inode_pages_range+0x234/0x27b
Dec 20 18:41:04 alexpc kernel: [ 6717.223469] [<c012cde2>] ? warn_slowpath_null+0x1b/0x1f
Dec 20 18:41:04 alexpc kernel: [ 6717.223472] [<c0183408>] ? truncate_inode_pages_range+0x234/0x27b
Dec 20 18:41:04 alexpc kernel: [ 6717.223478] [<c0183466>] ? truncate_inode_pages+0x17/0x1b
Dec 20 18:41:04 alexpc kernel: [ 6717.223490] [<f86a2593>] ? ext4_evict_inode+0xd1/0x2a1 [ext4]
Dec 20 18:41:04 alexpc kernel: [ 6717.223494] [<c01bc9ae>] ? d_delete+0xb6/0xd9
Dec 20 18:41:04 alexpc kernel: [ 6717.223498] [<c01bf2d5>] ? evict+0x82/0x121
Dec 20 18:41:04 alexpc kernel: [ 6717.223502] [<c01b867c>] ? do_unlinkat+0xca/0x107
Dec 20 18:41:04 alexpc kernel: [ 6717.223506] [<c01d3f22>] ? fsnotify_find_inode_mark_locked+0xe/0x36
Dec 20 18:41:04 alexpc kernel: [ 6717.223509] [<c01d4a4c>] ? dnotify_flush+0x27/0x9d
Dec 20 18:41:04 alexpc kernel: [ 6717.223514] [<c01ad533>] ? filp_close+0x54/0x5b
Dec 20 18:41:04 alexpc kernel: [ 6717.223517] [<c01ad59c>] ? sys_close+0x62/0x9b
Dec 20 18:41:04 alexpc kernel: [ 6717.223521] [<c039d89f>] ? sysenter_do_call+0x12/0x28
Dec 20 18:41:04 alexpc kernel: [ 6717.223524] ---[ end trace 9735c6c19f55e03d ]---
Dec 20 19:01:09 alexpc kernel: [ 7922.640544] ------------[ cut here ]------------
Code:
Dec 20 19:01:09 alexpc kernel: [ 7922.640544] ------------[ cut here ]------------
Dec 20 19:01:09 alexpc kernel: [ 7922.640556] WARNING: at /tmp/buildd/linux-aptosid-3.1/debian/build/source_i386_none/mm/truncate.c:286 truncate_inode_pages_range+0x234/0x27b()
Dec 20 19:01:09 alexpc kernel: [ 7922.640559] Hardware name: GA-MA74GM-S2H
Dec 20 19:01:09 alexpc kernel: [ 7922.640561] Modules linked in: powernow_k8 mperf cpufreq_stats cpufreq_powersave cpufreq_conservative bnep rfcomm ppdev bluetooth rfkill lp fuse nfsd nfs lockd fscache auth_rpcgss nfs_acl sunrpc ext3 jbd dm_crypt snd_hda_codec_hdmi snd_seq snd_hda_codec_realtek radeon ttm snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_seq_device snd_timer drm_kms_helper drm i2c_algo_bit shpchp snd soundcore snd_page_alloc parport_pc parport ati_agp sp5100_tco i2c_piix4 k8temp pci_hotplug evdev button pcspkr processor ext4 mbcache jbd2 crc16 dm_mod raid10 raid456 async_raid6_recov async_pq raid6_pq async_xor xor async_memcpy async_tx raid1 raid0 multipath linear md_mod sr_mod cdrom sd_mod crc_t10dif usbhid ata_generic pata_acpi hid pata_atiixp ahci libahci libata ohci_hcd ehci_hcd r8169 mii usbcore scsi_mod ssb mmc_core pcmcia pcmcia_core [last unloaded: scsi_wait_scan]
Dec 20 19:01:09 alexpc kernel: [ 7922.640628] Pid: 2463, comm: mv Tainted: G W 3.1-5.slh.3-aptosid-686 #1
Dec 20 19:01:09 alexpc kernel: [ 7922.640630] Call Trace:
Dec 20 19:01:09 alexpc kernel: [ 7922.640636] [<c012cdb4>] ? warn_slowpath_common+0x7c/0x8f
Dec 20 19:01:09 alexpc kernel: [ 7922.640640] [<c0183408>] ? truncate_inode_pages_range+0x234/0x27b
Dec 20 19:01:09 alexpc kernel: [ 7922.640643] [<c0183408>] ? truncate_inode_pages_range+0x234/0x27b
Dec 20 19:01:09 alexpc kernel: [ 7922.640646] [<c012cde2>] ? warn_slowpath_null+0x1b/0x1f
Dec 20 19:01:09 alexpc kernel: [ 7922.640649] [<c0183408>] ? truncate_inode_pages_range+0x234/0x27b
Dec 20 19:01:09 alexpc kernel: [ 7922.640655] [<c0183466>] ? truncate_inode_pages+0x17/0x1b
Dec 20 19:01:09 alexpc kernel: [ 7922.640668] [<f86a2593>] ? ext4_evict_inode+0xd1/0x2a1 [ext4]
Dec 20 19:01:09 alexpc kernel: [ 7922.640671] [<c01bc9ae>] ? d_delete+0xb6/0xd9
Dec 20 19:01:09 alexpc kernel: [ 7922.640675] [<c01bf2d5>] ? evict+0x82/0x121
Dec 20 19:01:09 alexpc kernel: [ 7922.640679] [<c01b867c>] ? do_unlinkat+0xca/0x107
Dec 20 19:01:09 alexpc kernel: [ 7922.640683] [<c01d3f22>] ? fsnotify_find_inode_mark_locked+0xe/0x36
Dec 20 19:01:09 alexpc kernel: [ 7922.640687] [<c01d4a4c>] ? dnotify_flush+0x27/0x9d
Dec 20 19:01:09 alexpc kernel: [ 7922.640691] [<c01ad533>] ? filp_close+0x54/0x5b
Dec 20 19:01:09 alexpc kernel: [ 7922.640693] [<c01ad59c>] ? sys_close+0x62/0x9b
Dec 20 19:01:09 alexpc kernel: [ 7922.640698] [<c039d89f>] ? sysenter_do_call+0x12/0x28
Dec 20 19:01:09 alexpc kernel: [ 7922.640700] ---[ end trace 9735c6c19f55e03e ]---
Dec 20 19:08:46 alexpc kernel: [ 8378.916915] ------------[ cut here ]------------
Code:
Dec 21 12:33:35 alexpc kernel: [ 1583.838586] Pid: 3271, comm: java Not tainted 3.0-6.slh.3-aptosid-686 #1
Dec 21 12:33:35 alexpc kernel: [ 1583.838591] Call Trace:
Dec 21 12:33:35 alexpc kernel: [ 1583.838608] [<c01a54bd>] ? bad_page+0x8d/0xe0
Dec 21 12:33:35 alexpc kernel: [ 1583.838616] [<c01a5602>] ? free_pages_prepare+0xf2/0x100
Dec 21 12:33:35 alexpc kernel: [ 1583.838624] [<c01a6d28>] ? free_hot_cold_page+0x28/0x140
Dec 21 12:33:35 alexpc kernel: [ 1583.838632] [<c01a701f>] ? __pagevec_free+0x1f/0x30
Dec 21 12:33:35 alexpc kernel: [ 1583.838640] [<c01a98db>] ? release_pages+0x13b/0x1f0
Dec 21 12:33:35 alexpc kernel: [ 1583.838650] [<c01c71fb>] ? free_pages_and_swap_cache+0x7b/0x90
Dec 21 12:33:35 alexpc kernel: [ 1583.838660] [<c01b8363>] ? tlb_flush_mmu+0x53/0x80
Dec 21 12:33:35 alexpc kernel: [ 1583.838667] [<c01b8399>] ? tlb_finish_mmu+0x9/0x40
Dec 21 12:33:35 alexpc kernel: [ 1583.838673] [<c01bdc1d>] ? unmap_region+0xcd/0xe0
Dec 21 12:33:35 alexpc kernel: [ 1583.838681] [<c01beb33>] ? do_munmap+0x223/0x2a0
Dec 21 12:33:35 alexpc kernel: [ 1583.838687] [<c01bef4f>] ? sys_brk+0xef/0x100
Dec 21 12:33:35 alexpc kernel: [ 1583.838695] [<c044ba98>] ? sysenter_do_call+0x12/0x28
Dec 21 12:33:35 alexpc kernel: [ 1583.838700] Disabling lock debugging due to kernel taint
Code:
Dec 23 13:10:54 alexpc kernel: [ 330.236543] Pid: 331, comm: kswapd0 Not tainted 2.6.39-3.slh.1-aptosid-686 #1
Dec 23 13:10:54 alexpc kernel: [ 330.236549] Call Trace:
Dec 23 13:10:54 alexpc kernel: [ 330.236563] [<c01a1f3b>] ? bad_page+0x8b/0xd0
Dec 23 13:10:54 alexpc kernel: [ 330.236571] [<c01a2072>] ? free_pages_prepare+0xf2/0x100
Dec 23 13:10:54 alexpc kernel: [ 330.236579] [<c01a3828>] ? free_hot_cold_page+0x28/0x140
Dec 23 13:10:54 alexpc kernel: [ 330.236586] [<c01a3b1f>] ? __pagevec_free+0x1f/0x30
Dec 23 13:10:54 alexpc kernel: [ 330.236593] [<c01a76c8>] ? free_page_list+0x68/0xa0
Dec 23 13:10:54 alexpc kernel: [ 330.236602] [<c01a8790>] ? shrink_page_list+0x120/0x710
Dec 23 13:10:54 alexpc kernel: [ 330.236610] [<c01a7985>] ? update_isolated_counts.isra.46+0x135/0x160
Dec 23 13:10:54 alexpc kernel: [ 330.236618] [<c01a90e8>] ? shrink_inactive_list+0x148/0x2d0
Dec 23 13:10:54 alexpc kernel: [ 330.236626] [<c01a9706>] ? shrink_zone+0x496/0x550
Dec 23 13:10:54 alexpc kernel: [ 330.236639] [<c01a9cdf>] ? kswapd+0x51f/0x720
Dec 23 13:10:54 alexpc kernel: [ 330.236647] [<c01a97c0>] ? shrink_zone+0x550/0x550
Dec 23 13:10:54 alexpc kernel: [ 330.236655] [<c0151dc9>] ? kthread+0x69/0x70
Dec 23 13:10:54 alexpc kernel: [ 330.236661] [<c0151d60>] ? kthread_worker_fn+0x150/0x150
Dec 23 13:10:54 alexpc kernel: [ 330.236672] [<c0437bb6>] ? kernel_thread_helper+0x6/0xd
Dec 23 13:10:54 alexpc kernel: [ 330.236676] Disabling lock debugging due to kernel taint
Code:
Dec 23 13:58:06 alexpc kernel: [ 258.081944] Modules linked in: powernow_k8 mperf cpufreq_stats cpufreq_powersave cpufreq_conservative bnep rfcomm bluetooth rfkill ppdev lp fuse nfsd nfs lockd fscache auth_rpcgss nfs_acl sunrpc ext3 jbd dm_crypt snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_seq snd_timer snd_seq_device radeon sp5100_tco ttm i2c_piix4 snd drm_kms_helper drm i2c_algo_bit soundcore shpchp pci_hotplug ati_agp k8temp parport_pc evdev pcspkr parport snd_page_alloc button processor ext4 mbcache jbd2 crc16 dm_mod raid10 raid456 async_raid6_recov async_pq raid6_pq async_xor xor async_memcpy async_tx raid1 raid0 multipath linear md_mod sr_mod cdrom sd_mod crc_t10dif usbhid ata_generic hid pata_acpi ahci ohci_hcd libahci pata_atiixp libata ehci_hcd scsi_mod r8169 mii usbcore ssb mmc_core pcmcia pcmcia_core [last unloaded: scsi_wait_scan]
Dec 23 13:58:06 alexpc kernel: [ 258.082575]
Dec 23 13:58:06 alexpc kernel: [ 258.082575] Pid: 2456, comm: flush-8:64 Not tainted 3.1-5.slh.3-aptosid-686 #1 Gigabyte Technology Co., Ltd. GA-MA74GM-S2H/GA-MA74GM-S2H
Dec 23 13:58:06 alexpc kernel: [ 258.082575] EIP: 0060:[<f86ff2b5>] EFLAGS: 00010246 CPU: 1
Dec 23 13:58:06 alexpc kernel: [ 258.082575] EIP is at mpage_da_submit_io+0x188/0x3a4 [ext4]
Dec 23 13:58:06 alexpc kernel: [ 258.082575] EAX: 5e00083c EBX: 00000000 ECX: 00000000 EDX: 00000000
Dec 23 13:58:06 alexpc kernel: [ 258.082575] ESI: f618b3e0 EDI: f25a3ce0 EBP: f25a3e5c ESP: f25a3c34
Dec 23 13:58:06 alexpc kernel: [ 258.082575] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
Dec 23 13:58:06 alexpc kernel: [ 258.082575] 0000000e f73a2c00 e4d09558 00155734 00000000 00005f34 00005f34 00000000
Dec 23 13:58:06 alexpc kernel: [ 258.082575] f25a3d64 e4d09558 00001000 00000000 00000000 f8720530 00005f34 0000000e
Dec 23 13:58:06 alexpc kernel: [ 258.082575] 00007358 00000000 e4d09620 00000000 00000000 0000000e 00000000 f618b3e0
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<f8700e18>] ? mpage_da_map_and_submit+0x396/0x3a8 [ext4]
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<c0236879>] ? __lookup_tag+0x81/0xd9
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<c0236e3c>] ? radix_tree_gang_lookup_tag_slot+0x79/0x93
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<c017a6ef>] ? find_get_pages_tag+0x9e/0xb6
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<f8700ffc>] ? write_cache_pages_da+0x109/0x293 [ext4]
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<f87013ae>] ? ext4_da_writepages+0x228/0x33c [ext4]
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<c0181a11>] ? do_writepages+0x12/0x1b
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<c01c67f7>] ? writeback_single_inode+0xb9/0x1ea
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<c01c6b9a>] ? writeback_sb_inodes+0x12f/0x1b8
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<c01c6c76>] ? __writeback_inodes_wb+0x53/0x84
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<c01c6d69>] ? wb_writeback+0xc2/0x137
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<c01810a5>] ? determine_dirtyable_memory+0x31/0x43
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<c01c70a7>] ? wb_do_writeback+0x120/0x131
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<c01c70ff>] ? bdi_writeback_thread+0x47/0x101
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<c01c70b8>] ? wb_do_writeback+0x131/0x131
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<c013fb8f>] ? kthread+0x63/0x68
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<c013fb2c>] ? kthread_worker_fn+0x10d/0x10d
Dec 23 13:58:06 alexpc kernel: [ 258.082575] [<c039de3e>] ? kernel_thread_helper+0x6/0xd
Dec 23 13:58:06 alexpc kernel: [ 258.120141] ---[ end trace 909122826bd02045 ]---
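In case it helps anyone compare the events above, here is a minimal Python sketch that pulls the process name, taint state and kernel version out of the "Pid:" lines. The regular expression only assumes the line layout and the "-aptosid-" kernel naming seen in these excerpts; the names `PID_LINE`, `SAMPLE` and `summarize` are made up for this illustration.

```python
import re

# Assumed layout, taken from the excerpts above:
#   "... Pid: 2463, comm: mv Not tainted 3.1-5.slh.3-aptosid-686 #1"
#   "... Pid: 2463, comm: mv Tainted: G W 3.1-5.slh.3-aptosid-686 #1"
PID_LINE = re.compile(
    r"Pid: (?P<pid>\d+), comm: (?P<comm>\S+) "
    r"(?P<taint>Not tainted|Tainted: .*?) "
    r"(?P<kernel>\d\S*-aptosid-\S+) #\d+"
)

# Four "Pid:" lines copied from the log excerpts in this post.
SAMPLE = [
    "Dec 20 18:41:04 alexpc kernel: [ 6717.223450] Pid: 2463, comm: mv Not tainted 3.1-5.slh.3-aptosid-686 #1",
    "Dec 20 19:01:09 alexpc kernel: [ 7922.640628] Pid: 2463, comm: mv Tainted: G W 3.1-5.slh.3-aptosid-686 #1",
    "Dec 21 12:33:35 alexpc kernel: [ 1583.838586] Pid: 3271, comm: java Not tainted 3.0-6.slh.3-aptosid-686 #1",
    "Dec 23 13:10:54 alexpc kernel: [ 330.236543] Pid: 331, comm: kswapd0 Not tainted 2.6.39-3.slh.1-aptosid-686 #1",
]


def summarize(lines):
    """Return a (pid, comm, taint, kernel) tuple for every 'Pid:' line found."""
    events = []
    for line in lines:
        match = PID_LINE.search(line)
        if match:
            events.append(
                (int(match["pid"]), match["comm"], match["taint"], match["kernel"])
            )
    return events


for event in summarize(SAMPLE):
    print(event)
```

To run it over the whole log instead of the sample, something like `summarize(open("/var/log/messages"))` should work, since the function accepts any iterable of lines.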