系统出现i40e驱动错误,网络连接失败

系统正常运行过程中,系统日志突然出现i40e错误,网络中断,重启后恢复,过一段时间后又会出现,且目前只使用了电口,没有使用光口。错误信息如下:
Jun 5 10:46:23 muyun kernel: [61482.274302] i40e 0000:02:00.0: ARQ: 未知事件 0x0000 被忽略
Jun 5 10:46:23 muyun kernel: [61482.354504] i40e 0000:02:00.0: ARQ: 未知事件 0x0000 被忽略
Jun 5 10:46:23 muyun kernel: [61482.434688] i40e 0000:02:00.0: ARQ: 未知事件 0x0000 被忽略
设备详细信息如下:
系统版本:22.03.sp4

网络接口信息:
[root@muyun ~]# lspci -nnk | grep -i net -A 3
02:00.0 以太网控制器 [0200]: Intel Corporation 10GbE SFP+ 以太网控制器 X710 [8086:1572] (rev 02)
子系统: Intel Corporation Ethernet Converged Network Adapter X710 [8086:0000]
正在使用的内核驱动程序: i40e
内核模块: i40e
02:00.1 以太网控制器 [0200]: Intel Corporation 10GbE SFP+ 以太网控制器 X710 [8086:1572] (rev 02)
子系统: Intel Corporation Ethernet Converged Network Adapter X710 [8086:0000]
正在使用的内核驱动程序: i40e
内核模块: i40e
03:00.0 以太网控制器 [0200]: Intel Corporation I350 千兆光纤网络连接 [8086:1522] (rev 01)
正在使用的内核驱动程序: igb
内核模块: igb
03:00.1 以太网控制器 [0200]: Intel Corporation I350 千兆光纤网络连接 [8086:1522] (rev 01)
正在使用的内核驱动程序: igb
内核模块: igb
03:00.2 以太网控制器 [0200]: Intel Corporation I350 千兆光纤网络连接 [8086:1522] (rev 01)
正在使用的内核驱动程序: igb
内核模块: igb
03:00.3 以太网控制器 [0200]: Intel Corporation I350 千兆光纤网络连接 [8086:1522] (rev 01)
正在使用的内核驱动程序: igb
内核模块: igb
04:00.0 以太网控制器 [0200]: Intel Corporation I210 千兆网络连接 [8086:1533] (rev 03)
正在使用的内核驱动程序: igb
内核模块: igb
05:00.0 以太网控制器 [0200]: Intel Corporation I210 千兆网络连接 [8086:1533] (rev 03)
正在使用的内核驱动程序: igb
内核模块: igb
06:00.0 以太网控制器 [0200]: Intel Corporation I210 千兆网络连接 [8086:1533] (rev 03)
正在使用的内核驱动程序: igb
内核模块: igb
07:00.0 以太网控制器 [0200]: Intel Corporation I210 千兆网络连接 [8086:1533] (rev 03)
正在使用的内核驱动程序: igb
内核模块: igb
08:00.0 以太网控制器 [0200]: Intel Corporation I210 千兆网络连接 [8086:1533] (rev 03)
正在使用的内核驱动程序: igb
内核模块: igb
09:00.0 以太网控制器 [0200]: Intel Corporation I210 千兆网络连接 [8086:1533] (rev 03)
正在使用的内核驱动程序: igb
内核模块: igb
[root@muyun ~]# ip addr
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
inet 127.0.0.1/8 scope host lo
valid_lft forever preferred_lft forever
inet6 ::1/128 scope host
valid_lft forever preferred_lft forever
2: GE7: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc mq state DOWN group default qlen 1000
link/ether 64:57:e5:8d:19:4c brd ff:ff:ff:ff:ff:ff
3: GE12: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 1000
link/ether 64:57:e5:9a:a1:91 brd ff:ff:ff:ff:ff:ff
4: GE8: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc mq state DOWN group default qlen 1000
link/ether 64:57:e5:8d:19:4d brd ff:ff:ff:ff:ff:ff
5: GE11: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc mq state DOWN group default qlen 1000
link/ether 64:57:e5:9a:a1:92 brd ff:ff:ff:ff:ff:ff
6: GE9: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc mq state DOWN group default qlen 1000
link/ether 64:57:e5:8d:19:4e brd ff:ff:ff:ff:ff:ff
7: GE10: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc mq state DOWN group default qlen 1000
link/ether 64:57:e5:8d:19:4f brd ff:ff:ff:ff:ff:ff
8: GE1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq state UP group default qlen 1000
link/ether 64:57:e5:8d:19:46 brd ff:ff:ff:ff:ff:ff
inet 10.10.105.30/25 brd 10.10.105.127 scope global GE1
valid_lft forever preferred_lft forever
inet6 fe80::6657:e5ff:fe8d:1946/64 scope link
valid_lft forever preferred_lft forever
9: GE2: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 1000
link/ether 64:57:e5:8d:19:47 brd ff:ff:ff:ff:ff:ff
10: GE3: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 1000
link/ether 64:57:e5:8d:19:48 brd ff:ff:ff:ff:ff:ff
11: GE4: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 1000
link/ether 64:57:e5:8d:19:49 brd ff:ff:ff:ff:ff:ff
12: GE5: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN group default qlen 1000
link/ether 64:57:e5:8d:19:4a brd ff:ff:ff:ff:ff:ff
13: GE6: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc mq state DOWN group default qlen 1000
link/ether 64:57:e5:8d:19:4b brd ff:ff:ff:ff:ff:ff
inet 192.168.2.2/24 brd 192.168.2.255 scope global GE6
valid_lft forever preferred_lft forever
inet6 fe80::6657:e5ff:fe8d:194b/64 scope link
valid_lft forever preferred_lft forever
[root@muyun ~]#
[root@muyun ~]#
[root@muyun ~]#
[root@muyun ~]# lspci | grep -i ethernet
02:00.0 Ethernet controller: Intel Corporation Ethernet Controller X710 for 10GbE SFP+ (rev 02)
02:00.1 Ethernet controller: Intel Corporation Ethernet Controller X710 for 10GbE SFP+ (rev 02)
03:00.0 Ethernet controller: Intel Corporation I350 Gigabit Fiber Network Connection (rev 01)
03:00.1 Ethernet controller: Intel Corporation I350 Gigabit Fiber Network Connection (rev 01)
03:00.2 Ethernet controller: Intel Corporation I350 Gigabit Fiber Network Connection (rev 01)
03:00.3 Ethernet controller: Intel Corporation I350 Gigabit Fiber Network Connection (rev 01)
04:00.0 Ethernet controller: Intel Corporation I210 Gigabit Network Connection (rev 03)
05:00.0 Ethernet controller: Intel Corporation I210 Gigabit Network Connection (rev 03)
06:00.0 Ethernet controller: Intel Corporation I210 Gigabit Network Connection (rev 03)
07:00.0 Ethernet controller: Intel Corporation I210 Gigabit Network Connection (rev 03)
08:00.0 Ethernet controller: Intel Corporation I210 Gigabit Network Connection (rev 03)
09:00.0 Ethernet controller: Intel Corporation I210 Gigabit Network Connection (rev 03)
[root@muyun ~]# lspci -vvv -s 02:00.0
02:00.0 Ethernet controller: Intel Corporation Ethernet Controller X710 for 10GbE SFP+ (rev 02)
Subsystem: Intel Corporation Ethernet Converged Network Adapter X710
Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- SERR- <PERR- INTx-
Interrupt: pin A routed to IRQ 17
Region 0: Memory at d0800000 (64-bit, prefetchable) [disabled] [size=8M]
Region 3: Memory at d1008000 (64-bit, prefetchable) [disabled] [size=32K]
Expansion ROM at df780000 [virtual] [disabled] [size=512K]
Capabilities: [40] Power Management version 3
Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+)
Status: D0 NoSoftRst+ PME-Enable- DSel=0 DScale=1 PME-
Capabilities: [50] MSI: Enable- Count=1/1 Maskable+ 64bit+
Address: 0000000000000000 Data: 0000
Masking: 00000000 Pending: 00000000
Capabilities: [70] MSI-X: Enable- Count=129 Masked-
Vector table: BAR=3 offset=00000000
PBA: BAR=3 offset=00001000
Capabilities: [a0] Express (v2) Endpoint, MSI 00
DevCap: MaxPayload 2048 bytes, PhantFunc 0, Latency L0s <512ns, L1 <64us
ExtTag+ AttnBtn- AttnInd- PwrInd- RBE+ FLReset+ SlotPowerLimit 0W
DevCtl: CorrErr- NonFatalErr- FatalErr- UnsupReq-
RlxdOrd+ ExtTag+ PhantFunc- AuxPwr- NoSnoop- FLReset-
MaxPayload 128 bytes, MaxReadReq 512 bytes
DevSta: CorrErr+ NonFatalErr+ FatalErr- UnsupReq+ AuxPwr- TransPend-
LnkCap: Port #0, Speed 8GT/s, Width x8, ASPM L1, Exit Latency L1 <16us
ClockPM- Surprise- LLActRep- BwNot- ASPMOptComp+
LnkCtl: ASPM Disabled; RCB 64 bytes, Disabled- CommClk-
ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
LnkSta: Speed 8GT/s, Width x8
TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
DevCap2: Completion Timeout: Range ABCD, TimeoutDis+ NROPrPrP- LTR-
10BitTagComp- 10BitTagReq- OBFF Not Supported, ExtFmt- EETLPPrefix-
EmergencyPowerReduction Not Supported, EmergencyPowerReductionInit-
FRS- TPHComp- ExtTPHComp-
AtomicOpsCap: 32bit- 64bit- 128bitCAS-
DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis- LTR- 10BitTagReq- OBFF Disabled,
AtomicOpsCtl: ReqEn-
LnkCap2: Supported Link Speeds: 2.5-8GT/s, Crosslink- Retimer- 2Retimers- DRS-
LnkCtl2: Target Link Speed: 2.5GT/s, EnterCompliance- SpeedDis-
Transmit Margin: Normal Operating Range, EnterModifiedCompliance- ComplianceSOS-
Compliance Preset/De-emphasis: -6dB de-emphasis, 0dB preshoot
LnkSta2: Current De-emphasis Level: -6dB, EqualizationComplete+ EqualizationPhase1+
EqualizationPhase2+ EqualizationPhase3+ LinkEqualizationRequest-
Retimer- 2Retimers- CrosslinkRes: unsupported
Capabilities: [100 v2] Advanced Error Reporting
UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq+ ACSViol-
UEMsk: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
UESvrt: DLP+ SDES+ TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt- RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol-
CESta: RxErr- BadTLP- BadDLLP- Rollover- Timeout- AdvNonFatalErr+
CEMsk: RxErr- BadTLP- BadDLLP- Rollover- Timeout- AdvNonFatalErr+
AERCap: First Error Pointer: 14, ECRCGenCap+ ECRCGenEn- ECRCChkCap+ ECRCChkEn-
MultHdrRecCap- MultHdrRecEn- TLPPfxPres- HdrLogCap-
HeaderLog: 40000001 0000000f d026680c 00000000
Capabilities: [140 v1] Device Serial Number 91-a1-9a-ff-ff-e5-57-64
Capabilities: [150 v1] Alternative Routing-ID Interpretation (ARI)
ARICap: MFVC- ACS-, Next Function: 1
ARICtl: MFVC- ACS-, Function Group: 0
Capabilities: [160 v1] Single Root I/O Virtualization (SR-IOV)
IOVCap: Migration- 10BitTagReq- Interrupt Message Number: 000
IOVCtl: Enable- Migration- Interrupt- MSE- ARIHierarchy- 10BitTagReq-
IOVSta: Migration-
Initial VFs: 64, Total VFs: 64, Number of VFs: 0, Function Dependency Link: 00
VF offset: 272, stride: 1, Device ID: 154c
Supported Page Size: 00000553, System Page Size: 00000001
Region 0: Memory at 0000000000000000 (64-bit, prefetchable)
Region 3: Memory at 0000000000000000 (64-bit, prefetchable)
VF Migration: offset: 00000000,BIR: 0
功能:[1a0 v1] 事务处理提示
支持设备特定模式
没有转向表可用
功能:[1b0 v1] 访问控制服务
ACSCap: SrcValid- TransBlk- ReqRedir- CmpltRedir- UpstreamFwd- EgressCtrl- DirectTrans-
ACSCtl: SrcValid- TransBlk- ReqRedir- CmpltRedir- UpstreamFwd- EgressCtrl- DirectTrans-
功能:[1d0 v1] 次级PCI Express
LnkCtl3:LnkEquIntrruptEn- PerformEqu-
LaneErrStat:0
正在使用的内核驱动程序:i40e
内核模块:i40e

[root@muyun ~]# lspci -vvv -s 04:00.0
04:00.0 以太网控制器:Intel Corporation I210 千兆网络连接(修订版03)
控制:I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
状态:Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- SERR- <PERR- INTx-
延迟:0,缓存行大小:64 字节
中断:引脚A路由到IRQ 16
区域0:内存位于df500000(32位,不可预取)[大小=512K]
区域2:I/O端口位于d000 [大小=32]
区域3:内存位于df580000(32位,不可预取)[大小=16K]
功能:[40] 电源管理版本3
标志:PMEClk- DSI+ D1- D2- 辅助电流=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+)
状态:D0 没有软复位+ PME-启用- DSel=0 DScale=1 PME-
功能:[50] MSI:启用- 计数=1/1 可屏蔽+ 64位+
地址:0000000000000000 数据:0000
屏蔽:00000000 挂起:00000000
功能:[70] MSI-X:启用+ 计数=5 屏蔽-
向量表:BAR=3 偏移=00000000
PBA:BAR=3 偏移=00002000
功能:[a0] Express(v2)端点,MSI 00
DevCap: 最大有效载荷512字节,幻影功能0,延迟L0s <512ns,L1 <64us
ExtTag- AttnBtn- AttnInd- PwrInd- RBE+ FLReset+ 插槽功率限制0W
DevCtl: 纠正错误+ 非致命错误+ 致命错误+ 不支持请求+
放松顺序+ ExtTag- 幻影功能- 辅助电源- 不窥视+ FLReset-
最大有效载荷256字节,最大读取请求512字节
DevSta: 纠正错误- 非致命错误- 致命错误- 不支持请求- 辅助电源+ 传输挂起-
LnkCap: 端口#0,速度2.5GT/s,宽度x1,ASPM L0s L1,退出延迟L0s <2us,L1 <16us
时钟PM- 惊奇- LLActRep- BwNot- ASPMOptComp+
LnkCtl: ASPM禁用;RCB 64字节,禁用- 共享时钟+
ExtSynch- 时钟PM- 自动宽度禁用- BWInt- 自动BWInt-
LnkSta: 速度2.5GT/s,宽度x1
TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt-
DevCap2:完成超时:范围ABCD,超时禁用+ NROPrPrP- LTR-
10BitTagComp- 10BitTagReq- OBFF不支持,ExtFmt- EETLPPrefix-
紧急电源减少不支持,紧急电源减少初始化-
FRS- TPHComp- ExtTPHComp-
原子操作能力:32位- 64位- 128位CAS-
DevCtl2:完成超时:50us到50ms,超时禁用- LTR- 10BitTagReq- OBFF禁用,
原子操作控制:请求启用-
LnkCtl2:目标链路速度:2.5GT/s,EnterCompliance- 速度禁用-
传输余量:正常运行范围,EnterModifiedCompliance- ComplianceSOS-
合规预设/去强调:-6dB去强调,0dB预发
LnkSta2:当前去强调级别:-6dB,均衡完成- 均衡阶段1-
均衡阶段2- 均衡阶段3- 链路均衡请求-
重定时器- 2个重定时器- 跨链路资源:不支持
功能:[100 v2] 高级错误报告
UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
UEMsk: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
UESvrt: DLP+ SDES+ TLP- FCP+ CmpltTO- CmpltAbrt- UnxCmplt- RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol-
CESta: RxErr- BadTLP- BadDLLP- Rollover- Timeout- AdvNonFatalErr-
CEMsk: RxErr- BadTLP- BadDLLP- Rollover- Timeout- AdvNonFatalErr+
AERCap: 第一个错误指针:00,ECRCGenCap+ ECRCGenEn- ECRCChkCap+ ECRCChkEn-
多头记录能力- 多头记录启用- TLPPfxPres- HdrLogCap-
头日志:00000000 00000000 00000000 00000000
功能:[140 v1] 设备序列号 64-57-e5-ff-ff-8d-19-46
功能:[1a0 v1] 事务处理提示
支持设备特定模式
TPH能力结构中的转向表
正在使用的内核驱动程序:igb
内核模块:igb

怀疑是英特尔i40e网卡固件或驱动程序的已知缺陷导致的,请问现在解决了吗