Great try! Maybe the Ethersnoop interface (RJ45) could provide more Ethernet packet info, since they said this interface is used for vehicle debugging.
In the diagram, the signal output from the Left Controller is already an "Analog Out", which directly drives the speakers. So I infer that they have integrated the amplifier chip into the network micro, bypassing a conventional audio bus.