I'm not sure if it's just my hearing, but I'm often having trouble distinguishing the consonants at the start of vocab audio. Not just between voiced or unvoiced, but also nearby sounds. Like てんき、でんき、ペンキ. Or ひる、きる. Just some examples off the top of my head.
It doesn't seem to happen with sentence audio. Or maybe the additional context makes it easier to tell which word it is. But then it seems I only have that problem with the initial sound, not anywhere else in the word.
Is it a problem with the audio, or just not being used to the accent used? Or something else?