
Originally Posted by
chaumi
I'll make a wild guess you don't know how/where to get raw log files, Jam (I assume these come from hosting)...and even if you had them, how to interpret them effectively (never seen any myself, but I'll make an equal guess I wouldn't know how to get them/what to do with them either).
But there are ways around it (although I'd be prepared to make a bet that what AD suggested is the 'easiest/best' way).
So AD is almost certainly right, the data you have is almost certainly flawed. Depends to an extent on where you're collecting it from.
But how much flawed?
If you have anything that tracks traffic in real time and shows device/IP/location/how many pages navigated etc etc, then that will be giving you some clues about how much garbage traffic you're getting. You can normally see by the general profile(s) whether it's 'real' and how much is probably (or definitely) not real.
When you combine that with tools like GSC, Bing webmaster, ahrefs etc and what they are telling you, you can build a reasonable picture.
Examples
.....if you can see Google, Bing, Yandex etc visits (and they look to be from search), then check your rankings for that search (or what you rank for relevant to that page) . If it's one or two visits, you might be getting seen for a long tail. If dozens/hundreds, then most likely you have to be ranking for a term that gives that volume. If you can't find yourself (at least on page 1) then it's a fair bet it's false traffic (though remember you have to think about originating countries, and search for yourself appropriately. And remember that some SERPs results can vary from search to search (different datacentres, Google testing, etc etc).
...if you have bursts of what looks to be direct traffic, then almost certainly this is garbage. Unless you think someone is telling all their mates ' go look at this site, it's the dogs b****cks (an English term for 'very good' for anyone that doesn't know it), don't search, go straight to it' (and that ain't gonna be happening, for ABT or for the sites/pages of 99.99% of the rest of us).
None of this is precise (which is where the raw server logs I guess may have the upper hand), but it can give a picture (probably clearer over time and wrt consistency) that helps you understand what's happening.