My question is: how do you know that most of the Tor nodes are fake?

it's impossible to know that, which is why Tor wouldn't be a reliable data source. someone could create 20 Tor endpoints in front of a single node relatively easily
Ok. So why did you say it appears to be 2-3% instead of 20%?
i was quoting the guy who said that from the data he was collecting. it took me time to write my script to replicate his findings, which I am running now.
Oh, ok.
Are you crawling it yourself or using snapshots from Bitnodes?
crawling myself
Cool. I was going to do it but it will take like 40 hours.
What's the ETA on yours?
going to look into creating a visualization of all of the nodes i collect grouped by AS.
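
A minimal sketch of what the grouping-by-AS step could look like, assuming each crawled node ends up as an (address, asn) pair, with asn set to None when it can't be resolved (as with Tor endpoints). The names group_by_as and as_shares and the sample addresses are hypothetical, made up for illustration; they are not taken from the crawler discussed here.

```python
from collections import defaultdict


def group_by_as(nodes):
    """Group node addresses by AS label.

    nodes: iterable of (address, asn) pairs. This input shape is an
    assumption; asn is None when the AS is unknown (e.g. .onion nodes).
    """
    groups = defaultdict(list)
    for address, asn in nodes:
        key = asn if asn is not None else "unknown"
        groups[key].append(address)
    return dict(groups)


def as_shares(groups):
    """Return (asn, count, fraction) rows sorted by count, largest first."""
    total = sum(len(addrs) for addrs in groups.values())
    if total == 0:
        return []
    rows = [(asn, len(addrs), len(addrs) / total) for asn, addrs in groups.items()]
    return sorted(rows, key=lambda row: row[1], reverse=True)


# Made-up sample data using documentation-reserved IP ranges and AS numbers.
sample = [
    ("203.0.113.5:8333", "AS64500"),
    ("203.0.113.9:8333", "AS64500"),
    ("198.51.100.7:8333", "AS64501"),
    ("exampleonionaddress.onion:8333", None),
]

groups = group_by_as(sample)
shares = as_shares(groups)
for asn, count, fraction in shares:
    print(f"{asn}: {count} nodes ({fraction:.0%})")
```

The rows from as_shares could be fed straight into a bar chart or treemap. Keeping the "unknown" bucket separate makes it visible how much of the total can't be attributed to any AS.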