×
all 12 comments

[–]gwern 2 points3 points  (2 children)

The 6 DNMs are Apollon/CannaHome/Cannazon/Cryptonia/Empire/Samsara if anyone was wondering.

You need Tor to download the dataset. Once you have the Tor browser bundle installed, you can find the data set here: http://lolwuc3342535625.onion/2020-01-13-reviews.csv . If someone could mirror this on a clearnet hosting site, I would appreciate that. I use Tor for everything and most file hosting websites will not allow me to upload over Tor.

Mirror: https://gwern.net/doc/sr/2020-01-13-kilos-6dnms-reviews.csv.xz

[–]EndlessMorning[S] 0 points1 point  (0 children)

I edited the OP to add the link. Thanks!

[–]HCI_Fab 0 points1 point  (0 children)

Gwern is great!

[–]dgjkdsagdwqucbjsdjk 1 point2 points  (5 children)

Thank you for making this available.

[–]EndlessMorning[S] 0 points1 point  (4 children)

No problem. If you want any additional data let me know. I have plenty still.

[–]GangstaJedi 0 points1 point  (3 children)

Do you have price data over time?

[–]EndlessMorning[S] 0 points1 point  (2 children)

What do you mean by price data over time? Sorry, English is not my first language and this seems little ambiguous.

The listings table of the database does not have timestamps. However, the vendor table has the vendor join date. If you want I can assemble you some data like

vendor username, vendor join timestamp, listing price, listing local currency, listing title

[–]blarghusmaximus 1 point2 points  (0 children)

Price of drugs over time could be fun!

[–]GangstaJedi 0 points1 point  (0 children)

If it exists, a product listing with price and time of listing

[–]HCI_Fab 0 points1 point  (1 child)

Do you have associated images for the data? They could be useful for analysis where text is fairly unique or unusual from outside definitions, and for the many listings without much descriptive text

https://www.researchgate.net/publication/334365049_Making_Sense_of_Darknet_Markets_Automatic_Inference_of_Semantic_Classifications_from_Unconventional_Multimedia_Datasets

[–]gwern 1 point2 points  (0 children)

If it helps, most of the scrapes I made include the images: https://gwern.net/DNM-archives (although you probably know that).

[–]lroman 0 points1 point  (0 children)

You use tor for Reddit also?