Return Styles: Pseud0ch, Terminal, Valhalla, NES, Geocities, Blue Moon.

Pages: 1-4041-

archive of the old /prog/

Name: Anonymous 2013-11-02 2:15

So I got my hands on this 1-gigabyte file which has all of the old /prog/ up to some date in august 2013:

1.4G prog-20130813130608.db

(it's 55mb when compressed)

Big thanks to the anonymous person who published this (though the original url is gone now)

I wonder if it should be shared somehow for posterity. maybe some other people here also have this file ? Somehow I doubt archive.org would want it, what with all the racist posts.

also i wrote a little python program to browse that file via a local HTTP interface

Anyway also big thanks to whoever starte /prog/rider as a resurrection of the old /prog/

Name: Anonymous 2013-11-02 3:51

I also have a copy, I can put it up as a torrent if you want.

Name: Anonymous 2013-11-02 4:41

Please share that file please please please.

Name: Anonymous 2013-11-02 8:19

1.4GB compressed to 55MB
Somehow, I'm not very surprised by this. I wonder why.

Name: Anonymous 2013-11-02 9:01

>>4

Because it's easy to compress text.

Name: Anonymous 2013-11-02 13:15

>>4
It's text with a bunch of rerereredundant HTML <tag></tag>s.

Name: Anonymous 2013-11-02 15:30

>>4
Also, I guess lots of memes got pasted several times

Name: Anonymous 2013-11-02 18:06

>>3
magnet:?xt=urn:btih:d69e0d2dedb87d3afc780760f21e50cbdc4eaf8d&dn=prog-20130813130608.db.xz

Name: >>1 2013-11-02 19:39

BTW I wrote a little python/flask app to browse the "prog.db" file via local HTTP. It has a feature to search for a string in all posts and sort results by date, so you can find out when a meme first appeared. Optionally, you can include the original javascript and CSS from dis.4chan.org to get spoiler-tags to work and have the original colours etc. (Yes, flask probalby sucks, but this little program is useful for browsing the archive.) Hope this is useful to other people.

http://codepad.org/RLr6B5tu

here's a screenshot where it shows the history of the meme "faggot quote": http://i.imgur.com/bmipLQP.png

Name: Anonymous 2013-11-02 20:01

Name: Anonymous 2013-11-02 22:00

>>9
Goddamn, what's wrong with

$ sqlite3 prog.db
sqlite> select * from posts where body like '%faggot quote%' order by time;

Name: Anonymous 2013-11-02 22:17

Did somebody say \\faggot quotes//?

Name: Anonymous 2013-11-02 22:27

Did somebody say faggotfaggot quotesquotes?

Name: Anonymous 2013-11-02 22:43

>>12
Those are \\furious masturbation// quotes.

Name: Anonymous 2013-11-02 23:11

I felt like going to the old /prog/, >>1-san.

You could have posted that whole thread here.

Name: Anonymous 2013-11-02 23:56

"What does it mean to be normal?" rhetorically asked the doctor. The doctor pounded his chest and said, "I am normal ! I am the doctor ! I am eternal !". An Indian man 4000 years ago wrote down that I am normal ! I am the doctor! he who installs .NET installs the truth. the wheel spun and several thousand dollars were contributed towards the most prestigiuous instutition prized for its cliques of suburban children. who needs god when you can install .NET ?

Name: Anonymous 2013-11-03 4:56

can we officially fork world4ch /prog/ now?

Name: Anonymous 2013-11-03 5:02

>>17
I don't think this board software is that scalable.

Name: Anonymous 2013-11-03 5:53

Why does everyone put spoilers around /prog/. I think it's a maymay too but how did that start?

Name: Anonymous 2013-11-03 13:33

>>17
Fork a clusterfuck of PHP? No, thanks. Get working on SchemeBB instead.

Name: Anonymous 2013-11-03 22:52

Please seed!

Name: Anonymous 2013-11-03 23:02

subject.txt is no longer accessible. ;_;

Name: Anonymous 2013-11-03 23:22

>>22
It's been like that since the implementation of the captcha.

Check the onion link posted in this thread for another search engine based on a kinda recent snapshot.

Name: >>1 2013-11-04 0:01

Name: Anonymous 2013-11-04 3:36

>>23
It's been broken since the 15th of September. The CAPTCHA was introduced a few weeks earlier.

Name: Anonymous 2013-11-04 23:10

>>24
But I wasn't asking about faggot quotes. I was asking about putting spoilers when mentioning /prog/

Name: Anonymous 2013-11-04 23:23

Name: Anonymous 2013-11-05 1:26

What does s/something/something mean? I always see it on /prog/ and Hacker News and I've been bugged by this for months but I have no idea how to Google it.

Name: >>1 2013-11-05 2:55

>>28

's/foo/bar' is a substitution command for the unix program sed. here's an example log:

$ echo 'a funny joke' | sed 's/joke/clown/'
a funny clown

Name: Anonymous 2013-11-10 11:24

Why is it that there are 9 people in the swarm and I'm not seeding the file at all?

Name: Anonymous 2013-11-10 14:33

>>30
If you want to seed that badly, I'll download it.

Name: Anonymous 2013-11-10 15:48

>>31
Be sure to use I2P if you are torrenting, or a VPS.

Name: Anonymous 2013-11-10 21:49

>>32
Why? Would I compromise any of you if I download it from my home connection?

Name: >>8 2013-11-10 22:36

>>33
I'm still seeding from a VPS. You are free to download it from wherever you want.

Name: Anonymous 2013-11-11 2:04

>>33
Because anyone can record IP address if you are joining the magnet link, even a bot…

>>34
There's a smart poster.

Name: Anonymous 2013-11-11 2:35

>>35
I already know that, but I don't care much about giving out some IP that changes every day. Moreover, we're not sharing anything illegal, are we?

No, this is not a ``nothing to hide, nothing to fear'' argument, I just don't see why go through such lengths just to download 1GB of ANUS and chinkspam. I can't pay a good VPS/seedbox because I don't have a credit card and I'm not sure if there are any free VPSs you can trust.

Name: Anonymous 2013-11-11 2:49

>>36
If you want I can put it up on a Tor hidden service too as a simple http download.

Name: replaying ?v=ESA63E6SzgE 2013-11-11 3:07

>>36
Then use I2P like I said. Unless you are willing to host your own GNUnet ECRS or Freenet Address Resolution Keys after you have downloaded it (even though I have FrozenVoid's copy).

Also, here you can get a no-strings attached credit card:
https://enumbered.com/
Dear eNumbered client, the website will be offline for one week while we upgrade security and add additional functionality to the site. Full functionality will be ...

And I have a list of VPS service providers, even free ones somewhere in South America. I am just not ready to share until I found the one in Antigua, or ones Intellectually Property sane, listed on this thread:
https://bbs.progrider.org/prog/read/1378158333/100-106,126-136,138,142-147,

Name: Anonymous 2013-11-11 17:10

>>38
I totally missed that part where you suggested using I2P. Forgive me for my mental retardation.

But I'd like to accept >>37-san's offer. I already have the Tor bundle.

Name: Anonymous 2013-11-11 17:59

Name: Anonymous 2013-11-11 18:04

>>40
(´・ω・`) domo

Name: Anonymous 2013-12-01 10:03

>>40

Okay, so I downloaded this but I have no idea how to browse it.

Name: Anonymous 2013-12-01 15:24

>>42
Do you come from re/g/g/it?

Name: Anonymous 2013-12-01 23:35

>>42

OP here, I wroet software for browsing / searching it and posted it a while ago, see here: http://progrider.org/prog/read/1383362137/9

That's Business With .NET
That's Business With Smoked Meat
The Doctor Is Eternal
Let's take it to the next level
Let's all go to the hotel pool as we finish the bottle
It's very foolish to think reality is normal
Let's get this party started right

Name: Anonymous 2013-12-02 10:20

>>43
Hmm No.

Name: Anonymous 2013-12-03 3:42

>>44
You are as bad as >>42
What the fuck is wrong with SQL queries? Your stupid script can be replaced by one SQL line.

Name: >>44 2013-12-04 0:44

>>46
I'm sorry. I'm a noob when it comes to SQL queries. I don't really know anything about them, in fact. I just wanted to browse my /prog/ and I coded this up as quickly as I could. The good thing about it is that it works and I can easily use it with the web interface. Again, I'm not an SQL expert at all. I'm sorry.

Name: >>44 2013-12-04 0:46

>>46
Also much of what the script does is display the posts in a browseable HTML format close to the original site, and make sure stuff like spoilers work. Again, I'm sorry my noob-tier quickly-put-together code offended you and I appreciate your advice.

Name: Anonymous 2013-12-04 1:24

>>44,47,48,FIoC-ista
Are you also the one that didn't have his files organized in his book and music directory? If so, then it makes sense. Don't feel bad you do not know relational expression. You can at the least start learning some SQLite:
https://anonfiles.com/file/88f2d9b81c29f64189d86310187c530f
password: w4ffl3s
https://github.com/vhf/free-programming-books/blob/master/free-programming-books.md#sql-implementation-agnostic
http://www.sqlite.org/quickstart.html
http://www.sqlite.org/docs.html
http://www.askyb.com/sqlite/learn-sqlite-in-1-hour/ (you'll like this one)
http://www.tutorialspoint.com/sqlite/ (no idea, first time seen)

Then start piping some of that html into entries on a table, until you are adept regular expressions to filter out that junk with a better markup.

Name: Anonymous 2013-12-04 5:10

Name: Anonymous 2013-12-04 10:53

>>50
http://world4search.no-ip.org:8080/

world4ch stopped blocking access to subject.txt on November 24, so the non-shitty ways of scraping it work again.

Name: Anonymous 2013-12-04 17:40

>>51
Is there anything worth scraping after 2013-08-13? That's what >>50 uses. I don't want to soil the database with shit like `le pedophile sage' and all the /g/ shitposting.

Name: Anonymous 2013-12-04 19:39

>>52
Use regex to filter all that shit out too. U are going to remove a lot of "X{+}D{+}", even "8={+}D". If you want to be careful, just place them in the review queue.

Name: Anonymous 2013-12-04 19:41

>>52
there have been maybe a few more lambda arthur calculus appearances, and there was a thread the other day from an oldfag who hadn't visited in two years saying to spread love and not trolling which was kind of amusing.

a few of my threads about steam's drm, and about cloud bullshit, predate 2013-08 but didn't make it into the scrape

idk

Name: Anonymous 2013-12-04 20:58

>>52
Not
at
all.

It's full of spam, ``le pedophile sage'' and many other epic memes. Luke also posts a lot more now.

Name: Anonymous 2013-12-07 14:39

>>52
There's nothing worth scraping before 2013-08-13 either. If you're going to have a [spoiler]/prog/[/spoiler] index, you just have to take it as given that almost all of it is going to be shitposts.

>>54
oldfag
Please stop.

Name: Anonymous 2013-12-09 21:16

I'll host this here as well:

https://bbs.progrider.org/files/archives/

Name: Anonymous 2013-12-09 21:18

>>57
thanks for hosting it

btw why does it 403 on HTTP but not HTTPS ? is the /prog/ archive so wack crazy that we must always encrypt it ?! like is there nasty stuff in it ?

Name: Anonymous 2013-12-09 21:20

oh wait it just 403s the directory listing

well

i've been wanting to do this for a long time

http://web.archive.org/web/*/bbs.progrider.org/files/archives/prog-20130813130608.db.xz

Name: Anonymous 2013-12-09 21:22

>>58,59
Sorry, I had forgot to set the index directive for the http scope as well.

Should work both ways now.

Name: sage 2013-12-09 21:25

>>60

thanks a whole lot admin :) thanks to your http hosting /prog/ is now archived in the internet archive for posterity.
(and it's also easy to access via http from here)

Name: Anonymous 2013-12-09 21:48

>>61

          | ̄ ̄|
        _☆☆☆_  / ̄ ̄ ̄ ̄ ̄ ̄
         ( ´∀`) <  No problem!
        /    |    \______
       /       .|     
       / "⌒ヽ |.イ |
   __ |   .ノ | || |__
  .    ノく__つ∪∪   \
   _((_________\
    ̄ ̄ヽつ ̄ ̄ ̄ ̄ ̄ ̄ | | ̄
   ___________| |
    ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄| |

Name: Anonymous 2016-11-17 6:05

Gee, time to write a reader. All related HTML archives are dead.

Name: Anonymous 2016-11-17 8:34

I have the JSON+HTML for all of world4ch up to September 2013. Does anyone still need it?

Name: Anonymous 2016-11-17 11:02

Name: Anonymous 2016-11-18 7:09

dubs
check fucking dubs

Name: Anonymous 2016-11-18 8:52

1.4G prog-20130813130608.db
(it's 55mb when compressed)
Nice compression factor. I bet if you exclude posts below 120-150 chars, it would be much smaller.

Name: Anonymous 2016-11-19 7:27

>>65

Thank Marisa for Xarn. Even from the shadows, even years after Haskelling himself, he still fights for us.

Name: Anonymous 2016-11-19 12:28

>>68
he is a complete piece of shit

Name: Anonymous 2016-11-19 12:31

>>69
>69
Nice.

Name: Anonymous 2016-12-22 11:48

>>65
Holy shit, best thing Xarn ever did...
I was looking up google for any archives to w4c before I found this post.

Name: Anonymous 2016-12-23 0:54

>>65
Now make it searchable and we have a winner.

Name: Anonymous 2016-12-23 1:24

search my anus

Name: sage 2016-12-23 20:00

>>72
Did you read the frontpage?

Name: Anonymous 2016-12-24 21:46

>>74
going to world4search.readsicp.org gives
This doesn't matter.

Name: Anonymous 2016-12-26 6:49

tfw prog used to be good

Name: Anonymous 2016-12-26 8:10

tfw dubs used to be good

Name: Anonymous 2016-12-26 9:02

Hey Xarn, can you make the archive automatically update links to other threads...

eg
https://archive.tinychan.org/read/prog/1297292711

All the links in post 6 there.

Name: Anonymous 2016-12-26 22:51

Array.from(document.querySelectorAll('a[href*="dis.4chan"]')).forEach(x => { let l = x.href.replace("dis.4chan", "archive.tinychan"); x.textContent = l; x.href = l; })

Name: Anonymous 2016-12-27 2:41

>>79
Why do I have to do this?

Don't change these.
Name: Email:
Entire Thread Thread List