When the self-owned search function was cancel by the monkey, I planned to develop this tool. After thinking and strolling in my room for 2 hours, I thought I had known what to do and started to make it.
Python, which I fought for this semi-finished Acfun Search Tool with, was new for me 3 days ago. I had a chance to choose a script to develop the tool, for I’ve know nothing about technique newer than JAVA. Finally I chose Python, because:
- It worked immediately after it was installed without any other configurations.
- It ran as a double-click rather than “python yourname.py<enter>”, like VBScript. It is much shorter, isn’t it?
- For the reality. Google App Engine has supported it, and I meant to put the tool there to benefit from the powerful Google. Later I realized that GAE has maximum of 1M for data-transfer, so I changed my hosting place.
- 实际上是因为Google App Engine支持它。原本我打算在GAE上放这个程序，以便利用实力强大的Google，但我很快发现GAE只支持1M以内的数据读写，于是我改放到其它免费空间中。
At the very start, I planned to fecth all the data from acfun.cn first, keep it up to date, and built a RSS feed for it so others can secondary exploite it, but I decide to finish the search function first. I spent 2 days grabing datas from Acfun, modifying my crawler to suit to the irregular but met frequently data for several times, and change the data-structure from CSV file to SQLite database(CSV has been used for GAE). When Acfun became busy at prime time, I had to pause the crawler.
Finally the crawler accompalished its mission and was reformed to keep the data new. I’ve been aware of Python web frameworks such as django, but I decided to use CGI script, because it’s much easier for me who knows nothing about frameworks or MVCs or MVTs. When I directly use the hosting to test my scripts, my work increasing geometrically. Now it works, although hardly with user-interface.
数据抓完后，我便将爬虫改成数据更新器。虽然留意了很多Python的web架构，但我完全不懂那些东西，所以还是决定用最简单的Python CGI Scripts。因为要在远程服务器上调试，所以工作量非常大，无论如何现在是弄好了，但几乎没有任何用户界面……能用就行。
I’ve published to TIEBA of Acfun so as to ease maybe some load of Acfun Server. I hope people will benefit from it, and I have to go back to revise now. Visit @ http://illustrate.heliohost.org/ac.htm.