Results 11 to 18 of 18
how does this pertain to the firfox pipelining feature? I think it works by opening multiple connections on the host and thus utilizing the clients bandwidth to the fullest. (this ...
- 09-11-2004 #11
how does this pertain to the firfox pipelining feature? I think it works by opening multiple connections on the host and thus utilizing the clients bandwidth to the fullest. (this is a guess, i could be really wrong about that)
- 09-11-2004 #12Linux Guru
- Join Date
- Apr 2003
- Location
- London, UK
- Posts
- 3,284
That wouldnt be a problem at all. 1 request would be for a PHP page and the rest would be for images which the server can handle no problem.
Originally Posted by qub333
What causes the problem is 100's of requests for php pages using database connections in the space of a few seconds. Only scripts do this sort of thing.
Jason
- 09-11-2004 #13
thought so, just wanted to make sure, ive dabbled with those type scripts before, I used one to download the gentoo handbook, but Ive found them to be a pain to make and operate. And it wasnt untill if was baffled by incredably slow resonse tiems on some of my servers that i understood what a pain they are. I guess if you really wanted the whole thing, you could be curtious and set the script for one page at a time with a 5 sec cooldown between or summin, take a long time, but at least that way othres can still use the forum.
- 09-12-2004 #14
Jason, how about adding some line like:
"sleep(1);" or "usleep(250);" before/after some database ops (of better on top all pages (it would take n (µ)secons before the client would recieve any more links to traverse)) - not much of a slow down for _users_ to notice, but it'll give server (much) more time I belive.
-- it would slow down scripts the most, as they'd have to wait -at least- the sleep time specified before next db query or the next page of links to assault.
(tell me if you don't understand what I mean and I'll try to examplify)Regards Scienitca (registered user #335819 - http://counter.li.org )
--
A master is nothing more than a student who knows something of which he can teach to other students.
- 09-12-2004 #15
Jason, i don't know if it's connected with this problem at all, but i had a little problem gettin onto the site this morning at about 10:10am UK time. on Sunday 12 Sept. Then when i finally got on, it said there were no new posts since my last visit, when i know there were more posts.
"I am not an alcoholic, alcoholics go to meetings"
Registered Linux user = #372327
- 09-14-2004 #16Linux Engineer
- Join Date
- Jul 2003
- Location
- Uppsala, Sweden
- Posts
- 1,278
does this ban aply to a recursive "wget" ? not that i was planning on doing it, but im on 56k and can't see it being too much of a problem for the server if i did decide to do it.
Proud to be a GNU/Gentoo Linux user!
- 09-14-2004 #17Linux Guru
- Join Date
- Apr 2003
- Location
- London, UK
- Posts
- 3,284
Any automated hit bots, file getters, page grabbers, custom scripts are all a problem, even if they "take it slowly". I'll explain why.
Originally Posted by variant
Site speed
While it is very easy to say it shouldnt be a problem if 1 person has a script running, from a performance standpoint it does become a real issue when 100 people do it.
Loss of revenue
We have both paid advertisers and banner exchanges for 468x60 banners that you see in the top right of your screen (new design). Every time a page is requested, we have delivered 1 banner impression. Our advertisers currently pay per 1,000 impressions.
A big problem arises when scripts request multiple pages because the number of impressions goes up, while the number of clicks in comparison stays low/doesnt move, which means the advertiser gets bad value for money.
This can lead to 2 things happening, 1) the advertiser doesnt renew with us because we didnt deliver value for money, and 2) we have to lower our prices because we deliver less clicks per 1,000 impressions.
Yes, I've put in a good set of measures to stop us billing for robotic impressions and to protect our advertisers and supporters, but scripts can still occaisionally create issues.
Aggravation and a waste of my time
I have to trawl through HTTP logs to find people running these scripts, then i have to write letters to their ISP, and sort out blocking them from accessing my server again. This has taken hours of my time already, and this is time i could have better spent using developing new features for the site and promoting it. Once again, it hurts the site in the long run.
So far I have banned 3 people on the basis of running automated scripts, and will continue to ban, block IP's, and email ISP's until the message sinks in.
To summarise: Any type of automated or recursive script/program/whatever you want to call it, is not permitted.
Jason
- 09-14-2004 #18Linux Guru
- Join Date
- Apr 2003
- Location
- London, UK
- Posts
- 3,284
While i see what you are saying, i think the proper way to fix the problem and improve performance generally is to optimise SQL queries, and tune the database and site code, which i am almost constently doing.
Originally Posted by scientica


Reply With Quote
