About the scrapy concurrency model

Question

About the scrapy concurrency model

Now I plan to use scrapy in a more distributed approach, and I'm not if the spiders / pipelines / bootloaders / schedulers and the engine are all placed in separate processes or threads, can there be information about this? and can we change the number of processes / threads for each component? I know that there are two settings "CONCURRENT_REQUESTS" and "CONCURRENT_ITEMS", they will determine parallel flows for loaders and pipelines, right? and if I want to deploy spiders / pipelines / bootloaders on different machines, I need to serialize items / requests / answers, right? Appreciate so much for your help!

Thanks Edward.

+5

scrapy

user1441208 Jun 07 '12 at 3:09

source share

1 answer

escitalopram · Answer 1 · 2012-11-15T14:27:22+0000

Scrapy - . . Twisted Framework.

, Scrapy, . Redis, RabbitMQ

Scrapyd

About the scrapy concurrency model

More articles: