MCPcopy
hub / github.com/scrapy/scrapy / get_crawler

Function get_crawler

scrapy/utils/test.py:60–82  ·  view source on GitHub ↗

Return an unconfigured Crawler object. If settings_dict is given, it will be used to populate the crawler settings with a project level priority.

(
    spidercls: type[Spider] | None = None,
    settings_dict: dict[str, Any] | None = None,
    prevent_warnings: bool = True,
)

Source from the content-addressed store, hash-verified

58
59
60def get_crawler(
61 spidercls: type[Spider] | None = None,
62 settings_dict: dict[str, Any] | None = None,
63 prevent_warnings: bool = True,
64) -> Crawler:
65 """Return an unconfigured Crawler object. If settings_dict is given, it
66 will be used to populate the crawler settings with a project level
67 priority.
68 """
69 # When needed, useful settings can be added here, e.g. ones that prevent
70 # deprecation warnings.
71 settings: dict[str, Any] = {
72 **get_reactor_settings(),
73 **(settings_dict or {}),
74 }
75 runner: CrawlerRunnerBase
76 if is_reactor_installed():
77 runner = CrawlerRunner(settings)
78 else:
79 runner = AsyncCrawlerRunner(settings)
80 crawler = runner.create_crawler(spidercls or DefaultSpider)
81 crawler._apply_settings()
82 return crawler
83
84
85def get_pythonpath() -> str:

Callers

nothing calls this directly

Calls 6

is_reactor_installedFunction · 0.90
CrawlerRunnerClass · 0.90
AsyncCrawlerRunnerClass · 0.90
get_reactor_settingsFunction · 0.85
create_crawlerMethod · 0.80
_apply_settingsMethod · 0.80

Tested by

no test coverage detected