Pull requests: yasserg/crawler4j

Pull requests list

try to solve issue#416 have a look and notified me
#480 opened Nov 15, 2024 by chshiv
Upgrade to docker-compose-rule-junit4 1.7.0
#464 opened Sep 17, 2021 by uarlouski
Synch
#461 opened Apr 8, 2021 by ZeeBeeGit
add jetty server response some info
#453 opened Sep 27, 2020 by linweisen
added disregarded protocols
#446 opened May 31, 2020 by mihalispap
Changed BasicCrawler to WebCrawler
#445 opened May 14, 2020 by SpyrosKou
Selenium basic integration
#444 opened May 10, 2020 by dgoiko
Retry on ContentFetchError
#437 opened Feb 3, 2020 by dgoiko
Generic crawl controller
#434 opened Jan 25, 2020 by dgoiko
Configurable inmediate redirection
#433 opened Jan 25, 2020 by dgoiko
Base clases provide more protected methods for subclasses
#432 opened Jan 25, 2020 by dgoiko
Cloneable CrawlConfig
#431 opened Jan 25, 2020 by dgoiko
Timeoutable regular expressions in RobotstxtServer
#429 opened Jan 24, 2020 by dgoiko
Granularity in exception
#428 opened Jan 24, 2020 by dgoiko
Configurable database names
#426 opened Jan 9, 2020 by dgoiko
Update ImageCrawler.java
#422 opened Nov 20, 2019 by zhengxl5566
Extracted interfaces from Parser and PageFetcher
#421 opened Nov 16, 2019 by dgoiko
Added POST capabilities
#419 opened Nov 14, 2019 by dgoiko
Add tests for util
#411 opened Aug 5, 2019 by romainbrenguier
closes #399: on-the-fly calculation of checksum
#400 opened Mar 29, 2019 by cnsgithub
Feature/spring boot example
#382 opened Dec 18, 2018 by s17t
Deadlink
#373 opened Nov 20, 2018 by struberg
added custom html content filter
#168 opened Nov 7, 2016 by pdesmet
Began work on an asynchronous crawling
#157 opened Sep 8, 2016 by lostmsu