This repository was archived by the owner on Nov 28, 2023. It is now read-only.

RSC speed optimization #14

@FeeiCN

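When enumerating subdomains of a domain with wildcard DNS resolution, the main speed bottleneck is not the network request time but the response similarity comparison.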

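Python's difflib.SequenceMatcher offers three methods for comparing string similarity, listed here from fastest to slowest:
real_quick_ratio (relative speed 4) > quick_ratio (relative speed 2) > ratio (relative speed 1)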
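
For reference, a minimal sketch of calling the three methods on the same pair of strings. The sample pages and the helper name `similarity_scores` are made up for illustration. Per the difflib documentation, `quick_ratio()` and `real_quick_ratio()` are upper bounds on `ratio()`, so the faster methods trade accuracy for speed.

```python
from difflib import SequenceMatcher


def similarity_scores(a: str, b: str) -> dict:
    """Return all three SequenceMatcher similarity scores for a and b."""
    matcher = SequenceMatcher(None, a, b)
    return {
        # Cheapest: an upper bound computed from the two lengths only.
        "real_quick_ratio": matcher.real_quick_ratio(),
        # Middle: an upper bound based on shared character counts.
        "quick_ratio": matcher.quick_ratio(),
        # Most expensive: based on the actual matching blocks.
        "ratio": matcher.ratio(),
    }


# Hypothetical response bodies, not taken from the project.
page_a = "<html><body>Welcome to example.com</body></html>"
page_b = "<html><body>Welcome to example.org</body></html>"
scores = similarity_scores(page_a, page_b)
print(scores)
```

Because the two sample strings have the same length, `real_quick_ratio()` reports 1.0 even though the contents differ, which shows how coarse the fastest method is.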

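Even with the fastest method, real_quick_ratio, local string comparison runs at fewer than 50 comparisons per second.
So even if network request time is ignored entirely, the response similarity comparison alone for 170,000 subdomains takes close to 1 hour (170,000 / 50 = 3,400 seconds, about 57 minutes).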
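
The estimate above can be reproduced with a small sketch. The function names `comparisons_per_second` and `estimated_minutes` are hypothetical helpers, not part of the project, and the measured rate depends on the machine and on the size of the response bodies.

```python
import time
from difflib import SequenceMatcher


def comparisons_per_second(baseline: str, candidate: str, rounds: int = 200) -> float:
    """Measure how many real_quick_ratio comparisons complete per second."""
    start = time.perf_counter()
    for _ in range(rounds):
        # A new matcher is built each round, as when every response
        # is compared independently against the baseline.
        SequenceMatcher(None, candidate, baseline).real_quick_ratio()
    elapsed = time.perf_counter() - start
    return rounds / elapsed


def estimated_minutes(total_comparisons: int, rate_per_second: float) -> float:
    """Convert a workload size and a rate into minutes of runtime."""
    return total_comparisons / rate_per_second / 60


# Figures from the issue: 170,000 subdomains at 50 comparisons per second.
print(round(estimated_minutes(170_000, 50), 1))
```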

For now, the only option appears to be writing a new page similarity algorithm from scratch.
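
As one possible direction only, and not the approach the project adopted, a cheaper similarity measure could compare token sets instead of full character sequences. The sketch below assumes Jaccard similarity over word tokens. The names `token_set` and `jaccard` and the sample pages are hypothetical.

```python
import re

# Assumption: word-like tokens are enough to tell pages apart.
_TOKEN = re.compile(r"\w+")


def token_set(html: str) -> frozenset:
    """Reduce a response body to the set of its lowercase word tokens."""
    return frozenset(_TOKEN.findall(html.lower()))


def jaccard(a: frozenset, b: frozenset) -> float:
    """Jaccard similarity: size of the intersection over size of the union."""
    if not a and not b:
        return 1.0  # two empty pages are treated as identical
    return len(a & b) / len(a | b)


# The wildcard baseline is tokenized once and reused for every candidate.
baseline = token_set("<html><body>Welcome to our site</body></html>")
same = token_set("<html><body>Welcome to our site</body></html>")
other = token_set("<html><body>Login required: admin panel</body></html>")

print(jaccard(baseline, same))
print(jaccard(baseline, other))
```

Tokenizing the baseline response once and reusing it means each candidate costs one tokenization plus two set operations. The measure ignores token order and repetition, so it is coarser than `ratio()`.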

Labels: enhancement (New feature or request)