BFS scraper for X accounts based in China

Online Demo: https://pluto0x0.github.io/X_based_china/

Run

1. Install Requirements

pip install aiohttp_client_cache loguru aiolimiter

2. API Key

You will need your own key for the following API's from RapidAPI:

Rename config.default.py to config.py and paste the key in it.

See Results & Statistics for API usage statistics.

3. Run

python main.py

4. Build Page

Once you get the result, build a static web page:

python render.py china.jsonl

Configuration

In config.py

SEED_ACCOUNTS = [
    "linboweibu17"
    # "KunDong95265"
]
REQUESTS_PER_SECOND = 9
MAX_HIT = 100000
OUTPUT_FILE = "china.jsonl"
MAX_FOLLOWINGS = 800

SEED_ACCOUNTS: Seed accounts in the initial queue
REQUESTS_PER_SECOND: Maximum number of requests per sec
MAX_HIT = 100000: Exit when finding 100000 accounts, 0 = no limit
OUTPUT_FILE: Output file name
MAX_FOLLOWINGS: Max number of accounts fetched from a following list, 0 = no limit

How It Works

Starting from seed accounts, Breadth-First Search for X accounts based on their following list.

Async http requests are cached with SQLite DB.

Results & Statistics

RUN #1: 5,200 results with 17,980 Twttr API request and 1,000 Twitter API requests
RUN #2: 4,200 results with 70,000 Twttr API request and 5,350 Twitter API requests

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
docs		docs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
china.jsonl		china.jsonl
config.default.py		config.default.py
main.py		main.py
render.py		render.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

BFS scraper for X accounts based in China

Run

1. Install Requirements

2. API Key

3. Run

4. Build Page

Configuration

How It Works

Results & Statistics

License

About

Uh oh!

Releases

Packages

Languages

License

pluto0x0/X_based_china

Folders and files

Latest commit

History

Repository files navigation

BFS scraper for X accounts based in China

Run

1. Install Requirements

2. API Key

3. Run

4. Build Page

Configuration

How It Works

Results & Statistics

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages