| Author | Commit | Message | Date |
|---|---|---|---|
| Joscha | 5c3942a13d | Fix flake8 error | 2023-04-19 10:12:48 +02:00 |
| Joscha | 6f87c5c774 | Make ipd crawler synchronous | 2023-04-19 10:12:48 +02:00 |
| Joscha | c0d6d8b229 | Use url after redirect for relative links | 2022-11-21 18:10:45 +01:00 |
| I-Al-Istannen | f47d2f11d8 | Append trailing slash to kit-ipd links to ensure urljoin works as expected | 2022-10-25 20:28:22 +02:00 |
| Joscha | 616b0480f7 | Simplify IPD crawler link regex | 2022-05-08 18:18:05 +02:00 |
| Joscha | af2cc1169a | Mention href for users of link_regex option | 2022-05-05 14:36:03 +02:00 |
| Joscha | bc3fa36637 | Fix IPD crawler crashing on weird HTML comments | 2022-05-05 14:35:42 +02:00 |
| I-Al-Istannen | b8fe25c580 | Add .cpp to ipd link regex | 2022-05-04 14:19:26 +02:00 |
| I-Al-Istannen | 5f527bc697 | Remove Python 3.9 Pattern typehints | 2022-01-08 17:14:40 +01:00 |
| I-Al-Istannen | 88afe64a92 | Refactor IPD crawler a bit | 2021-11-02 01:25:01 +00:00 |
| Julius Rüberg | 6b2a657573 | Fix IPD crawler for different subpages (#42). This patch reworks the IPD crawler to support subpages which do not use "/intern" for links and fetches the folder names from table headings. | 2021-11-02 01:25:01 +00:00 |
| I-Al-Istannen | 6673077397 | Add kit-ipd crawler | 2021-10-21 13:20:21 +02:00 |
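
Commit f47d2f11d8 mentions appending a trailing slash so that `urljoin` behaves as expected. The sketch below is not the crawler's code; it only illustrates the standard-library behavior that commit message refers to. The helper name `ensure_trailing_slash` and the `example.com` URL are hypothetical, chosen purely for illustration.

```python
from urllib.parse import urljoin


def ensure_trailing_slash(url: str) -> str:
    """Hypothetical helper: append '/' to the URL if it is missing."""
    return url if url.endswith("/") else url + "/"


# Placeholder base URL, not taken from the crawler.
base = "https://example.com/lehre/course"

# Without a trailing slash, urljoin treats "course" as a file name
# and replaces it with the relative link.
without_slash = urljoin(base, "slides.pdf")

# With a trailing slash, "course/" is treated as a directory
# and the relative link is resolved inside it.
with_slash = urljoin(ensure_trailing_slash(base), "slides.pdf")

print(without_slash)  # https://example.com/lehre/slides.pdf
print(with_slash)     # https://example.com/lehre/course/slides.pdf
```

This follows the usual relative-reference resolution rules: the last path segment of the base is dropped unless the base path ends in a slash.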