Commit Graph

673 Commits

Author SHA1 Message Date
I-Al-Istannen a117126389 Fix video name deduplication 2023-12-09 23:08:42 +01:00
I-Al-Istannen e9f8901520 Fix typos in ilias crawler and use set literals 2023-11-30 20:57:57 +01:00
I-Al-Istannen 266812f90e Move is_logged_in helper to kit_ilias_html 2023-11-16 11:19:20 +01:00
I-Al-Istannen 533bc27439 Bump version to 3.5.0 2023-09-13 23:13:30 +02:00
I-Al-Istannen 0113a0ca10 Update flake.lock 2023-09-13 22:23:36 +02:00
I-Al-Istannen 40f8a05ad6 Add .idea to gitignore 2023-09-13 22:23:36 +02:00
I-Al-Istannen 50b50513c6 Ignore SCORM learning modules 2023-08-29 13:51:19 +02:00
I-Al-Istannen df3514cd03 Crawl paginated past meetings 2023-08-29 12:41:21 +02:00
I-Al-Istannen ad53185247 Sanitize ascii control characters on windows 2023-08-29 12:41:15 +02:00
I-Al-Istannen 87b67e9271 Crawl files in the info tab 2023-08-29 12:41:15 +02:00
I-Al-Istannen b54b3b979c Remove size suffix for content pages 2023-08-27 11:43:05 +02:00
I-Al-Istannen 2184ac8040 Add support for ILIAS mediacast listings 2023-08-27 11:43:05 +02:00
I-Al-Istannen b3d412360b Add Nix flake 2023-08-26 23:54:19 +02:00
Mr. Pine dbc2553b11 Add default `show-not-deleted` option
If set to `no`, PFERD won't print status or report messages for not deleted files
2023-08-26 18:43:01 +02:00
I-Al-Istannen 68c398f1fe Add support for ILIAS learning modules 2023-08-02 13:34:54 +02:00
I-Al-Istannen 123a57beec Fix mypy unreachable error in file_templates 2023-07-29 18:36:33 +02:00
I-Al-Istannen d204dac8ce Detect unexpected root page redirects and abort operation 2023-07-29 18:36:33 +02:00
Mr. Pine 443f7fe839 Add `no-delete-prompt-overwrite` crawler conflict resolution option (#75) 2023-07-29 18:36:33 +02:00
I-Al-Istannen 0294ceb7d5 Update github action versions 2023-03-22 00:10:54 +01:00
I-Al-Istannen 6f30c6583d Fix crawling of cards without descriptions 2023-03-21 23:52:33 +01:00
I-Al-Istannen 467fc526e8 Fix crawling of file/video cards 2023-03-21 23:52:24 +01:00
I-Al-Istannen 722d2eb393 Fix crawling of courses with preselected timeline tab 2023-03-21 23:36:47 +01:00
Joscha 6d44aac278 Bump version to 3.4.3 2022-11-29 18:22:19 +01:00
c0derMo 55a2de6b88 Fix crawling English opencast 2022-11-29 18:13:56 +01:00
Joscha c0d6d8b229 Use url after redirect for relative links 2022-11-21 18:10:45 +01:00
Joscha 635caa765d Fix typo
Thanks, burg113
2022-11-15 17:17:57 +01:00
Pavel Zwerschke e69b55b349
Add more unofficial package managers (#66) 2022-11-04 12:18:26 +01:00
Joscha 07200bbde5 Document ilias web crawler's forums option 2022-10-31 14:12:27 +01:00
I-Al-Istannen c020cccc64 Include found paths in "second path found" warning 2022-10-29 14:08:29 +02:00
Joscha 259cfc20cc Bump version to 3.4.2 2022-10-26 18:26:17 +02:00
Joscha 37b51a66d8 Update changelog 2022-10-26 18:22:37 +02:00
I-Al-Istannen f47d2f11d8 Append trailing slash to kit-ipd links to ensure urljoin works as expected 2022-10-25 20:28:22 +02:00
I-Al-Istannen 1b6be6bd79 Handle content pages in cards 2022-10-24 18:37:26 +02:00
I-Al-Istannen e1430e6298 Handle (and ignore) surveys 2022-10-24 18:37:26 +02:00
I-Al-Istannen 5fdd40204b Unwrap future meetings when ILIAS hides them behind a pagination 2022-10-24 14:33:58 +02:00
I-Al-Istannen fb4631ba18 Fix ilias background login 2022-10-24 13:13:36 +02:00
I-Al-Istannen d72fc2760b Handle empty forums 2022-10-24 13:12:17 +02:00
I-Al-Istannen 4a51aaa4f5 Fix forum crawling crashing for empty threads 2022-10-19 22:59:33 +02:00
Joscha 66a5b1ba02 Bump version to 3.4.1 2022-08-17 13:24:01 +02:00
I-Al-Istannen aa5a3a10bc Adjust changelog 2022-08-14 21:48:59 +02:00
I-Al-Istannen d9b111cec2 Correctly nest description entries 2022-08-14 21:45:33 +02:00
I-Al-Istannen 345f52a1f6 Detect new login button 2022-08-14 21:41:29 +02:00
Joscha ed24366aba Add pass authenticator 2022-06-05 10:04:42 +02:00
I-Al-Istannen 46fb782798 Add forum crawling
This downloads all forum posts when needed and saves each thread in its
own html file, named after the thread title.
2022-05-24 23:43:53 +02:00
I-Al-Istannen 846c29aee1 Download page descriptions 2022-05-11 21:16:56 +02:00
I-Al-Istannen a5015fe9b1 Correctly parse day-only meeting dates
I failed to recognize the correct format in the previous adjustment, so
this (hopefully) fixes it for good.
Meetings apparently don't always have a time portion.
2022-05-08 23:22:26 +02:00
Joscha 616b0480f7 Simplify IPD crawler link regex 2022-05-08 18:18:05 +02:00
I-Al-Istannen 2f0e04ce13 Adjust changelog 2022-05-05 22:57:55 +02:00
I-Al-Istannen bcc537468c Fix crawling of expanded meetings
The last meeting on every page is expanded by default.
Its content is then shown inline *and* in the meeting page itself.
We should skip the inline content.
2022-05-05 22:53:37 +02:00
I-Al-Istannen 694ffb4d77 Fix meeting date parsing
Apparently the new pattern "<relative time qualifier>: <date>," was
added. This patch adds support for it.
2022-05-05 22:28:30 +02:00