Commit Graph

124 Commits

Author SHA1 Message Date
Mr. Pine
549ce6cce9 Ignore unavailable elements (#119) 2025-05-28 17:04:57 +02:00
I-Al-Istannen
34564cedb4 Add support for link collections 2025-05-27 16:25:59 +02:00
I-Al-Istannen
2b0d20a1f6 Fix crawling of exercises with instructions
We do not want a second path and the instruction field has an identical
link...
2025-05-26 14:42:38 +02:00
I-Al-Istannen
4c230ef6dd Fix exercise crawling 2025-04-25 13:45:57 +02:00
Nikolas Heise
b305e1ce23 Fix login using the native ilias login form 2025-04-23 16:08:45 +02:00
I-Al-Istannen
bdf17f5c87 Ignore wikis 2025-04-23 16:03:37 +02:00
I-Al-Istannen
653bf139f0 Fix encoding of descriptions and force images to light mode 2025-04-16 10:52:18 +02:00
I-Al-Istannen
477234ad0d Support ILIAS 9 2025-04-15 21:35:20 +02:00
I-Al-Istannen
63f25277b0 Fix crawling of empty forum threads 2025-03-09 23:44:25 +01:00
I-Al-Istannen
c8eff04ae0 Make thread titles link to original ILIAS thread 2025-02-19 16:23:20 +01:00
I-Al-Istannen
edc482cdf4 Internalize images in forum threads 2025-02-19 16:23:20 +01:00
I-Al-Istannen
72cd0f77e2 Prettify forum thread exports
Co-authored-by: Tim <me@scriptim.dev>
2025-02-19 16:23:20 +01:00
I-Al-Istannen
be175f9347 Download only new/updated forum threads 2025-02-19 16:16:37 +01:00
I-Al-Istannen
ba2833dba5 Crawl all threads in a forum
Before this patch the row count was unconditionally changed to 800. This
patch tries to detect how many rows the forum has and then fetches this
amount, if it is larger than 800.
2025-02-19 12:19:33 +01:00
I-Al-Istannen
5f88539f7e Fix page size increase for forum threads 2025-02-19 12:19:11 +01:00
I-Al-Istannen
bd9d7efe64 "Fix" mypy errors
Thank you mypy, very cool. These types make things *so much better*.
They don't just complicate everything and don't really help because they
can not detect that an element queried by a tag is no navigable
string...
2025-02-19 12:15:41 +01:00
Aurelia
16a2dd5b15 fix: totp 2025-02-19 12:15:41 +01:00
I-Al-Istannen
712217e959 Handle groups in cards 2024-11-11 12:53:08 +01:00
I-Al-Istannen
6dda4c55a8 Add doctype header to forum threads
This should fix mimetype detection on most systems and is more relevant
now that the report is clickable
2024-11-05 18:36:21 +01:00
PinieP
596b6a7688 Add support for non-KIT shibboleth login (#98)
Co-authored-by: Mr-Pine <git@mr-pine.de>
Co-authored-by: I-Al-Istannen <I-Al-Istannen@users.noreply.github.com>
2024-11-05 18:30:34 +01:00
I-Al-Istannen
f5c4e82816 Delay ilias loop detection after transform
This allows users to filter out duplicated elements and suppress the
warning.
2024-11-02 22:46:51 +01:00
I-Al-Istannen
f5273f7ca0 Collapse ilias url crawling into normal page crawling 2024-11-02 22:46:51 +01:00
I-Al-Istannen
fa71a9f44f Add support for mob videos in page descriptions 2024-10-28 20:35:30 +01:00
I-Al-Istannen
81d6ff53c4 Respect row flex in descriptions 2024-10-28 19:41:03 +01:00
I-Al-Istannen
d7a2b6e019 Delete videos from course descriptions 2024-10-28 19:41:03 +01:00
I-Al-Istannen
71c65e89d1 Internalize images in course descriptions 2024-10-28 19:41:03 +01:00
I-Al-Istannen
c1046498e7 Fix download of links without a target URL
They are now downloaded as links to the empty url.
2024-10-28 19:41:03 +01:00
I-Al-Istannen
8fbd1978af Fix crawling of nested courses 2024-10-28 18:52:27 +01:00
I-Al-Istannen
c54c3bcfa1 Fix crawling of favorites 2024-10-27 10:50:59 +01:00
I-Al-Istannen
52fdeae752 Crawl custom item groups as folders 2024-10-21 23:43:48 +02:00
I-Al-Istannen
f9bb2e41cf Sanitize slashes in exercise container names 2024-10-21 22:30:16 +02:00
I-Al-Istannen
c897d9e2f5 Support finding entries for course overview page
Related to issue #93
2024-06-26 16:54:07 +02:00
Joscha
778517d8c6 Fix KIT crawler requiring base_url and client_id options 2024-05-10 14:12:45 +02:00
Joscha
428b0179fc Remove IliasConfig
Also uses urljoin() in a few places that previously used string
concatenation or fstrings.

At this point, there isn't yet a need for IliasConfig, so I'd rather
keep the code base simpler and more consistent. Should we need a
structure like IliasConfig in the future (maybe because we have a few
more ilias parsers), it's easy to add back.
2024-05-10 14:09:14 +02:00
Philipp Fruck
b01f093474 fix: Element detection for other universities
Other universities might use other URL schemes
for different element types
2024-05-04 17:52:06 +02:00
Philipp Fruck
3a05b90525 fix circular import for _io_repeat 2024-05-04 17:51:59 +02:00
Philipp Fruck
7a00f73e0e feat: Add authentication to generic ilias dl 2024-05-04 17:51:38 +02:00
Philipp Fruck
5d0621420e feat: Generic ilias_web command 2024-05-04 17:44:37 +02:00
Philipp Fruck
fc1f68ccd9 refactor: Separate generic and KIT ilias functions 2024-05-04 17:44:18 +02:00
I-Al-Istannen
3e831c7e23 Fix normalization of meeting names in cards 2024-04-24 22:32:26 +02:00
I-Al-Istannen
3db186a978 Fix personal desktop crawling HTML warnings 2024-04-10 11:15:25 +02:00
I-Al-Istannen
4a5959fd58 Fix personal desktop crawling without favorites 2024-04-10 11:15:25 +02:00
I-Al-Istannen
1cbc2b717a Fix personal desktop crawling with ILIAS 8 2024-04-10 01:20:37 +02:00
I-Al-Istannen
c1b592ac29 Fix ILIAS 8 file downloads truncating to zero bytes 2024-04-08 17:59:41 +02:00
I-Al-Istannen
eb0c956d32 Add compatibility with ILIAS 8 2024-04-05 19:08:05 +02:00
I-Al-Istannen
a117126389 Fix video name deduplication 2023-12-09 23:08:42 +01:00
I-Al-Istannen
e9f8901520 Fix typos in ilias crawler and use set literals 2023-11-30 20:57:57 +01:00
I-Al-Istannen
266812f90e Move is_logged_in helper to kit_ilias_html 2023-11-16 11:19:20 +01:00
I-Al-Istannen
50b50513c6 Ignore SCORM learning modules 2023-08-29 13:51:19 +02:00
I-Al-Istannen
df3514cd03 Crawl paginated past meetings 2023-08-29 12:41:21 +02:00