I-Al-Istannen
e9f8901520
Fix typos in ilias crawler and use set literals
2023-11-30 20:57:57 +01:00
I-Al-Istannen
266812f90e
Move is_logged_in helper to kit_ilias_html
2023-11-16 11:19:20 +01:00
I-Al-Istannen
50b50513c6
Ignore SCORM learning modules
2023-08-29 13:51:19 +02:00
I-Al-Istannen
df3514cd03
Crawl paginated past meetings
2023-08-29 12:41:21 +02:00
I-Al-Istannen
87b67e9271
Crawl files in the info tab
2023-08-29 12:41:15 +02:00
I-Al-Istannen
b54b3b979c
Remove size suffix for content pages
2023-08-27 11:43:05 +02:00
I-Al-Istannen
2184ac8040
Add support for ILIAS mediacast listings
2023-08-27 11:43:05 +02:00
I-Al-Istannen
68c398f1fe
Add support for ILIAS learning modules
2023-08-02 13:34:54 +02:00
I-Al-Istannen
123a57beec
Fix mypy unreachable error in file_templates
2023-07-29 18:36:33 +02:00
I-Al-Istannen
d204dac8ce
Detect unexpected root page redirects and abort operation
2023-07-29 18:36:33 +02:00
I-Al-Istannen
6f30c6583d
Fix crawling of cards without descriptions
2023-03-21 23:52:33 +01:00
I-Al-Istannen
467fc526e8
Fix crawling of file/video cards
2023-03-21 23:52:24 +01:00
I-Al-Istannen
722d2eb393
Fix crawling of courses with preselected timeline tab
2023-03-21 23:36:47 +01:00
c0derMo
55a2de6b88
Fix crawling English opencast
2022-11-29 18:13:56 +01:00
Joscha
c0d6d8b229
Use url after redirect for relative links
2022-11-21 18:10:45 +01:00
I-Al-Istannen
c020cccc64
Include found paths in "second path found" warning
2022-10-29 14:08:29 +02:00
I-Al-Istannen
f47d2f11d8
Append trailing slash to kit-ipd links to ensure urljoin works as expected
2022-10-25 20:28:22 +02:00
I-Al-Istannen
1b6be6bd79
Handle content pages in cards
2022-10-24 18:37:26 +02:00
I-Al-Istannen
e1430e6298
Handle (and ignore) surveys
2022-10-24 18:37:26 +02:00
I-Al-Istannen
5fdd40204b
Unwrap future meetings when ILIAS hides them behind a pagination
2022-10-24 14:33:58 +02:00
I-Al-Istannen
fb4631ba18
Fix ilias background login
2022-10-24 13:13:36 +02:00
I-Al-Istannen
d72fc2760b
Handle empty forums
2022-10-24 13:12:17 +02:00
I-Al-Istannen
4a51aaa4f5
Fix forum crawling crashing for empty threads
2022-10-19 22:59:33 +02:00
I-Al-Istannen
d9b111cec2
Correctly nest description entries
2022-08-14 21:45:33 +02:00
I-Al-Istannen
345f52a1f6
Detect new login button
2022-08-14 21:41:29 +02:00
I-Al-Istannen
46fb782798
Add forum crawling
This downloads all forum posts when needed and saves each thread in its
own html file, named after the thread title.
2022-05-24 23:43:53 +02:00
I-Al-Istannen
846c29aee1
Download page descriptions
2022-05-11 21:16:56 +02:00
I-Al-Istannen
a5015fe9b1
Correctly parse day-only meeting dates
I failed to recognize the correct format in the previous adjustment, so
this (hopefully) fixes it for good.
Meetings apparently don't always have a time portion.
2022-05-08 23:22:26 +02:00
Joscha
616b0480f7
Simplify IPD crawler link regex
2022-05-08 18:18:05 +02:00
I-Al-Istannen
bcc537468c
Fix crawling of expanded meetings
The last meeting on every page is expanded by default.
Its content is then shown inline *and* in the meeting page itself.
We should skip the inline content.
2022-05-05 22:53:37 +02:00
I-Al-Istannen
694ffb4d77
Fix meeting date parsing
Apparently the new pattern "<relative time qualifier>: <date>," was
added. This patch adds support for it.
2022-05-05 22:28:30 +02:00
Joscha
af2cc1169a
Mention href for users of link_regex option
2022-05-05 14:36:03 +02:00
Joscha
bc3fa36637
Fix IPD crawler crashing on weird HTML comments
2022-05-05 14:35:42 +02:00
I-Al-Istannen
b8fe25c580
Add .cpp to ipd link regex
2022-05-04 14:19:26 +02:00
Joscha
b56475450d
Use utf-8 for cookies
2022-04-29 23:12:41 +02:00
Joscha
602044ff1b
Fix mypy errors and add missing await
2022-04-27 22:52:50 +02:00
I-Al-Istannen
a2831fbea2
Fix shib authentication
Authentication failed previously if the shib session was still valid.
If Shibboleth gets a request and the session is still valid, it directly
responds without a second redirect.
2022-04-27 13:55:24 +02:00
I-Al-Istannen
86e2e226dc
Notify user when shibboleth presents new entitlements
2022-04-03 11:37:08 +02:00
I-Al-Istannen
7872fe5221
Fix tables with more columns than expected
2022-01-18 22:38:48 +01:00
Joscha
4f022e2d19
Reword changelog
2022-01-15 15:06:02 +01:00
I-Al-Istannen
f47e7374d2
Use fixed windows path for video cache
2022-01-15 12:00:30 +01:00
I-Al-Istannen
57ec51e95a
Fix login after shib url parser change
2022-01-14 20:17:27 +01:00
I-Al-Istannen
4ee919625d
Add rudimentary support for content pages
2022-01-08 20:47:35 +01:00
I-Al-Istannen
d30f25ee97
Detect shib login page as login page
And do not assume we are logged in...
2022-01-08 20:28:45 +01:00
I-Al-Istannen
10d9d74528
Bail out when crawling recursive courses
2022-01-08 20:28:30 +01:00
I-Al-Istannen
43c5453e10
Correctly crawl files on desktop
The files on the desktop do not include a download link, so we need to
rewrite it.
2022-01-08 20:00:53 +01:00
I-Al-Istannen
e32c1f000f
Fix mtime for single streams
2022-01-08 18:05:48 +01:00
I-Al-Istannen
5f527bc697
Remove Python 3.9 Pattern typehints
2022-01-08 17:14:40 +01:00
I-Al-Istannen
ced8b9a2d0
Fix some accordions
2022-01-08 16:58:30 +01:00
I-Al-Istannen
6f3cfd4396
Fix personal desktop crawling
2022-01-08 16:58:15 +01:00