Can this work on intranet sites like SharePoint or Confluence, which require employee SSO?

I was trying to build a small LangChain-based RAG over internal documents, but getting the documents out of SharePoint/Confluence (we have both) is very painful.



Technically it can. You can log in with the PlaywrightCrawler class without issue. The open question is whether 2FA is involved as well and how it gets handled. Crawlee does not have any abstraction for handling 2FA, since that depends a lot on which verification options the SSO side supports. So that part would need a custom implementation within Crawlee.
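
To make the login step concrete, here is a rough sketch of what such a custom implementation might look like. It is written against a small `PageLike` interface that only mirrors the Playwright `page` methods it uses (`goto`, `fill`, `click`, `waitForURL`), so it is not Crawlee's own API. `ssoLogin`, `LoginConfig`, the selectors, the URLs, and the `getOtp` callback are all made-up names for illustration; a real SSO page will need its own selectors and possibly a different 2FA flow.

```typescript
// Minimal subset of Playwright's Page API used by this sketch.
// A real Playwright `page` object should satisfy it structurally.
interface PageLike {
  goto(url: string): Promise<unknown>;
  fill(selector: string, value: string): Promise<void>;
  click(selector: string): Promise<void>;
  waitForURL(url: string | RegExp): Promise<void>;
}

// Hypothetical description of the login form; every value here is an
// assumption and has to be taken from the actual SSO page.
interface LoginConfig {
  loginUrl: string;
  usernameSelector: string;
  passwordSelector: string;
  submitSelector: string;
  otpSelector?: string; // only set when a one-time-code field is expected
  loggedInUrl: RegExp; // URL pattern that indicates a successful login
}

interface Credentials {
  username: string;
  password: string;
}

// Fills in the login form and, if configured, a one-time code.
// `getOtp` is a hypothetical callback (TOTP generator, prompt, etc.).
async function ssoLogin(
  page: PageLike,
  config: LoginConfig,
  creds: Credentials,
  getOtp?: () => Promise<string>,
): Promise<void> {
  await page.goto(config.loginUrl);
  await page.fill(config.usernameSelector, creds.username);
  await page.fill(config.passwordSelector, creds.password);
  await page.click(config.submitSelector);

  // Assumption: the code field appears after the first submit and the
  // same submit button confirms it. Many SSO providers differ here.
  if (config.otpSelector && getOtp) {
    await page.fill(config.otpSelector, await getOtp());
    await page.click(config.submitSelector);
  }

  // Wait until the browser lands on a post-login URL.
  await page.waitForURL(config.loggedInUrl);
}
```

Inside Crawlee, something like `await ssoLogin(page, config, creds)` could be called from the `requestHandler` of a `PlaywrightCrawler` before the handler reads the protected content. Push-notification or hardware-key 2FA would not fit this shape and would need a different approach.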


For this use case, you might use this ready-made Actor: https://apify.com/apify/website-content-crawler



