Nicolas
e0d979edad
Merge pull request #176 from mendableai/bug/data-check-in-python-sdk
...
[Bug] Added data check for python SDK
2024-05-24 09:45:39 -07:00
Nicolas
53a214cefb
Merge pull request #168 from mendableai/nsc/allowed-keywords-in-blocklist
...
feat: Allow privacy/legal/other pages in social media websites
2024-05-24 09:43:15 -07:00
Nicolas
e166c07690
Merge pull request #170 from qyou/fix-hardcode-timeout
...
update: wait until body attached in playwright-service
2024-05-24 09:41:27 -07:00
Rafael Miller
c00580eabd
Merge pull request #181 from JakobStadlhuber/feat/proxy-support
...
[Feat] Added proxy and media blocking support for Playwright
2024-05-24 13:34:34 -03:00
Jakob Stadlhuber
9fc5a0ff98
Update comment in .env.example for proxy settings
...
This commit modifies the comment in .env.example to specify that the proxy settings are for Playwright. This clarification gives users clearer context about when and why these proxy settings are used.
2024-05-24 17:45:59 +02:00
Jakob Stadlhuber
b001aded46
Add proxy and media blocking configurations
...
Updated environment variables and application settings to include proxy configurations and a media blocking option. The proxy settings allow users to use a proxy service, while media blocking is an optional feature that can help save bandwidth. Changes have been made in the .env.example, docker-compose.yaml, and main.py files.
2024-05-24 17:41:34 +02:00
Rafael Miller
6a5b9ca314
Merge pull request #180 from mendableai/added-issue-templates
...
Update issue templates
2024-05-24 10:17:40 -03:00
Rafael Miller
9562c837eb
Update issue templates
2024-05-24 09:34:43 -03:00
Nicolas
4e39701644
Update main.py
2024-05-22 12:59:56 -07:00
Rafael Miller
df0550d2f6
Merge pull request #143 from mendableai/bug/crawl-limit
...
[Bug] Fixing /crawl limit
2024-05-22 14:51:54 -03:00
Nicolas
3aa5f26627
Update main.py
2024-05-22 10:45:43 -07:00
Nicolas
3e63985e53
Update main.py
2024-05-22 10:40:47 -07:00
rafaelsideguide
4dfc371241
Update index.test.ts
2024-05-22 14:38:41 -03:00
rafaelsideguide
f4a3469b9e
Merge branch 'main' into bug/crawl-limit
2024-05-22 14:27:28 -03:00
Nicolas
0d187f0425
Merge pull request #77 from tractorjuice/patch-1
...
Add additional file extensions to crawler.ts
2024-05-22 10:16:49 -07:00
rafaelsideguide
f9ae1729b6
Update firecrawl.py
2024-05-22 09:40:38 -03:00
Nicolas
f915b08016
Merge pull request #174 from mendableai/nsc/fire-engine-beta
...
beta: Fire-Engine fallback
2024-05-21 19:25:51 -07:00
Nicolas
cb2bd0e71f
Update index.test.ts
2024-05-21 19:03:32 -07:00
Nicolas
253abb849f
Update rate-limiter.ts
2024-05-21 18:53:58 -07:00
Nicolas
229b9908d2
Nick: only enable hyper dx in prod
2024-05-21 18:52:46 -07:00
Nicolas
a8ff295977
Update single_url.ts
2024-05-21 18:50:42 -07:00
Nicolas
a5e718b084
Nick: improvements
2024-05-21 18:34:23 -07:00
Nicolas
2e264a4c75
Update ci.yml
2024-05-21 13:24:09 -07:00
Nicolas
6285f12cd1
Merge pull request #167 from mendableai/nsc/hyper-dx-integration
...
feat: HyperDX Integration
2024-05-21 13:19:38 -07:00
youqiang
c47dae13a9
update: wait until body attached in playwright-service
2024-05-21 14:53:57 +08:00
Nicolas
7f64fe884a
Update blocklist.ts
2024-05-20 17:26:01 -07:00
Nicolas
756f54466d
Nick: allowed keywords for now
2024-05-20 17:24:21 -07:00
Nicolas
01783dc336
Update openapi.json
2024-05-20 17:10:55 -07:00
Nicolas
77a79b5a79
Nick: max num tokens for llm extract (for now) + slice the max
2024-05-20 17:07:38 -07:00
Nicolas
2644e1c029
Update .env.example
2024-05-20 13:36:51 -07:00
Nicolas
9e61d431f0
Nick: hyper dx integration init
2024-05-20 13:36:34 -07:00
Nicolas
d5d0d48848
Merge branch 'main' of https://github.com/mendableai/firecrawl
2024-05-20 10:06:52 -07:00
Nicolas
60002e79b8
Nick: python sdk bump
2024-05-20 10:06:48 -07:00
Nicolas
4bb536e5b3
Merge pull request #166 from mattjoyce/main
...
Allow override of API URL
2024-05-20 09:44:47 -07:00
Matt Joyce
7e5ef4dec4
Allow override of API URL
...
Allows python sdk to be used with local installs.
2024-05-20 18:46:32 +10:00
Nicolas
c74f757b53
Update rate-limiter.ts
2024-05-19 13:05:36 -07:00
Nicolas
842e197fd7
Merge pull request #151 from mendableai/feat/rate-limits
...
[Feat] Added rate limits
2024-05-19 13:01:37 -07:00
Nicolas
98a39b39ab
Nick: increased rate limits
2024-05-19 12:59:29 -07:00
Nicolas
18fa15df25
Update index.test.ts
2024-05-19 12:50:06 -07:00
Nicolas
614c073af0
Nick: improvements
2024-05-19 12:45:46 -07:00
Nicolas
f473793ba3
Merge branch 'main' into feat/rate-limits
2024-05-19 12:23:34 -07:00
Nicolas
713f16fdc1
Update README.md
2024-05-19 00:41:12 -07:00
Nicolas
0dc108cd33
Update README.md
2024-05-18 11:32:13 -07:00
Nicolas
43d0309b09
Merge pull request #160 from elimisteve/patch-1
...
Update README.md: Typo fix
2024-05-18 11:24:23 -07:00
Steve Phillips
81563130e3
Update README.md: Typo fix
...
Don't scrap, scrape!
2024-05-18 03:58:17 +02:00
Nicolas
fae8954eeb
Update SELF_HOST.md
2024-05-17 18:46:59 -07:00
rafaelsideguide
a480595aa7
Update index.test.ts
2024-05-17 15:41:27 -03:00
rafaelsideguide
54049be539
Added e2e tests
2024-05-17 15:37:47 -03:00
Nicolas
6feb21cc35
Update website_params.ts
2024-05-17 11:21:26 -07:00
Nicolas
2a1f2e396a
Merge pull request #29 from mendableai/detect-pdfs
...
Fixes pdfs not found if .pdf is not present
2024-05-17 10:47:08 -07:00