Commit Graph

376 Commits

Author SHA1 Message Date
Rafael Miller
f189589da4
Merge pull request #34 from mendableai/nsc/returnOnlyUrls
Implements the ability for the crawler to output all the links it found, without scraping
2024-04-24 10:34:42 -03:00
rafaelsideguide
07e93ee5fd Update requests.http 2024-04-24 10:32:35 -03:00
rafaelsideguide
942ac3b41c Resolved merge conflicts between feat/added-anthropic-vision-api and main 2024-04-24 09:57:45 -03:00
Nicolas
3b5b868d0d Update requests.http 2024-04-23 18:13:58 -07:00
Nicolas
8939ca570b Merge branch 'main' into nsc/returnOnlyUrls 2024-04-23 18:05:48 -07:00
Nicolas
479fa2f7f8 Nick: 2024-04-23 17:46:32 -07:00
Nicolas
fdb2789eaa Nick: added url as return param 2024-04-23 17:14:34 -07:00
Nicolas
3abfd6b4c1 Update search.ts 2024-04-23 17:06:48 -07:00
Nicolas
53cc4c396f Update search.ts 2024-04-23 17:05:58 -07:00
Nicolas
734c76fc56 Merge branch 'main' into nsc/mvp-search 2024-04-23 17:04:31 -07:00
Nicolas
f0695c7123 Update single_url.ts 2024-04-23 17:04:10 -07:00
Nicolas
4328a68ec1 Nick: 2024-04-23 16:57:53 -07:00
Nicolas
e6779aff68 Nick: tests 2024-04-23 16:56:09 -07:00
Nicolas
9ded75adb7 Merge branch 'main' into nsc/mvp-search 2024-04-23 16:52:40 -07:00
Nicolas
f3c190c21c Nick: 2024-04-23 16:47:24 -07:00
Nicolas
41263bb4b6 Nick: serper support 2024-04-23 16:45:06 -07:00
Nicolas
8cb5d7955a Update googlesearch.ts 2024-04-23 15:49:05 -07:00
Nicolas
495adc9a3f Update googlesearch.ts 2024-04-23 15:48:37 -07:00
Nicolas
5e3e2ec966 Nick: 2024-04-23 15:44:11 -07:00
Nicolas
0146157876 Nick: mvp 2024-04-23 15:28:32 -07:00
rafaelsideguide
849c0b6ebf [Feat] Added blocklist for social media urls 2024-04-23 18:50:35 -03:00
rafaelsideguide
9b01dc6281 Changed from active to waiting jobs 2024-04-23 16:07:22 -03:00
rafaelsideguide
a680c7ce84 [Feat] Server health check + slack message 2024-04-23 15:46:29 -03:00
Nicolas
306cfe4ce1 Nick: 2024-04-23 11:15:11 -07:00
Nicolas
357914c07d Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-04-23 10:55:42 -07:00
Nicolas
bf2df7a853 Nick: fix js-sdk 2024-04-23 10:55:40 -07:00
Nicolas
7bc7b179d4
Merge pull request #46 from mattzcarey/patch-1
chore: add context.close
2024-04-22 21:46:17 -07:00
Nicolas
de7e1f501b Update openapi.json 2024-04-22 08:41:54 -07:00
Matt
572b7e8dc5 chore: add context.close 2024-04-22 16:38:05 +01:00
Nicolas
001bf0c504 Update package.json 2024-04-21 12:05:12 -07:00
Nicolas
6560c968e1 Update types.ts 2024-04-21 12:02:11 -07:00
Nicolas
52620bab16 Nick: prod and local-no-auth tests 2024-04-21 11:39:36 -07:00
Nicolas
749bd5f44d Merge branch 'cjp/contributors-guide-and' of https://github.com/mendableai/firecrawl into cjp/contributors-guide-and 2024-04-21 11:27:37 -07:00
Nicolas
898d729a84 Nick: tests 2024-04-21 11:27:31 -07:00
Caleb Peffer
ef4ffd3a18 Adding contributors guide 2024-04-21 10:56:30 -07:00
Nicolas
5cdbf3a0ac Nick: cleaner functions to handle authenticated requests that dont require ifs everywhere 2024-04-21 10:36:48 -07:00
Nicolas
aa89e2e8b5 Merge branch 'main' into cjp/contributors-guide-and 2024-04-21 10:10:05 -07:00
Caleb Peffer
be75aaa195 Caleb: first version of supabase proxy to make db authentication optional 2024-04-21 09:31:22 -07:00
Caleb Peffer
ad7951a679 Merge branch 'main' of https://github.com/mendableai/firecrawl into cjp/contributors-guide-and 2024-04-20 19:56:55 -07:00
Nicolas
d2f808a5fd Update queue-worker.ts 2024-04-20 19:54:37 -07:00
Caleb Peffer
e6b46178dd Caleb: added .env.example 2024-04-20 19:53:27 -07:00
Caleb Peffer
b361a76282 Caleb: added logging improvement 2024-04-20 19:53:04 -07:00
Nicolas
9b31e68a7e Update queue-worker.ts 2024-04-20 19:38:44 -07:00
Nicolas
0db0874b00 Nick: 2024-04-20 19:37:45 -07:00
Nicolas
4543c57e4e Nick: 2024-04-20 19:04:27 -07:00
Nicolas
5b8aed26dd Update scrape.ts 2024-04-20 18:55:39 -07:00
Nicolas
23b2190e5d Nick: 2024-04-20 16:38:05 -07:00
Nicolas
d201a4e58d
Merge pull request #31 from mendableai/feat/js-sdk-v0011
[Feat] Added type declarations
2024-04-20 14:15:16 -07:00
Nicolas
acec76680a
Merge pull request #35 from mendableai/nsc/job-logs
Better logging
2024-04-20 14:12:44 -07:00
Nicolas
5b3c75b06e Nick: 2024-04-20 14:10:29 -07:00
Nicolas
43c2e877e7 Update index.ts 2024-04-20 14:05:01 -07:00
Nicolas
408c7a479f Nick: rate limit fixes 2024-04-20 14:02:22 -07:00
Nicolas
6aa3cc3ce8 Nick: 2024-04-20 13:53:11 -07:00
Nicolas
1a3aa2999d Nick: return the only list of urls 2024-04-20 11:59:42 -07:00
Nicolas
ddf9ff9c9a Nick: 2024-04-20 11:46:06 -07:00
Nicolas
f1dd97af0f Update index.ts 2024-04-19 15:37:27 -07:00
Nicolas
84cebf618b Nick: 2024-04-19 15:36:00 -07:00
Nicolas
005ac8f839 Merge branch 'main' into detect-pdfs 2024-04-19 15:13:32 -07:00
Nicolas
5b93799149 Nick: a bit faster 2024-04-19 15:13:17 -07:00
rafaelsideguide
890bde686f added type declarations 2024-04-19 19:10:05 -03:00
rafaelsideguide
37ef8a015c fixing scrape preview test 2024-04-19 17:55:35 -03:00
Nicolas
c5cb268b61 Update pdfProcessor.ts 2024-04-19 13:13:42 -07:00
Nicolas
43cfcec326 Nick: disabling in crawl and sitemap for now 2024-04-19 13:12:08 -07:00
Nicolas
140529c609 Nick: fixes pdfs not found 2024-04-19 13:05:21 -07:00
Nicolas
15cfc01f5d Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-04-19 12:23:14 -07:00
Nicolas
a144e13e30 Update rate-limiter.ts 2024-04-19 12:23:13 -07:00
rafaelsideguide
384fb1db18 updating version 2024-04-19 15:27:54 -03:00
Rafael Miller
3c14b02f8b
Merge pull request #25 from mendableai/feat/replace-all-paths-to-absolute-paths
Added option to replace all relative paths with absolute paths
2024-04-19 15:18:50 -03:00
rafaelsideguide
3ddff62a56 adding better doc and types for js-sdk 2024-04-19 14:49:35 -03:00
Ikko Eltociear Ashimine
9e9d66f7a3
refactor: fix typo in WebScraper/index.ts
breakign -> breaking
2024-04-20 02:27:53 +09:00
rafaelsideguide
72e1dadccd adding option to replace all relative paths with absolute paths 2024-04-19 11:47:20 -03:00
Nicolas
2c0660653d Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-04-18 13:56:25 -07:00
Nicolas
be35b32306 Nick: preview token tests 2024-04-18 13:55:55 -07:00
rafaelsideguide
c627d22179 all working now 2024-04-18 17:41:23 -03:00
rafaelsideguide
dab0568c43 testing tests 2024-04-18 17:38:12 -03:00
rafaelsideguide
3f833737f3 fixing test 2024-04-18 17:25:25 -03:00
rafaelsideguide
efbb4e8905 fixing jest parameters 2024-04-18 17:18:15 -03:00
rafaelsideguide
ddb3b25171 adding ci-cd workflow 2024-04-18 16:28:01 -03:00
Nicolas
3e9e24aaf1 Update index.ts 2024-04-18 11:01:24 -07:00
Nicolas
0f7ab4107f Update index.ts 2024-04-18 10:41:06 -07:00
Nicolas
6112cc1c2c Update index.ts 2024-04-18 10:34:41 -07:00
rafaelsideguide
c4cc4b9262 fixing document response 2024-04-18 14:12:39 -03:00
Rafael Miller
704a059448 Update index.ts 2024-04-18 13:53:11 -03:00
rafaelsideguide
57e5b36014 [Feat] Adding pdf parser 2024-04-18 11:43:57 -03:00
Nicolas
e3a6bc4de7 Create openapi.json 2024-04-17 22:23:10 -07:00
Nicolas
2bed55a3b4 Nick: 2024-04-17 19:05:28 -07:00
Nicolas
ca2bf9cc12 Update single_url.ts 2024-04-17 18:27:08 -07:00
Nicolas
36abe0f7f9 Nick: 2024-04-17 18:24:46 -07:00
Nicolas
460763ba5f
Merge pull request #11 from mendableai/feat/parse-to-markdown-tables
[Feat] Added html to markdown table parser
2024-04-17 15:52:43 -04:00
Nicolas
529e77d3e7
Merge pull request #9 from szepeviktor/typos
Fix typos
2024-04-17 15:52:35 -04:00
Nicolas
52fb28bc1a Update index.ts 2024-04-17 12:52:15 -07:00
Nicolas
de439f6529 Update index.ts 2024-04-17 12:51:29 -07:00
Nicolas
871d5d91b0 Update index.ts 2024-04-17 12:51:12 -07:00
Nicolas
08ed68ff55 Nick: fixes 2024-04-17 12:44:23 -07:00
Nicolas
650852cc5a Merge branch 'main' into feat/parse-to-markdown-tables 2024-04-17 12:28:17 -07:00
rafaelsideguide
ee8a097252 adding unit tests and fixing the parse function 2024-04-17 15:56:01 -03:00
Nicolas
2eb81545fa Update index.test.ts 2024-04-17 11:04:03 -07:00
Nicolas
60245343c9 Merge branch 'main' into feat/improving-reative-paths 2024-04-17 10:57:49 -07:00
Nicolas
417921ea33 Update index.ts 2024-04-17 10:57:01 -07:00
rafaelsideguide
b375ce3e39 adding unit tests and bugfixing 2024-04-17 14:54:54 -03:00
Nicolas
82ed9515f1 Update index.ts 2024-04-17 10:52:10 -07:00
Nicolas
c837f1cc04
Merge pull request #12 from mendableai/bugfix/normalized-api-on-crawl-status
[Bugfix] added normalized apikey to craw/status route
2024-04-17 13:42:26 -04:00
Nicolas
db15724b0c Update imageDescription.ts 2024-04-17 10:39:29 -07:00
Nicolas
27674a624d Update index.ts 2024-04-17 10:39:00 -07:00
rafaelsideguide
25a9255c7e [bugfix] added normalized apikey to craw/status route 2024-04-17 12:59:49 -03:00
rafaelsideguide
ff622739b7 Added a html to markdown table parser 2024-04-17 11:01:19 -03:00
Viktor Szépe
d628511b57 Delete apps/playwright-service/.DS_Store 2024-04-17 08:53:23 +02:00
Viktor Szépe
11394ef236 Delete apps/api/src/.DS_Store 2024-04-17 08:53:12 +02:00
Viktor Szépe
51f94e9e41 Delete apps/.DS_Store 2024-04-17 08:53:01 +02:00
Viktor Szépe
34ab21db59 Fix typos 2024-04-17 05:13:27 +00:00
rafaelsideguide
ed5dc808c7 Update imageDescription.ts 2024-04-16 18:05:07 -03:00
rafaelsideguide
00941d94a4 Added anthropic vision to getImageDescription function 2024-04-16 18:03:48 -03:00
rafaelsideguide
d23a7ae591 improving relative paths 2024-04-16 16:34:01 -03:00
rafaelsideguide
a04610302a Spliting relative paths for images 2024-04-16 16:31:33 -03:00
rafaelsideguide
3e4064bce2 moving js-sdk to monorepo 2024-04-16 14:02:16 -03:00
Nicolas
0113d66739 Update requests.http 2024-04-16 13:01:35 -04:00
Nicolas
8892504aea Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-04-16 12:49:15 -04:00
Nicolas
4c4775e0b8 Nick: 2024-04-16 12:49:14 -04:00
rafaelsideguide
df1d506d17 js-sdk ok! 2024-04-16 13:31:16 -03:00
Nicolas
15fd4e23d8 Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-04-16 12:07:01 -04:00
Nicolas
93627ae87c Nick: 2024-04-16 12:06:46 -04:00
rafaelsideguide
be3eb211e9 adding JS-SDK 2024-04-16 11:38:22 -03:00
Nicolas
3d260e94f3 Nick: fc- prefix 2024-04-15 20:39:25 -04:00
Nicolas
32af4ac226 Update requests.http 2024-04-15 17:38:33 -04:00
Nicolas
3418858cb7 Nick: revoked 2024-04-15 17:26:47 -04:00
Nicolas
a6c2a87811 Initial commit 2024-04-15 17:01:47 -04:00