0
Commit Graph

190 Commits

Author SHA1 Message Date
Nicolas
768166b066 Update single_url.ts 2024-04-30 16:57:44 -07:00
Nicolas
a386259511 Update scrape.ts 2024-04-30 16:35:44 -07:00
Nicolas
dfcf39f4c0 Update scrape.ts 2024-04-30 16:19:59 -07:00
Nicolas
3c7030dbb1 Nick: improvements 2024-04-30 16:19:32 -07:00
Nicolas
cbd9e88b77 Merge branch 'main' into llm-extraction 2024-04-30 14:49:20 -07:00
Nicolas
4f526cff92 Nick: cleanup 2024-04-30 12:19:43 -07:00
Caleb Peffer
d9d206aff6 Caleb: 2024-04-30 10:27:39 -07:00
Caleb Peffer
d1235a0029 Caleb: switched back to markdown for extraction 2024-04-30 10:23:12 -07:00
Caleb Peffer
ad9c8e77d1 Caleb: commented out massive test 2024-04-30 10:22:09 -07:00
Caleb Peffer
a32f2b37b6 Caleb: logs work 2024-04-30 10:21:41 -07:00
Caleb Peffer
3ca9e5153f Caleb: trying to get loggin workng 2024-04-30 09:20:15 -07:00
rafaelsideguide
d3c36adaa7 Update index.ts 2024-04-29 17:58:47 -03:00
Caleb Peffer
79cd7d2ebc Merge branch 'llm-extraction' of https://github.com/mendableai/firecrawl into llm-extraction 2024-04-29 12:12:58 -07:00
Caleb Peffer
4f7737c922 Caleb: added ajv json schema validation. 2024-04-29 12:12:55 -07:00
Nicolas
b69feab916 Merge branch 'main' into llm-extraction 2024-04-29 08:40:44 -07:00
Caleb Peffer
667f740315 Caleb: converted llm response to json 2024-04-28 19:28:28 -07:00
Caleb Peffer
2ad7a58eb7 Caleb: first test passing 2024-04-28 17:38:20 -07:00
Caleb Peffer
06497729e2 Caleb: got it to a testable state I believe 2024-04-28 15:52:09 -07:00
Caleb Peffer
6ee1f2d3bc Caleb: initially pulled inspiration code from https://github.com/mishushakov/llm-scraper 2024-04-28 13:59:35 -07:00
Nicolas
68838c9e0d Update single_url.ts 2024-04-28 12:44:00 -07:00
Nicolas
d8ee4e90d6 Update website_params.ts 2024-04-28 11:47:25 -07:00
Nicolas
8e44696c4d Nick: 2024-04-28 11:34:25 -07:00
Nicolas
7689c31d35 Update credit_billing.ts 2024-04-26 14:36:19 -07:00
Nicolas
0a607b9efa Merge branch 'main' into feat/coupons 2024-04-26 14:23:35 -07:00
Nicolas
fdf913e0f1 Update index.test.ts 2024-04-26 13:06:48 -07:00
Nicolas
8e32453424 Update auth.ts 2024-04-26 12:57:49 -07:00
rafaelsideguide
1f48998970 done 2024-04-26 16:27:31 -03:00
Nicolas
d210a57a9b Update credit_billing.ts 2024-04-26 10:24:36 -07:00
Nicolas
24e1bdec1b Update credit_billing.ts 2024-04-26 10:14:29 -07:00
rafaelsideguide
06675d1fe3 almost finished 2024-04-26 11:42:49 -03:00
Nicolas
3ac8724329 Update openapi.json 2024-04-25 13:28:07 -07:00
Nicolas
a3911bfc67 Update index.ts 2024-04-25 10:00:35 -07:00
rafaelsideguide
9c481e5e83 [Feat] Coupon system
WIP. Idea for solving #57
2024-04-25 10:05:53 -03:00
rafaelsideguide
75597f72a1 [Feat] Added allowed urls
FireCrawl should be able to scrape LinkedIn Articles (/pulse/*)
2024-04-25 08:39:45 -03:00
Nicolas
a59ddf1855 Nick: default to serper 2024-04-24 18:00:25 -07:00
Roger M
f2690f6909 Support for tbs, filter, lang, country and location with Serper search. 2024-04-25 01:35:17 +01:00
Nicolas
e7d385ad32 Update search.ts 2024-04-24 10:23:26 -07:00
Nicolas
877af4231b Update openapi.json 2024-04-24 10:11:44 -07:00
Nicolas
307ea6f5ec Nick: improvements to search 2024-04-24 10:11:01 -07:00
Rafael Miller
f189589da4
Merge pull request #34 from mendableai/nsc/returnOnlyUrls
Implements the ability for the crawler to output all the links it found, without scraping
2024-04-24 10:34:42 -03:00
rafaelsideguide
07e93ee5fd Update requests.http 2024-04-24 10:32:35 -03:00
rafaelsideguide
942ac3b41c Resolved merge conflicts between feat/added-anthropic-vision-api and main 2024-04-24 09:57:45 -03:00
Nicolas
3b5b868d0d Update requests.http 2024-04-23 18:13:58 -07:00
Nicolas
8939ca570b Merge branch 'main' into nsc/returnOnlyUrls 2024-04-23 18:05:48 -07:00
Nicolas
479fa2f7f8 Nick: 2024-04-23 17:46:32 -07:00
Nicolas
fdb2789eaa Nick: added url as return param 2024-04-23 17:14:34 -07:00
Nicolas
3abfd6b4c1 Update search.ts 2024-04-23 17:06:48 -07:00
Nicolas
53cc4c396f Update search.ts 2024-04-23 17:05:58 -07:00
Nicolas
734c76fc56 Merge branch 'main' into nsc/mvp-search 2024-04-23 17:04:31 -07:00
Nicolas
f0695c7123 Update single_url.ts 2024-04-23 17:04:10 -07:00
Nicolas
4328a68ec1 Nick: 2024-04-23 16:57:53 -07:00
Nicolas
e6779aff68 Nick: tests 2024-04-23 16:56:09 -07:00
Nicolas
9ded75adb7 Merge branch 'main' into nsc/mvp-search 2024-04-23 16:52:40 -07:00
Nicolas
f3c190c21c Nick: 2024-04-23 16:47:24 -07:00
Nicolas
41263bb4b6 Nick: serper support 2024-04-23 16:45:06 -07:00
Nicolas
8cb5d7955a Update googlesearch.ts 2024-04-23 15:49:05 -07:00
Nicolas
495adc9a3f Update googlesearch.ts 2024-04-23 15:48:37 -07:00
Nicolas
5e3e2ec966 Nick: 2024-04-23 15:44:11 -07:00
Nicolas
0146157876 Nick: mvp 2024-04-23 15:28:32 -07:00
rafaelsideguide
849c0b6ebf [Feat] Added blocklist for social media urls 2024-04-23 18:50:35 -03:00
rafaelsideguide
9b01dc6281 Changed from active to waiting jobs 2024-04-23 16:07:22 -03:00
rafaelsideguide
a680c7ce84 [Feat] Server health check + slack message 2024-04-23 15:46:29 -03:00
Nicolas
306cfe4ce1 Nick: 2024-04-23 11:15:11 -07:00
Nicolas
de7e1f501b Update openapi.json 2024-04-22 08:41:54 -07:00
Nicolas
001bf0c504 Update package.json 2024-04-21 12:05:12 -07:00
Nicolas
6560c968e1 Update types.ts 2024-04-21 12:02:11 -07:00
Nicolas
52620bab16 Nick: prod and local-no-auth tests 2024-04-21 11:39:36 -07:00
Nicolas
749bd5f44d Merge branch 'cjp/contributors-guide-and' of https://github.com/mendableai/firecrawl into cjp/contributors-guide-and 2024-04-21 11:27:37 -07:00
Nicolas
898d729a84 Nick: tests 2024-04-21 11:27:31 -07:00
Caleb Peffer
ef4ffd3a18 Adding contributors guide 2024-04-21 10:56:30 -07:00
Nicolas
5cdbf3a0ac Nick: cleaner functions to handle authenticated requests that dont require ifs everywhere 2024-04-21 10:36:48 -07:00
Nicolas
aa89e2e8b5 Merge branch 'main' into cjp/contributors-guide-and 2024-04-21 10:10:05 -07:00
Caleb Peffer
be75aaa195 Caleb: first version of supabase proxy to make db authentication optional 2024-04-21 09:31:22 -07:00
Caleb Peffer
ad7951a679 Merge branch 'main' of https://github.com/mendableai/firecrawl into cjp/contributors-guide-and 2024-04-20 19:56:55 -07:00
Nicolas
d2f808a5fd Update queue-worker.ts 2024-04-20 19:54:37 -07:00
Caleb Peffer
e6b46178dd Caleb: added .env.example 2024-04-20 19:53:27 -07:00
Caleb Peffer
b361a76282 Caleb: added logging improvement 2024-04-20 19:53:04 -07:00
Nicolas
9b31e68a7e Update queue-worker.ts 2024-04-20 19:38:44 -07:00
Nicolas
0db0874b00 Nick: 2024-04-20 19:37:45 -07:00
Nicolas
4543c57e4e Nick: 2024-04-20 19:04:27 -07:00
Nicolas
5b8aed26dd Update scrape.ts 2024-04-20 18:55:39 -07:00
Nicolas
23b2190e5d Nick: 2024-04-20 16:38:05 -07:00
Nicolas
acec76680a
Merge pull request #35 from mendableai/nsc/job-logs
Better logging
2024-04-20 14:12:44 -07:00
Nicolas
5b3c75b06e Nick: 2024-04-20 14:10:29 -07:00
Nicolas
43c2e877e7 Update index.ts 2024-04-20 14:05:01 -07:00
Nicolas
408c7a479f Nick: rate limit fixes 2024-04-20 14:02:22 -07:00
Nicolas
6aa3cc3ce8 Nick: 2024-04-20 13:53:11 -07:00
Nicolas
1a3aa2999d Nick: return the only list of urls 2024-04-20 11:59:42 -07:00
Nicolas
ddf9ff9c9a Nick: 2024-04-20 11:46:06 -07:00
rafaelsideguide
37ef8a015c fixing scrape preview test 2024-04-19 17:55:35 -03:00
Nicolas
15cfc01f5d Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-04-19 12:23:14 -07:00
Nicolas
a144e13e30 Update rate-limiter.ts 2024-04-19 12:23:13 -07:00
rafaelsideguide
72e1dadccd adding option to replace all relative paths with absolute paths 2024-04-19 11:47:20 -03:00
Nicolas
2c0660653d Merge branch 'main' of https://github.com/mendableai/firecrawl 2024-04-18 13:56:25 -07:00
Nicolas
be35b32306 Nick: preview token tests 2024-04-18 13:55:55 -07:00
rafaelsideguide
c627d22179 all working now 2024-04-18 17:41:23 -03:00
rafaelsideguide
dab0568c43 testing tests 2024-04-18 17:38:12 -03:00
rafaelsideguide
3f833737f3 fixing test 2024-04-18 17:25:25 -03:00
rafaelsideguide
efbb4e8905 fixing jest parameters 2024-04-18 17:18:15 -03:00
rafaelsideguide
ddb3b25171 adding ci-cd workflow 2024-04-18 16:28:01 -03:00