rafaelsideguide
|
a095e1b63d
|
Resolve merge conflicts with main
|
2024-04-30 10:54:18 -03:00 |
|
rafaelsideguide
|
35480bd2ad
|
Update index.test.ts
|
2024-04-30 10:40:32 -03:00 |
|
rafaelsideguide
|
d3c36adaa7
|
Update index.ts
|
2024-04-29 17:58:47 -03:00 |
|
Caleb Peffer
|
79cd7d2ebc
|
Merge branch 'llm-extraction' of https://github.com/mendableai/firecrawl into llm-extraction
|
2024-04-29 12:12:58 -07:00 |
|
Caleb Peffer
|
4f7737c922
|
Caleb: added ajv json schema validation.
|
2024-04-29 12:12:55 -07:00 |
|
rafaelsideguide
|
f8b207793f
|
changed the request to do a HEAD to check for a PDF instead
|
2024-04-29 15:15:32 -03:00 |
|
Nicolas
|
b69feab916
|
Merge branch 'main' into llm-extraction
|
2024-04-29 08:40:44 -07:00 |
|
Caleb Peffer
|
667f740315
|
Caleb: converted llm response to json
|
2024-04-28 19:28:28 -07:00 |
|
Caleb Peffer
|
2ad7a58eb7
|
Caleb: first test passing
|
2024-04-28 17:38:20 -07:00 |
|
Caleb Peffer
|
06497729e2
|
Caleb: got it to a testable state I believe
|
2024-04-28 15:52:09 -07:00 |
|
Caleb Peffer
|
6ee1f2d3bc
|
Caleb: initially pulled inspiration code from https://github.com/mishushakov/llm-scraper
|
2024-04-28 13:59:35 -07:00 |
|
Nicolas
|
68838c9e0d
|
Update single_url.ts
|
2024-04-28 12:44:00 -07:00 |
|
Nicolas
|
d8ee4e90d6
|
Update website_params.ts
|
2024-04-28 11:47:25 -07:00 |
|
Nicolas
|
8e44696c4d
|
Nick:
|
2024-04-28 11:34:25 -07:00 |
|
Nicolas
|
1dc6458c6a
|
Update crawler.ts
|
2024-04-27 11:17:10 -07:00 |
|
Nicolas
|
0f694e0608
|
Update crawler.ts
|
2024-04-27 11:14:52 -07:00 |
|
tractorjuice
|
a5d38039f2
|
Add additional file extensions to crawler.ts
Add additional file extensions.
|
2024-04-27 11:03:27 +01:00 |
|
Nicolas
|
7689c31d35
|
Update credit_billing.ts
|
2024-04-26 14:36:19 -07:00 |
|
Nicolas
|
0a607b9efa
|
Merge branch 'main' into feat/coupons
|
2024-04-26 14:23:35 -07:00 |
|
Nicolas
|
fdf913e0f1
|
Update index.test.ts
|
2024-04-26 13:06:48 -07:00 |
|
Nicolas
|
8e32453424
|
Update auth.ts
|
2024-04-26 12:57:49 -07:00 |
|
rafaelsideguide
|
1f48998970
|
done
|
2024-04-26 16:27:31 -03:00 |
|
Nicolas
|
d210a57a9b
|
Update credit_billing.ts
|
2024-04-26 10:24:36 -07:00 |
|
Nicolas
|
24e1bdec1b
|
Update credit_billing.ts
|
2024-04-26 10:14:29 -07:00 |
|
rafaelsideguide
|
06675d1fe3
|
almost finished
|
2024-04-26 11:42:49 -03:00 |
|
Nicolas
|
a3911bfc67
|
Update index.ts
|
2024-04-25 10:00:35 -07:00 |
|
rafaelsideguide
|
9c481e5e83
|
[Feat] Coupon system
WIP. Idea for solving #57
|
2024-04-25 10:05:53 -03:00 |
|
rafaelsideguide
|
75597f72a1
|
[Feat] Added allowed urls
FireCrawl should be able to scrape LinkedIn Articles (/pulse/*)
|
2024-04-25 08:39:45 -03:00 |
|
Nicolas
|
a59ddf1855
|
Nick: default to serper
|
2024-04-24 18:00:25 -07:00 |
|
Roger M
|
f2690f6909
|
Support for tbs, filter, lang, country and location with Serper search.
|
2024-04-25 01:35:17 +01:00 |
|
Nicolas
|
e7d385ad32
|
Update search.ts
|
2024-04-24 10:23:26 -07:00 |
|
Nicolas
|
307ea6f5ec
|
Nick: improvements to search
|
2024-04-24 10:11:01 -07:00 |
|
Rafael Miller
|
f189589da4
|
Merge pull request #34 from mendableai/nsc/returnOnlyUrls
Implements the ability for the crawler to output all the links it found, without scraping
|
2024-04-24 10:34:42 -03:00 |
|
rafaelsideguide
|
942ac3b41c
|
Resolved merge conflicts between feat/added-anthropic-vision-api and main
|
2024-04-24 09:57:45 -03:00 |
|
Nicolas
|
8939ca570b
|
Merge branch 'main' into nsc/returnOnlyUrls
|
2024-04-23 18:05:48 -07:00 |
|
Nicolas
|
479fa2f7f8
|
Nick:
|
2024-04-23 17:46:32 -07:00 |
|
Nicolas
|
fdb2789eaa
|
Nick: added url as return param
|
2024-04-23 17:14:34 -07:00 |
|
Nicolas
|
3abfd6b4c1
|
Update search.ts
|
2024-04-23 17:06:48 -07:00 |
|
Nicolas
|
53cc4c396f
|
Update search.ts
|
2024-04-23 17:05:58 -07:00 |
|
Nicolas
|
734c76fc56
|
Merge branch 'main' into nsc/mvp-search
|
2024-04-23 17:04:31 -07:00 |
|
Nicolas
|
f0695c7123
|
Update single_url.ts
|
2024-04-23 17:04:10 -07:00 |
|
Nicolas
|
4328a68ec1
|
Nick:
|
2024-04-23 16:57:53 -07:00 |
|
Nicolas
|
e6779aff68
|
Nick: tests
|
2024-04-23 16:56:09 -07:00 |
|
Nicolas
|
9ded75adb7
|
Merge branch 'main' into nsc/mvp-search
|
2024-04-23 16:52:40 -07:00 |
|
Nicolas
|
f3c190c21c
|
Nick:
|
2024-04-23 16:47:24 -07:00 |
|
Nicolas
|
41263bb4b6
|
Nick: serper support
|
2024-04-23 16:45:06 -07:00 |
|
Nicolas
|
8cb5d7955a
|
Update googlesearch.ts
|
2024-04-23 15:49:05 -07:00 |
|
Nicolas
|
495adc9a3f
|
Update googlesearch.ts
|
2024-04-23 15:48:37 -07:00 |
|
Nicolas
|
5e3e2ec966
|
Nick:
|
2024-04-23 15:44:11 -07:00 |
|
Nicolas
|
0146157876
|
Nick: mvp
|
2024-04-23 15:28:32 -07:00 |
|
rafaelsideguide
|
849c0b6ebf
|
[Feat] Added blocklist for social media urls
|
2024-04-23 18:50:35 -03:00 |
|
rafaelsideguide
|
9b01dc6281
|
Changed from active to waiting jobs
|
2024-04-23 16:07:22 -03:00 |
|
rafaelsideguide
|
a680c7ce84
|
[Feat] Server health check + slack message
|
2024-04-23 15:46:29 -03:00 |
|
Nicolas
|
306cfe4ce1
|
Nick:
|
2024-04-23 11:15:11 -07:00 |
|
Nicolas
|
6560c968e1
|
Update types.ts
|
2024-04-21 12:02:11 -07:00 |
|
Nicolas
|
52620bab16
|
Nick: prod and local-no-auth tests
|
2024-04-21 11:39:36 -07:00 |
|
Nicolas
|
898d729a84
|
Nick: tests
|
2024-04-21 11:27:31 -07:00 |
|
Nicolas
|
5cdbf3a0ac
|
Nick: cleaner functions to handle authenticated requests that dont require ifs everywhere
|
2024-04-21 10:36:48 -07:00 |
|
Nicolas
|
aa89e2e8b5
|
Merge branch 'main' into cjp/contributors-guide-and
|
2024-04-21 10:10:05 -07:00 |
|
Caleb Peffer
|
be75aaa195
|
Caleb: first version of supabase proxy to make db authentication optional
|
2024-04-21 09:31:22 -07:00 |
|
Caleb Peffer
|
ad7951a679
|
Merge branch 'main' of https://github.com/mendableai/firecrawl into cjp/contributors-guide-and
|
2024-04-20 19:56:55 -07:00 |
|
Nicolas
|
d2f808a5fd
|
Update queue-worker.ts
|
2024-04-20 19:54:37 -07:00 |
|
Caleb Peffer
|
b361a76282
|
Caleb: added logging improvement
|
2024-04-20 19:53:04 -07:00 |
|
Nicolas
|
9b31e68a7e
|
Update queue-worker.ts
|
2024-04-20 19:38:44 -07:00 |
|
Nicolas
|
0db0874b00
|
Nick:
|
2024-04-20 19:37:45 -07:00 |
|
Nicolas
|
4543c57e4e
|
Nick:
|
2024-04-20 19:04:27 -07:00 |
|
Nicolas
|
5b8aed26dd
|
Update scrape.ts
|
2024-04-20 18:55:39 -07:00 |
|
Nicolas
|
23b2190e5d
|
Nick:
|
2024-04-20 16:38:05 -07:00 |
|
Nicolas
|
acec76680a
|
Merge pull request #35 from mendableai/nsc/job-logs
Better logging
|
2024-04-20 14:12:44 -07:00 |
|
Nicolas
|
5b3c75b06e
|
Nick:
|
2024-04-20 14:10:29 -07:00 |
|
Nicolas
|
43c2e877e7
|
Update index.ts
|
2024-04-20 14:05:01 -07:00 |
|
Nicolas
|
408c7a479f
|
Nick: rate limit fixes
|
2024-04-20 14:02:22 -07:00 |
|
Nicolas
|
6aa3cc3ce8
|
Nick:
|
2024-04-20 13:53:11 -07:00 |
|
Nicolas
|
1a3aa2999d
|
Nick: return the only list of urls
|
2024-04-20 11:59:42 -07:00 |
|
Nicolas
|
ddf9ff9c9a
|
Nick:
|
2024-04-20 11:46:06 -07:00 |
|
Nicolas
|
f1dd97af0f
|
Update index.ts
|
2024-04-19 15:37:27 -07:00 |
|
Nicolas
|
84cebf618b
|
Nick:
|
2024-04-19 15:36:00 -07:00 |
|
Nicolas
|
005ac8f839
|
Merge branch 'main' into detect-pdfs
|
2024-04-19 15:13:32 -07:00 |
|
Nicolas
|
5b93799149
|
Nick: a bit faster
|
2024-04-19 15:13:17 -07:00 |
|
rafaelsideguide
|
37ef8a015c
|
fixing scrape preview test
|
2024-04-19 17:55:35 -03:00 |
|
Nicolas
|
c5cb268b61
|
Update pdfProcessor.ts
|
2024-04-19 13:13:42 -07:00 |
|
Nicolas
|
43cfcec326
|
Nick: disabling in crawl and sitemap for now
|
2024-04-19 13:12:08 -07:00 |
|
Nicolas
|
140529c609
|
Nick: fixes pdfs not found
|
2024-04-19 13:05:21 -07:00 |
|
Nicolas
|
15cfc01f5d
|
Merge branch 'main' of https://github.com/mendableai/firecrawl
|
2024-04-19 12:23:14 -07:00 |
|
Nicolas
|
a144e13e30
|
Update rate-limiter.ts
|
2024-04-19 12:23:13 -07:00 |
|
Ikko Eltociear Ashimine
|
9e9d66f7a3
|
refactor: fix typo in WebScraper/index.ts
breakign -> breaking
|
2024-04-20 02:27:53 +09:00 |
|
rafaelsideguide
|
72e1dadccd
|
adding option to replace all relative paths with absolute paths
|
2024-04-19 11:47:20 -03:00 |
|
Nicolas
|
2c0660653d
|
Merge branch 'main' of https://github.com/mendableai/firecrawl
|
2024-04-18 13:56:25 -07:00 |
|
Nicolas
|
be35b32306
|
Nick: preview token tests
|
2024-04-18 13:55:55 -07:00 |
|
rafaelsideguide
|
c627d22179
|
all working now
|
2024-04-18 17:41:23 -03:00 |
|
rafaelsideguide
|
dab0568c43
|
testing tests
|
2024-04-18 17:38:12 -03:00 |
|
rafaelsideguide
|
3f833737f3
|
fixing test
|
2024-04-18 17:25:25 -03:00 |
|
rafaelsideguide
|
efbb4e8905
|
fixing jest parameters
|
2024-04-18 17:18:15 -03:00 |
|
rafaelsideguide
|
ddb3b25171
|
adding ci-cd workflow
|
2024-04-18 16:28:01 -03:00 |
|
Nicolas
|
3e9e24aaf1
|
Update index.ts
|
2024-04-18 11:01:24 -07:00 |
|
Nicolas
|
0f7ab4107f
|
Update index.ts
|
2024-04-18 10:41:06 -07:00 |
|
Nicolas
|
6112cc1c2c
|
Update index.ts
|
2024-04-18 10:34:41 -07:00 |
|
rafaelsideguide
|
c4cc4b9262
|
fixing document response
|
2024-04-18 14:12:39 -03:00 |
|
Rafael Miller
|
704a059448
|
Update index.ts
|
2024-04-18 13:53:11 -03:00 |
|
rafaelsideguide
|
57e5b36014
|
[Feat] Adding pdf parser
|
2024-04-18 11:43:57 -03:00 |
|
Nicolas
|
ca2bf9cc12
|
Update single_url.ts
|
2024-04-17 18:27:08 -07:00 |
|
Nicolas
|
36abe0f7f9
|
Nick:
|
2024-04-17 18:24:46 -07:00 |
|
Nicolas
|
460763ba5f
|
Merge pull request #11 from mendableai/feat/parse-to-markdown-tables
[Feat] Added html to markdown table parser
|
2024-04-17 15:52:43 -04:00 |
|
Nicolas
|
529e77d3e7
|
Merge pull request #9 from szepeviktor/typos
Fix typos
|
2024-04-17 15:52:35 -04:00 |
|
Nicolas
|
52fb28bc1a
|
Update index.ts
|
2024-04-17 12:52:15 -07:00 |
|
Nicolas
|
de439f6529
|
Update index.ts
|
2024-04-17 12:51:29 -07:00 |
|
Nicolas
|
871d5d91b0
|
Update index.ts
|
2024-04-17 12:51:12 -07:00 |
|
Nicolas
|
08ed68ff55
|
Nick: fixes
|
2024-04-17 12:44:23 -07:00 |
|
Nicolas
|
650852cc5a
|
Merge branch 'main' into feat/parse-to-markdown-tables
|
2024-04-17 12:28:17 -07:00 |
|
rafaelsideguide
|
ee8a097252
|
adding unit tests and fixing the parse function
|
2024-04-17 15:56:01 -03:00 |
|
Nicolas
|
2eb81545fa
|
Update index.test.ts
|
2024-04-17 11:04:03 -07:00 |
|
Nicolas
|
60245343c9
|
Merge branch 'main' into feat/improving-reative-paths
|
2024-04-17 10:57:49 -07:00 |
|
Nicolas
|
417921ea33
|
Update index.ts
|
2024-04-17 10:57:01 -07:00 |
|
rafaelsideguide
|
b375ce3e39
|
adding unit tests and bugfixing
|
2024-04-17 14:54:54 -03:00 |
|
Nicolas
|
82ed9515f1
|
Update index.ts
|
2024-04-17 10:52:10 -07:00 |
|
Nicolas
|
c837f1cc04
|
Merge pull request #12 from mendableai/bugfix/normalized-api-on-crawl-status
[Bugfix] added normalized apikey to craw/status route
|
2024-04-17 13:42:26 -04:00 |
|
Nicolas
|
db15724b0c
|
Update imageDescription.ts
|
2024-04-17 10:39:29 -07:00 |
|
Nicolas
|
27674a624d
|
Update index.ts
|
2024-04-17 10:39:00 -07:00 |
|
rafaelsideguide
|
25a9255c7e
|
[bugfix] added normalized apikey to craw/status route
|
2024-04-17 12:59:49 -03:00 |
|
rafaelsideguide
|
ff622739b7
|
Added a html to markdown table parser
|
2024-04-17 11:01:19 -03:00 |
|
Viktor Szépe
|
11394ef236
|
Delete apps/api/src/.DS_Store
|
2024-04-17 08:53:12 +02:00 |
|
Viktor Szépe
|
34ab21db59
|
Fix typos
|
2024-04-17 05:13:27 +00:00 |
|
rafaelsideguide
|
ed5dc808c7
|
Update imageDescription.ts
|
2024-04-16 18:05:07 -03:00 |
|
rafaelsideguide
|
00941d94a4
|
Added anthropic vision to getImageDescription function
|
2024-04-16 18:03:48 -03:00 |
|
rafaelsideguide
|
d23a7ae591
|
improving relative paths
|
2024-04-16 16:34:01 -03:00 |
|
rafaelsideguide
|
a04610302a
|
Spliting relative paths for images
|
2024-04-16 16:31:33 -03:00 |
|
Nicolas
|
4c4775e0b8
|
Nick:
|
2024-04-16 12:49:14 -04:00 |
|
Nicolas
|
93627ae87c
|
Nick:
|
2024-04-16 12:06:46 -04:00 |
|
Nicolas
|
3d260e94f3
|
Nick: fc- prefix
|
2024-04-15 20:39:25 -04:00 |
|
Nicolas
|
a6c2a87811
|
Initial commit
|
2024-04-15 17:01:47 -04:00 |
|