0

55 Commits

Author SHA1 Message Date
rafaelsideguide
dc6acbf1f0 Merge remote-tracking branch 'origin/main' into feat/allowbackwardcrawling-option 2024-06-12 11:01:05 -03:00
Nicolas
520739c9f4 Nick: fixed bugs associated with absolute path replacements 2024-06-11 12:43:16 -07:00
rafaelsideguide
ee282c3d55 Added allowBackwardCrawling option 2024-06-11 15:24:39 -03:00
Nicolas
f6b06ac27a Nick: ignoreSitemap, better crawling algo 2024-06-10 18:12:41 -07:00
Nicolas
b4c6819a54 Nick: 2024-06-05 11:11:09 -07:00
Nicolas
674500affa Nick: 2024-06-04 12:15:39 -07:00
Nicolas
6bea803120 Nick: 2024-05-31 15:39:54 -07:00
Nicolas
260e31c68b Merge branch 'nsc/new-pricing' 2024-05-30 16:08:31 -07:00
Nicolas
aa8133ca7f Update load-testing-example.ts 2024-05-30 16:07:14 -07:00
Nicolas
6c939d534d Nick: small refactor 2024-05-29 19:43:51 -07:00
Eric Ciarla
a0e404f94e init commit 2024-05-29 18:56:57 -04:00
Nicolas
1b3547dcf2 Nick: 2024-05-28 12:56:24 -07:00
Nicolas
a5e718b084 Nick: improvements 2024-05-21 18:34:23 -07:00
Nicolas
77a79b5a79 Nick: max num tokens for llm extract (for now) + slice the max 2024-05-20 17:07:38 -07:00
Nicolas
8a72cf556b Nick: 2024-05-13 21:10:58 -07:00
Nicolas
a96fc5b96d Nick: 4x speed 2024-05-13 20:45:11 -07:00
Eric Ciarla
4cc46d4af8 Update models.ts 2024-05-13 15:23:31 -04:00
Nicolas
dcedb8d798 Merge branch 'main' into feat/max-depth 2024-05-07 10:20:49 -07:00
Nicolas
6505bf6bf2 Merge branch 'main' into feat/max-depth 2024-05-07 10:20:44 -07:00
Nicolas
bdbee963f7 Merge branch 'main' into nsc/cancel-job 2024-05-07 10:13:43 -07:00
rafaelsideguide
e1f52c538f nested includeHtml inside pageOptions 2024-05-07 13:40:24 -03:00
rafaelsideguide
83f3408634 Added max depth option 2024-05-07 11:06:26 -03:00
Nicolas
6d5da358cc Nick: cancel job 2024-05-06 17:16:43 -07:00
rafaelsideguide
509250c4ef changed to includeHtml 2024-05-06 19:45:56 -03:00
rafaelsideguide
538355f1af Added toMarkdown option 2024-05-06 11:36:44 -03:00
Nicolas
2aa09a3000 Nick: partial docs working, cleaner 2024-05-04 12:30:12 -07:00
Nicolas
8a95cb42f0 Update models.ts 2024-04-30 18:36:21 -07:00
Nicolas
4967536501 Update index.ts 2024-04-30 18:19:55 -07:00
Nicolas
3c7030dbb1 Nick: improvements 2024-04-30 16:19:32 -07:00
Nicolas
4f526cff92 Nick: cleanup 2024-04-30 12:19:43 -07:00
Caleb Peffer
d1235a0029 Caleb: switched back to markdown for extraction 2024-04-30 10:23:12 -07:00
Caleb Peffer
3ca9e5153f Caleb: trying to get loggin workng 2024-04-30 09:20:15 -07:00
Caleb Peffer
4f7737c922 Caleb: added ajv json schema validation. 2024-04-29 12:12:55 -07:00
Caleb Peffer
667f740315 Caleb: converted llm response to json 2024-04-28 19:28:28 -07:00
Caleb Peffer
2ad7a58eb7 Caleb: first test passing 2024-04-28 17:38:20 -07:00
Caleb Peffer
06497729e2 Caleb: got it to a testable state I believe 2024-04-28 15:52:09 -07:00
Caleb Peffer
6ee1f2d3bc Caleb: initially pulled inspiration code from https://github.com/mishushakov/llm-scraper 2024-04-28 13:59:35 -07:00
Roger M
f2690f6909 Support for tbs, filter, lang, country and location with Serper search. 2024-04-25 01:35:17 +01:00
Nicolas
307ea6f5ec Nick: improvements to search 2024-04-24 10:11:01 -07:00
Nicolas
8939ca570b Merge branch 'main' into nsc/returnOnlyUrls 2024-04-23 18:05:48 -07:00
Nicolas
fdb2789eaa Nick: added url as return param 2024-04-23 17:14:34 -07:00
Nicolas
41263bb4b6 Nick: serper support 2024-04-23 16:45:06 -07:00
Nicolas
5e3e2ec966 Nick: 2024-04-23 15:44:11 -07:00
Nicolas
0146157876 Nick: mvp 2024-04-23 15:28:32 -07:00
Nicolas
306cfe4ce1 Nick: 2024-04-23 11:15:11 -07:00
Nicolas
52620bab16 Nick: prod and local-no-auth tests 2024-04-21 11:39:36 -07:00
Nicolas
5cdbf3a0ac Nick: cleaner functions to handle authenticated requests that dont require ifs everywhere 2024-04-21 10:36:48 -07:00
Nicolas
1a3aa2999d Nick: return the only list of urls 2024-04-20 11:59:42 -07:00
rafaelsideguide
72e1dadccd adding option to replace all relative paths with absolute paths 2024-04-19 11:47:20 -03:00
rafaelsideguide
57e5b36014 [Feat] Adding pdf parser 2024-04-18 11:43:57 -03:00