Commit Graph

  • 5c10d3d58e feat: enhance EmbeddingFunc with model_name support BukeLy 2025-11-19 02:11:39 +08:00
  • 186c8f0e16 Preserve blank paragraphs in DOCX extraction to maintain spacing yangdx 2025-11-19 02:03:10 +08:00
  • fa887d811b Fix table column structure preservation in DOCX extraction yangdx 2025-11-19 01:52:02 +08:00
  • 4438ba41a3 Enhance DOCX extraction to preserve document order with tables yangdx 2025-11-19 01:31:33 +08:00
  • d16c7840ab Bump API version to 0256 yangdx 2025-11-18 23:15:31 +08:00
  • e77340d4a1 Adjust chunking parameters to match the default environment variable settings yangdx 2025-11-18 23:14:50 +08:00
  • 24423c9215 Merge branch 'fix_chunk_comment' yangdx 2025-11-18 22:47:23 +08:00
  • 1bfa1f81cb Merge branch 'main' into fix_chunk_comment yangdx 2025-11-18 22:38:50 +08:00
  • 9c10c87554 Fix linting yangdx 2025-11-18 22:38:43 +08:00
  • 9109509b1a Merge branch 'dev-postgres-vchordrq' yangdx 2025-11-18 22:25:35 +08:00
  • dbae327a17 Merge branch 'main' into dev-postgres-vchordrq yangdx 2025-11-18 22:13:27 +08:00
  • b583b8a59d Merge branch 'feature/postgres-vchordrq-indexes' into dev-postgres-vchordrq yangdx 2025-11-18 22:05:48 +08:00
  • 3096f844fb fix(postgres): allow vchordrq.epsilon config when probes is empty yangdx 2025-11-18 21:58:36 +08:00
  • dacca334e0 refactor(chunking): rename params and improve docstring for chunking_by_token_size EightyOliveira 2025-11-18 15:46:28 +08:00
  • f4bf5d279c fix: add logger to configure_vchordrq() and format code wmsnp 2025-11-18 15:31:08 +08:00
  • dfbc97363c Merge pull request #2369 from HKUDS/workspace-isolation Daniel.y 2025-11-18 15:21:10 +08:00
  • 702cfd2981 Fix document deletion concurrency control and validation logic yangdx 2025-11-18 13:59:24 +08:00
  • 656025b75e Rename GitHub workflow from "Tests" to "Offline Unit Tests" yangdx 2025-11-18 13:36:00 +08:00
  • 7e9c8ed1e8 Rename test classes to prevent warning from pytest yangdx 2025-11-18 13:33:05 +08:00
  • 4048fc4b89 Fix: auto-acquire pipeline when idle in document deletion yangdx 2025-11-18 13:25:13 +08:00
  • 1745b30a5f Fix missing workspace parameter in update flags status call yangdx 2025-11-18 12:55:48 +08:00
  • f8dd2e0724 Fix namespace parsing when workspace contains colons yangdx 2025-11-18 12:23:05 +08:00
  • 472b498ade Replace pytest group reference with explicit dependencies in evaluation yangdx 2025-11-18 12:17:21 +08:00
  • a11912ffa5 Add testing workflow guidelines to basic development rules yangdx 2025-11-18 11:54:19 +08:00
  • 41bf6d0283 Fix test to use default workspace parameter behavior yangdx 2025-11-18 11:51:17 +08:00
  • d07023c962 feat(postgres_impl): add vchordrq vector index support and unify vector index creation logic wmsnp 2025-11-18 11:45:16 +08:00
  • 4ea2124001 Add GitHub CI workflow and test markers for offline/integration tests yangdx 2025-11-18 11:36:10 +08:00
  • 4fef731f37 Standardize test directory creation and remove tempfile dependency yangdx 2025-11-18 10:39:54 +08:00
  • 1fe05df211 Refactor test configuration to use pytest fixtures and CLI options yangdx 2025-11-18 10:31:53 +08:00
  • 6ae0c14438 test: add concurrent execution to workspace isolation test yangdx 2025-11-18 10:17:34 +08:00
  • 6cef8df159 Reduce log level and improve workspace mismatch message clarity yangdx 2025-11-18 08:25:21 +08:00
  • fc9f7c705e Fix linting yangdx 2025-11-18 08:07:54 +08:00
  • f83b475ab1 Remove Dependabot configuration file yangdx 2025-11-18 01:42:15 +08:00
  • 21ad990e36 Improve workspace isolation tests with better parallelism checks and cleanup yangdx 2025-11-18 01:38:31 +08:00
  • 5da82bb096 Add pre-commit to pytest dependencies and format test code yangdx 2025-11-18 00:42:04 +08:00
  • 99262adaaa Enhance workspace isolation test with distinct mock data and persistence yangdx 2025-11-18 00:38:31 +08:00
  • b7b8d15632 Refactor pytest dependencies into separate optional group yangdx 2025-11-17 23:52:13 +08:00
  • 1874cfaf73 Fix linting yangdx 2025-11-17 23:32:38 +08:00
  • 3806892a40 Merge pull request #2371 from BukeLy/pytest-style-conversion Daniel.y 2025-11-17 23:28:56 +08:00
  • 1a1837028a docs: Update test file docstring to reflect all 11 test scenarios BukeLy 2025-11-17 19:02:46 +08:00
  • 3ec736932e test: Enhance E2E workspace isolation detection with content verification BukeLy 2025-11-17 18:55:45 +08:00
  • a990c1d40b fix: Correct Mock LLM output format in E2E test BukeLy 2025-11-17 18:49:54 +08:00
  • 288498ccdc test: Convert test_workspace_isolation.py to pytest style BukeLy 2025-11-17 18:24:52 +08:00
  • ddc76f0c80 Merge branch 'main' into workspace-isolation yangdx 2025-11-17 17:08:07 +08:00
  • 9262f66d13 Bump API version to 0255 yangdx 2025-11-17 17:07:18 +08:00
  • 393f880311 Improve LightRAG initialization checker tool with better usage docs yangdx 2025-11-17 15:42:54 +08:00
  • 9d7b7981ce Add pipeline status validation before document deletion yangdx 2025-11-17 14:58:10 +08:00
  • 98e964dfc4 Fix initialization instructions in check_lightrag_setup function yangdx 2025-11-17 14:27:26 +08:00
  • 6d6716e9f8 Add _default_workspace to shared storage finalization yangdx 2025-11-17 13:46:46 +08:00
  • cf73cb4d24 Remove unused variables from workspace isolation test yangdx 2025-11-17 13:13:12 +08:00
  • c1ec657c54 Fix linting yangdx 2025-11-17 13:08:34 +08:00
  • f1d8f18c80 Merge branch 'main' into workspace-isolation yangdx 2025-11-17 13:01:33 +08:00
  • 3e759f46d1 test: Add real integration and E2E tests for workspace isolation BukeLy 2025-11-17 12:16:32 +08:00
  • 436e41439e test: Enhance workspace isolation test suite to 100% coverage BukeLy 2025-11-17 11:46:45 +08:00
  • 4742fc8efa test: Add comprehensive workspace isolation test suite for PR #2366 BukeLy 2025-11-17 11:33:07 +08:00
  • cdd53ee875 Remove manual initialize_pipeline_status() calls across codebase yangdx 2025-11-17 07:28:41 +08:00
  • e22ac52ebc Auto-initialize pipeline status in LightRAG.initialize_storages() yangdx 2025-11-17 07:14:02 +08:00
  • e8383df3b8 Fix NamespaceLock context variable timing to prevent lock bricking yangdx 2025-11-17 06:43:37 +08:00
  • 95e1fb1612 Remove final_namespace attribute for in-memory storage and use namespace in clean_llm_query_cache.py yangdx 2025-11-17 06:28:34 +08:00
  • 7ed0eac4c9 Fix workspace filtering logic in get_all_update_flags_status yangdx 2025-11-17 06:16:26 +08:00
  • 78689e8837 Fix pipeline status namespace check to handle root case yangdx 2025-11-17 06:01:23 +08:00
  • d54d0d55d9 Standardize empty workspace handling from "_" to "" across storage yangdx 2025-11-17 05:58:11 +08:00
  • b6a5a90eaf Fix NamespaceLock concurrent coroutine safety with ContextVar yangdx 2025-11-17 05:27:31 +08:00
  • fd486bc922 Refactor storage classes to use namespace instead of final_namespace yangdx 2025-11-17 05:07:53 +08:00
  • 01814bfc7a Fix missing function call parentheses in get_all_update_flags_status yangdx 2025-11-17 04:11:06 +08:00
  • 7deb9a64b9 Refactor namespace lock to support reusable async context manager yangdx 2025-11-17 04:07:37 +08:00
  • 52c812b9a0 Fix workspace isolation for pipeline status across all operations yangdx 2025-11-17 03:45:51 +08:00
  • 926960e957 Refactor workspace handling to use default workspace and namespace locks yangdx 2025-11-17 02:32:00 +08:00
  • acae404f04 Update env.example yangdx 2025-11-15 01:25:56 +08:00
  • ec05d89c2a Add macOS fork safety check for Gunicorn multi-worker mode yangdx 2025-11-15 00:58:23 +08:00
  • 8abc2ac1cb Update edge keywords extraction in graph visualization Sleeep 2025-11-13 15:52:14 +08:00
  • e5addf4d94 Improve embedding config priority and add debug logging yangdx 2025-11-14 23:22:44 +08:00
  • 2fb57e767d Fix embedding token limit initialization order yangdx 2025-11-14 22:56:03 +08:00
  • 6b2af2b579 Refactor embedding function creation with proper attribute inheritance yangdx 2025-11-14 22:29:08 +08:00
  • f0254773c6 Convert embedding_token_limit from property to field with __post_init__ yangdx 2025-11-14 20:58:41 +08:00
  • 14a6c24ed7 Add configurable embedding token limit with validation yangdx 2025-11-14 19:28:36 +08:00
  • f5b48587ed Improve Bedrock error handling with retry logic and custom exceptions yangdx 2025-11-14 18:51:41 +08:00
  • 77221564b0 Add max_token_size parameter to embedding function decorators yangdx 2025-11-14 18:41:43 +08:00
  • 8283c86bce Refactor exception handling in MemgraphStorage label methods yangdx 2025-11-14 11:01:26 +08:00
  • 423e4e927a Fix null reference errors in graph database error handling yangdx 2025-11-14 10:39:04 +08:00
  • 2f2f35b883 Add macOS compatibility check for DOCLING with multi-worker Gunicorn yangdx 2025-11-13 19:18:04 +08:00
  • c246eff725 Improve docling integration with macOS compatibility and CLI flag yangdx 2025-11-13 18:58:09 +08:00
  • 63510478e5 Improve error handling and logging in cloud model detection yangdx 2025-11-13 20:41:44 +08:00
  • 67dfd85679 Add a better regex LacombeLouis 2025-11-13 12:17:51 +01:00
  • 5127bf20ae Add support for environment variable fallback for API key and default host for cloud models Louis Lacombe 2025-11-12 16:11:05 +00:00
  • fa9206d69a Update uv.lock yangdx 2025-11-13 15:31:51 +08:00
  • 7b7f93d77c Implement lazy configuration initialization for API server yangdx 2025-11-13 15:28:05 +08:00
  • 69a0b74ce7 refactor: move document deps to api group, remove dynamic imports yangdx 2025-11-13 13:34:09 +08:00
  • 7d394fb0a4 Replace asyncio.iscoroutine with inspect.isawaitable for better detection yangdx 2025-11-13 12:56:01 +08:00
  • 72f68c2a61 Update env.example yangdx 2025-11-13 11:40:56 +08:00
  • a08bc72635 Fix empty dict handling after JSON sanitization yangdx 2025-11-12 16:40:57 +08:00
  • cca0800ed4 Fix migration to reload sanitized data and prevent memory corruption yangdx 2025-11-12 16:16:28 +08:00
  • 7f54f47093 Optimize JSON string sanitization with precompiled regex and zero-copy yangdx 2025-11-12 15:42:07 +08:00
  • f289cf6225 Optimize JSON write with fast/slow path to reduce memory usage yangdx 2025-11-12 13:48:56 +08:00
  • 93a3e47134 Remove deprecated response_type parameter from query settings yangdx 2025-11-12 12:19:30 +08:00
  • abeaac84fa Improve JSON data sanitization to handle tuples and dict keys yangdx 2025-11-12 00:50:18 +08:00
  • 5885637ebf Add specialized JSON string sanitizer to prevent UTF-8 encoding errors yangdx 2025-11-12 00:38:47 +08:00
  • 23cbb9c9b2 Add data sanitization to JSON writing to prevent UTF-8 encoding errors yangdx 2025-11-12 00:11:13 +08:00
  • ff8f158891 Update env.example yangdx 2025-11-11 12:02:37 +08:00
  • c434879c7a Replace PyPDF2 with pypdf for PDF processing yangdx 2025-11-11 01:38:09 +08:00