video2document

mirror of https://gitlab.rlp.net/proj-wise2526-video2document/video2document.git synced 2026-06-15 18:01:52 +02:00

Author	SHA1	Message	Date
Spanier, Pit	6cff6b9981	Merge branch 'develop' into 'feature/38-sprecher-audio-snippets-s4-11' # Conflicts: # main.js	2025-12-16 16:18:17 +01:00
santa	0003d99041	extract speaker snippets mit main verknüpft.	2025-12-16 15:32:31 +01:00
MikeHughes-BIN	04b2457ca3	Changes to the LLMs to return a Promise (outp path) and main now calls the export process	2025-12-15 17:58:03 +01:00
Hughes, Mike	a8c284edd2	Merge branch 'feature/30-backend-export-funktion-des-dokuments-in-verschiedene-datei-formate-s4-05' into 'develop' Feature/30 backend export funktion des dokuments in verschiedene datei formate s4 05 See merge request proj-wise2526-video2document/video2document!43	2025-12-15 15:03:36 +01:00
MikeHughes-BIN	3a5808d97c	Merge branch 'develop' into feature/30-backend-export-funktion-des-dokuments-in-verschiedene-datei-formate-s4-05	2025-12-15 14:41:28 +01:00
MikeHughes-BIN	405a32098c	Remove success and error icons from console log messages for consistency	2025-12-15 14:15:40 +01:00
Hughes, Mike	1683502aa1	Merge branch 'develop' into 'feature/35-backend-llm-chat-gpt-integration-s4-10' # Conflicts: # services/modules/llm-gemini/gemini.js	2025-12-15 14:07:32 +01:00
MikeHughes-BIN	ec57411992	Refactor document generation to output HTML format and update system instructions for clarity	2025-12-15 13:53:14 +01:00
MikeHughes-BIN	271fe78b7b	Changed the module to use puppeteer and html-to-docx	2025-12-14 18:14:16 +01:00
MikeHughes-BIN	1e38cc79f4	Extended the Role description	2025-12-14 16:53:35 +01:00
MikeHughes-BIN	746fec05d4	Changed model from llama to quen3 because llama was timing out due to token limit	2025-12-14 16:28:48 +01:00
Hughes, Mike	5ea8ec6a1a	Merge branch 'feature/implementation-of-ci-s4-09' into 'develop' implemented first CI pipeline version with a test test that only checks if 1... See merge request proj-wise2526-video2document/video2document!34	2025-12-13 14:13:06 +01:00
MikeHughes-BIN	7cd334645f	Added a module that safes the file in a requested format.	2025-12-11 14:22:04 +01:00
Spanier, Pit	15e2e35bda	Merge branch 'feature/35-backend-llm-chat-gpt-integration-s4-10' into 'develop' Feature/35 backend llm chat gpt integration s4 10 See merge request proj-wise2526-video2document/video2document!35	2025-12-11 13:30:14 +01:00
MikeHughes-BIN	53508b175a	changed Display name	2025-12-11 13:27:00 +01:00
MikeHughes-BIN	2b597add6c	fixed a unicode bug	2025-12-11 13:03:04 +01:00
MikeHughes-BIN	3af038d195	Multiple AI models implemented - chatgpt, llama	2025-12-11 12:41:11 +01:00
MikeHughes-BIN	b546c96238	Refactor code structure for improved readability and maintainability	2025-12-11 12:08:07 +01:00
emily	faee605f12	Initial version of the working CI pipeline Added mocha based unit tests for each module Did a bit of cleanup in the modules to remove debug console.log calls Removed the Progress bar in the extractor and the library requirement Promisified the gemini module to make sure it returns the path as a promise instead of just on the cli Fixed gitignore so that it now only ignores the content int the storage directories, and not the whole directories Added neetingReport.json for the LLMs to use	2025-12-09 22:07:43 +01:00
emily	f858f2edbb	added functionality for transcription tool to tell the frontend what audio format to use	2025-12-05 15:58:17 +01:00
MikeHughes-BIN	425e24853e	Similar to the Gemini LLM we now have a ChatGPT REST call. The API Key is still missing	2025-12-04 10:58:50 +01:00
MikeHughes-BIN	84cc94aa34	REST API & Auto Naming	2025-11-26 21:55:19 +01:00
santa	9ab98825a2	made it so that the transcript summary file name is not hardcoded anymore	2025-11-24 17:24:08 +01:00
emily	9a0a349813	Switched to transcription summarizer2 due to a better output format	2025-11-24 17:01:49 +01:00
emily	9dfc05e987	Fixed Transcription Summarizer, and now the whole pipeline works (except for LLM, because we need the keys for that)	2025-11-24 16:51:48 +01:00
santa	a5a60635fc	worked on fixing the code	2025-11-24 16:40:12 +01:00
emily	812bca8cfb	working on implementing everything Currently getting stuck on the assembly module as assembly ai for some fucking reason blocks me from making any requests This includes making a new user account	2025-11-24 15:35:32 +01:00
emily	34d644d7d7	Merge commit '2f5efee9c7f81cadc22f033f9ba20bb5cbc2d66e' into develop	2025-11-24 14:41:10 +01:00
santa	2f5efee9c7	Summarizer und weiters angepasst	2025-11-24 14:27:49 +01:00
santa	465fe8bd41	Summarizer angepasst	2025-11-24 14:25:29 +01:00
emily	474e587ff2	Added functionality to group modules for frontend	2025-11-24 12:53:05 +01:00
emily	c2f22b3525	merged Mikes code for the google gemini module included the requires libraries in the package.json implemented the library into requires.js cleaned up ffmpeg from the events	2025-11-24 12:12:46 +01:00
emily	6257ad05a8	Merge commit '4ade9575f33fd353522a07abe2f60e2d1be6feb4' into feature/modular-ipc-system-implementation	2025-11-24 12:08:35 +01:00
emily	bf7438b753	cleaned up ffmpegextractor.js	2025-11-24 12:07:54 +01:00
emily	868945fb92	Merge commit '925eb33eab45c084e9386ada66d9cb14b966e312' into feature/modular-ipc-system-implementation	2025-11-24 12:06:58 +01:00
Azeufack Noupeu Willy	a0ed2ab7bd	feat(extraction): add audio_ready event emission - Add EventEmitter to emit audio_ready when extraction completes - Pass sessionId and audioPath in event data - Export audioEvents for Main process orchestrator Refs: S3-06 AC1,AC3,AC7	2025-11-20 14:05:52 +01:00
santa	b87bfd444d	Test erstellt	2025-11-20 13:27:15 +01:00
santa	97b571b7f9	Einbindung des Summarizer	2025-11-20 11:51:34 +01:00
santa	455147a41b	Summarizer Tool erstellt zur zusammenfassung der vom TranskriptionTool kommenden json	2025-11-20 10:15:11 +01:00
MikeHughes-BIN	a178ccf30f	Added comments to my code to make it more understandable	2025-11-18 19:53:19 +01:00
MikeHughes-BIN	a1d804f463	Test created and changes to gemini.js file	2025-11-18 19:16:51 +01:00
MikeHughes-BIN	8e7e0b5043	Implement Gemini LLM module for document generation (first non tested prototype)	2025-11-17 21:16:50 +01:00
emily	4dc53b9d5f	implemented first version of the modular IPC system	2025-11-17 18:00:04 +01:00
MikeHughes-BIN	6083773f88	New Folder structure	2025-11-15 14:45:13 +01:00
emily	d9eacafc3a	fixed the program by moving the example module back to where the program can ACTUALLY load it	2025-11-14 14:28:11 +01:00
MikeHughes-BIN	9254ddc57f	Changed the Folder Structure for better maintainability	2025-11-13 17:34:22 +01:00
Azeufack Noupeu Willy	c021272ca4	merge: Resolve conflicts with develop	2025-11-13 15:22:33 +01:00
Azeufack Noupeu Willy	e7e97a7f60	feat(S2-02b): Implement AssemblyAI external transcription with speaker diarization - Add assembly.ts module for REST API transcription via AssemblyAI - Implement 5-step pipeline: upload → create job → poll status → download → save - Enable speaker_labels for diarization (Speaker A, B, C...) - Add millisecond-precision timestamps for each utterance - Store JSON transcripts in storage/transcripts/{session_id}.json - Add axios, dotenv dependencies - Add transcribeLatest.ts helper for quick testing User Story: S2-02b - Externe Transkription per REST API	2025-11-13 13:07:18 +01:00
emily	94f390f28b	Implemented audio extractor into IPC call, sooooooo, frontend and backend are now officially linked successfully	2025-11-12 20:28:55 +01:00
emily	92043440fe	Combined Frontend with backend, and implemented the extractor function into the module structure	2025-11-10 16:18:57 +01:00

1 2

57 Commits