diff --git a/.autodoc/docs/data/docstore.json b/.autodoc/docs/data/docstore.json index 7d5f8cd..dc401f7 100644 --- a/.autodoc/docs/data/docstore.json +++ b/.autodoc/docs/data/docstore.json @@ -1 +1 @@ -[["0",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/estimate/index.ts)\n\nThe `estimate` function in this code file is responsible for providing an estimated cost of indexing a given repository using the AutodocRepoConfig configuration. This function is particularly useful for users who want to get an idea of the cost involved in processing their repository before actually running the process.\n\nThe function takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe main steps involved in the function are:\n\n1. Set the output path for the JSON files generated during the process.\n2. Update the spinner text to display \"Estimating cost...\".\n3. Perform a dry run of the `processRepository` function with the given configuration options. The dry run does not actually process the repository but instead returns the details of the models that would be processed.\n4. Stop the spinner once the dry run is complete.\n5. Print the details of the models obtained from the dry run using the `printModelDetails` utility function.\n6. Calculate the total estimated cost using the `totalIndexCostEstimate` utility function.\n7. Display the estimated cost in a user-friendly format using the `chalk` library.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './autodoc/estimate';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output/',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example demonstrates how a user can call the `estimate` function with a specific configuration to get an estimated cost for processing their repository.\n## Questions: \n 1. **What is the purpose of the `estimate` function and what parameters does it accept?**\n\n The `estimate` function is used to estimate the cost of processing a repository for indexing. It accepts an `AutodocRepoConfig` object as a parameter, which contains various configuration options such as repository URL, output path, and other settings.\n\n2. **How does the `estimate` function calculate the cost estimate?**\n\n The `estimate` function performs a dry run of the `processRepository` command to get the estimated price for indexing the repository. It then uses the `totalIndexCostEstimate` function to calculate the total cost based on the returned run details.\n\n3. **What is the purpose of the `printModelDetails` function and how is it used in the `estimate` function?**\n\n The `printModelDetails` function is used to display the details of the models used in the estimation process. In the `estimate` function, it is called with the values of the `runDetails` object to print the model details before displaying the total cost estimate.","metadata":{"source":".autodoc/docs/markdown/src/cli/commands/estimate/index.md"}}],["1",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/estimate)\n\nThe `estimate` function in `index.ts` is a crucial part of the Autodoc project, as it allows users to estimate the cost of indexing a given repository before actually processing it. This function takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository.\n\nThe main steps involved in the `estimate` function are:\n\n1. Setting the output path for the JSON files generated during the process.\n2. Updating the spinner text to display \"Estimating cost...\".\n3. Performing a dry run of the `processRepository` function with the given configuration options. The dry run does not actually process the repository but instead returns the details of the models that would be processed.\n4. Stopping the spinner once the dry run is complete.\n5. Printing the details of the models obtained from the dry run using the `printModelDetails` utility function.\n6. Calculating the total estimated cost using the `totalIndexCostEstimate` utility function.\n7. Displaying the estimated cost in a user-friendly format using the `chalk` library.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './autodoc/estimate';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output/',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example demonstrates how a user can call the `estimate` function with a specific configuration to get an estimated cost for processing their repository. The function is designed to work seamlessly with other parts of the Autodoc project, such as the `processRepository` function, which is responsible for the actual processing of the repository.\n\nBy providing an estimated cost upfront, the `estimate` function helps users make informed decisions about whether to proceed with the indexing process or not. This can be particularly useful for users with large repositories or those who are working within a budget. Overall, the `estimate` function is an essential tool for users looking to leverage the power of Autodoc while managing their costs effectively.","metadata":{"source":".autodoc/docs/markdown/src/cli/commands/estimate/summary.md"}}],["2",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/index/convertJsonToMarkdown.ts)\n\nThe `convertJsonToMarkdown` function in this code is responsible for converting JSON files containing documentation information into Markdown files. This is done in two main steps: counting the number of files in the project and creating Markdown files for each code file in the project.\n\nFirst, the function uses the `traverseFileSystem` utility to count the number of files in the project. It takes an `AutodocRepoConfig` object as input, which contains information about the project, such as its name, root directory, output directory, and other configuration options. The `traverseFileSystem` utility is called with a `processFile` function that increments the `files` counter for each file encountered.\n\n```javascript\nawait traverseFileSystem({\n inputPath: inputRoot,\n projectName,\n processFile: () => {\n files++;\n return Promise.resolve();\n },\n ignore: [],\n filePrompt,\n folderPrompt,\n contentType,\n targetAudience,\n linkHosted,\n});\n```\n\nNext, the function defines another `processFile` function that reads the content of each JSON file, converts it to a Markdown format, and writes the output to a new Markdown file in the specified output directory. It first checks if the content exists, and if not, it returns early. It then creates the output directory if it doesn't exist, and parses the JSON content into either a `FolderSummary` or a `FileSummary` object, depending on the file name.\n\nThe function then constructs the Markdown content by including a link to the code on GitHub, the summary, and any questions if they exist. Finally, it writes the Markdown content to the output file with the `.md` extension.\n\n```javascript\nconst outputPath = getFileName(markdownFilePath, '.', '.md');\nawait fs.writeFile(outputPath, markdown, 'utf-8');\n```\n\nThe `convertJsonToMarkdown` function is then called again with the new `processFile` function to create the Markdown files for each code file in the project.\n\n```javascript\nawait traverseFileSystem({\n inputPath: inputRoot,\n projectName,\n processFile,\n ignore: [],\n filePrompt,\n folderPrompt,\n contentType,\n targetAudience,\n linkHosted,\n});\n```\n\nIn summary, this code is responsible for converting JSON files containing documentation information into Markdown files, which can be used in the larger Autodoc project to generate documentation for code repositories.\n## Questions: \n 1. **What is the purpose of the `convertJsonToMarkdown` function?**\n\n The `convertJsonToMarkdown` function is responsible for converting JSON files containing summaries and questions about code files in a project into Markdown files. It traverses the file system, reads the JSON files, and creates corresponding Markdown files with the provided information.\n\n2. **How does the `traverseFileSystem` function work and what are its parameters?**\n\n The `traverseFileSystem` function is a utility function that recursively traverses the file system starting from a given input path. It takes an object as a parameter with properties such as `inputPath`, `projectName`, `processFile`, `ignore`, `filePrompt`, `folderPrompt`, `contentType`, `targetAudience`, and `linkHosted`. The function processes each file using the provided `processFile` callback and can be configured to ignore certain files or folders.\n\n3. **What is the purpose of the `processFile` function inside `convertJsonToMarkdown`?**\n\n The `processFile` function is a callback function that is passed to the `traverseFileSystem` function. It is responsible for reading the content of a JSON file, parsing it, and creating a corresponding Markdown file with the summary and questions. It also handles creating the output directory if it doesn't exist and writing the Markdown content to the output file.","metadata":{"source":".autodoc/docs/markdown/src/cli/commands/index/convertJsonToMarkdown.md"}}],["3",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/index/createVectorStore.ts)\n\nThe code in this file is responsible for processing a directory of text files, splitting the text into chunks, and creating a vector store using the HNSWLib library and OpenAIEmbeddings.\n\nThe `processFile` function takes a file path as input and returns a Promise that resolves to a Document object. It reads the file contents and creates a Document object with the file contents as `pageContent` and the file path as metadata.\n\nThe `processDirectory` function takes a directory path as input and returns a Promise that resolves to an array of Document objects. It reads the files in the directory and calls `processFile` for each file. If a file is a directory, it calls `processDirectory` recursively. The function accumulates all the Document objects in an array and returns it.\n\nThe `RepoLoader` class extends the `BaseDocumentLoader` class and has a constructor that takes a file path as input. It has a `load` method that calls the `processDirectory` function with the file path and returns the resulting array of Document objects.\n\nThe `createVectorStore` function is an async function that takes an AutodocRepoConfig object as input, which contains the root directory and output file path. It creates a RepoLoader instance with the root directory, loads the raw documents, and splits them into chunks using the `RecursiveCharacterTextSplitter` class. It then creates a vector store using the HNSWLib library and OpenAIEmbeddings, and saves the vector store to the output file path.\n\nExample usage:\n\n```javascript\nconst config = {\n root: './data/documents',\n output: './data/vector_store',\n};\n\ncreateVectorStore(config).then(() => {\n console.log('Vector store created successfully');\n});\n```\n\nThis code snippet would process all the text files in the `./data/documents` directory, split the text into chunks, create a vector store using the HNSWLib library and OpenAIEmbeddings, and save the vector store to the `./data/vector_store` file.\n## Questions: \n 1. **Question:** What is the purpose of the `processFile` function and how does it handle errors?\n **Answer:** The `processFile` function reads the content of a file and creates a `Document` object with the file contents and metadata. If there is an error while reading the file, it rejects the promise with the error.\n\n2. **Question:** How does the `processDirectory` function handle nested directories and files?\n **Answer:** The `processDirectory` function iterates through the files in a directory. If it encounters a subdirectory, it calls itself recursively to process the subdirectory. If it encounters a file, it processes the file using the `processFile` function and adds the resulting `Document` object to the `docs` array.\n\n3. **Question:** What is the purpose of the `createVectorStore` function and how does it use the `RepoLoader` class?\n **Answer:** The `createVectorStore` function is responsible for creating a vector store from a given repository. It uses the `RepoLoader` class to load all the documents from the repository, splits the text into chunks using the `RecursiveCharacterTextSplitter`, and then creates a vector store using the `HNSWLib.fromDocuments` method with the `OpenAIEmbeddings`. Finally, it saves the vector store to the specified output path.","metadata":{"source":".autodoc/docs/markdown/src/cli/commands/index/createVectorStore.md"}}],["4",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/index/index.ts)\n\nThe code in this file is responsible for processing a given repository and generating documentation in JSON and Markdown formats, as well as creating vector files for the documentation. It exports a single function `index` that takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository.\n\nThe `index` function performs the following steps:\n\n1. Define the paths for JSON, Markdown, and data output directories within the `output` folder.\n\n2. Process the repository by traversing its files, calling the LLMS (Language Learning Management System) for each file, and creating JSON files with the results. This is done using the `processRepository` function, which takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step.\n\n3. Convert the generated JSON files into Markdown format using the `convertJsonToMarkdown` function. This function also takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step, and a success message is displayed upon completion.\n\n4. Create vector files for the generated Markdown documentation using the `createVectorStore` function. This function also takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step, and a success message is displayed upon completion.\n\nHere's an example of how this code might be used in the larger project:\n\n```javascript\nimport autodoc from './autodoc';\n\nconst config = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n root: './src',\n output: './output',\n llms: 'https://llms.example.com',\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'text',\n targetAudience: 'developers',\n linkHosted: 'https://myproject-docs.example.com',\n};\n\nautodoc.index(config);\n```\n\nThis example would process the `MyProject` repository, generate JSON and Markdown documentation, and create vector files for the documentation, all while providing progress updates through spinner text.\n## Questions: \n 1. **What is the purpose of the `index` function in this code?**\n\n The `index` function is the main entry point for the autodoc project. It processes a given repository, converts the JSON files to markdown, and creates vector files based on the provided configuration options.\n\n2. **What are the different steps involved in processing the repository?**\n\n The processing of the repository involves three main steps: (1) traversing the repository and calling LLMS for each file to create JSON files with the results, (2) converting the JSON files to markdown files, and (3) creating vector files from the markdown files.\n\n3. **What is the role of the `AutodocRepoConfig` type?**\n\n The `AutodocRepoConfig` type is used to define the shape of the configuration object that is passed to the `index` function. It specifies the properties and their types that are required for the function to process the repository, convert JSON to markdown, and create vector files.","metadata":{"source":".autodoc/docs/markdown/src/cli/commands/index/index.md"}}],["5",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/index/processRepository.ts)\n\nThe `processRepository` function in this code is responsible for processing a given code repository and generating summaries and questions for each file and folder within the repository. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository URL, input and output paths, language models to use, and other settings.\n\nThe function starts by initializing an `APIRateLimit` instance to limit the number of API calls made to the language models. It then defines several helper functions, such as `callLLM` for making API calls, `isModel` for checking if a given model is valid, `processFile` for processing individual files, and `processFolder` for processing folders.\n\nThe `processFile` function reads the content of a file, generates prompts for summaries and questions using the `createCodeFileSummary` and `createCodeQuestions` functions, and selects the best language model to use based on the token length of the prompts. It then calls the language model API to generate the summaries and questions, and saves the results as JSON files in the output directory.\n\nThe `processFolder` function reads the contents of a folder, filters out ignored files, and processes each file and subfolder within the folder. It then generates a summary prompt using the `folderSummaryPrompt` function and calls the language model API to generate a summary for the folder. The folder summary, along with the summaries and questions of its files and subfolders, is saved as a JSON file in the output directory.\n\nThe main part of the `processRepository` function first counts the number of files and folders in the input directory using the `filesAndFolders` function. It then processes each file and folder using the `traverseFileSystem` function, which calls the `processFile` and `processFolder` functions for each file and folder encountered. Finally, the function returns the language models used during processing.\n\nExample usage of the `processRepository` function:\n\n```javascript\nconst autodocConfig = {\n name: 'myProject',\n repositoryUrl: 'https://github.com/user/myProject',\n root: 'src',\n output: 'output',\n llms: [LLMModels.GPT3, LLMModels.GPT4],\n ignore: ['.git', 'node_modules'],\n filePrompt: 'Explain this code file',\n folderPrompt: 'Summarize this folder',\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nprocessRepository(autodocConfig).then((models) => {\n console.log('Processing complete');\n});\n```\n\nThis code would process the `src` directory of the `myProject` repository, generating summaries and questions for each file and folder, and saving the results in the `output` directory.\n## Questions: \n 1. **Question:** What is the purpose of the `processRepository` function and what are its input parameters?\n **Answer:** The `processRepository` function is responsible for processing a code repository by generating summaries and questions for each file and folder in the project. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the project name, repository URL, input and output paths, language models, and other settings. Additionally, it accepts an optional `dryRun` parameter, which, if set to true, will not save the generated summaries and questions to disk.\n\n2. **Question:** How does the code determine the best language model to use for generating summaries and questions?\n **Answer:** The code checks the maximum token length of each available language model (GPT3, GPT4, and GPT432k) and compares it with the token length of the prompts (summary and questions). It selects the first model that can handle the maximum token length and is included in the `llms` array provided in the configuration.\n\n3. **Question:** How does the code handle traversing the file system and processing files and folders?\n **Answer:** The code uses the `traverseFileSystem` utility function to traverse the file system. It takes an object with various configuration options, including the input path, project name, and callbacks for processing files and folders. The `processFile` and `processFolder` functions are passed as callbacks to handle the processing of files and folders, respectively.","metadata":{"source":".autodoc/docs/markdown/src/cli/commands/index/processRepository.md"}}],["6",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/index/prompts.ts)\n\nThe code in this file provides three functions that generate prompts for documentation experts to create summaries and answer questions about code files and folders in a project. These functions are likely used in the larger autodoc project to automate the process of generating documentation for code files and folders.\n\n1. `createCodeFileSummary`: This function takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `filePrompt`. It returns a formatted string prompt for a documentation expert to write a summary of the code file. The prompt includes the file path, project name, content type, and a custom file prompt. For example:\n\n```javascript\ncreateCodeFileSummary('src/example.js', 'autodoc', 'console.log(\"Hello, World!\");', 'JavaScript', 'Write a detailed technical explanation of what this code does.');\n```\n\n2. `createCodeQuestions`: This function takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `targetAudience`. It returns a formatted string prompt for a documentation expert to generate three questions and answers that a target audience might have about the code file. The prompt includes the file path, project name, content type, and target audience. For example:\n\n```javascript\ncreateCodeQuestions('src/example.js', 'autodoc', 'console.log(\"Hello, World!\");', 'JavaScript', 'beginner');\n```\n\n3. `folderSummaryPrompt`: This function takes six parameters: `folderPath`, `projectName`, `files`, `folders`, `contentType`, and `folderPrompt`. It returns a formatted string prompt for a documentation expert to write a summary of the folder and its contents. The prompt includes the folder path, project name, content type, a list of files and their summaries, a list of subfolders and their summaries, and a custom folder prompt. For example:\n\n```javascript\nfolderSummaryPrompt('src/', 'autodoc', [{fileName: 'example.js', summary: 'A simple example file'}], [{folderName: 'utils', summary: 'Utility functions'}], 'JavaScript', 'Write a detailed technical explanation of the folder structure and contents.');\n```\n\nThese functions can be used in the autodoc project to generate prompts for documentation experts, helping to streamline the process of creating documentation for code files and folders.\n## Questions: \n 1. **Question:** What is the purpose of the `createCodeFileSummary` function?\n **Answer:** The `createCodeFileSummary` function generates a string template for a code file summary prompt, which includes the file path, project name, file contents, content type, and a file prompt.\n\n2. **Question:** How does the `createCodeQuestions` function differ from the `createCodeFileSummary` function?\n **Answer:** The `createCodeQuestions` function generates a string template for a code documentation prompt that asks for 3 questions and their answers, while the `createCodeFileSummary` function generates a string template for a code file summary prompt.\n\n3. **Question:** What is the purpose of the `folderSummaryPrompt` function and what parameters does it take?\n **Answer:** The `folderSummaryPrompt` function generates a string template for a folder summary prompt, which includes the folder path, project name, files, folders, content type, and a folder prompt. It takes parameters such as folderPath, projectName, files, folders, contentType, and folderPrompt.","metadata":{"source":".autodoc/docs/markdown/src/cli/commands/index/prompts.md"}}],["7",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/index)\n\nThe code in this folder is responsible for processing a given code repository, generating documentation in JSON and Markdown formats, and creating vector files for the documentation. It provides several functions and utilities to achieve these tasks, such as traversing the file system, calling language models, and converting JSON files to Markdown.\n\nFor example, the `processRepository` function processes a code repository and generates summaries and questions for each file and folder within the repository. It uses helper functions like `callLLM` to make API calls to language models and `processFile` and `processFolder` to process individual files and folders. The results are saved as JSON files in the output directory.\n\nThe `convertJsonToMarkdown` function converts JSON files containing documentation information into Markdown files. It counts the number of files in the project and creates Markdown files for each code file in the project using the `traverseFileSystem` utility.\n\nThe `createVectorStore` function processes a directory of text files, splits the text into chunks, and creates a vector store using the HNSWLib library and OpenAIEmbeddings. It processes the files in the directory and calls `processFile` for each file, creating a vector store and saving it to the output file path.\n\nHere's an example of how this code might be used in the larger project:\n\n```javascript\nimport autodoc from './autodoc';\n\nconst config = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n root: './src',\n output: './output',\n llms: 'https://llms.example.com',\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'text',\n targetAudience: 'developers',\n linkHosted: 'https://myproject-docs.example.com',\n};\n\nautodoc.index(config);\n```\n\nThis example would process the `MyProject` repository, generate JSON and Markdown documentation, and create vector files for the documentation, all while providing progress updates through spinner text.\n\nIn summary, the code in this folder plays a crucial role in the Autodoc project by processing code repositories, generating documentation in various formats, and creating vector files for the documentation. This helps developers to easily generate and maintain documentation for their projects, making it more accessible and understandable for other developers and users.","metadata":{"source":".autodoc/docs/markdown/src/cli/commands/index/summary.md"}}],["8",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/init/index.ts)\n\nThis code is responsible for initializing and configuring the `autodoc` project. It provides a function `init` that creates a configuration file `autodoc.config.json` with user inputs and default values. The configuration file is essential for the project to function correctly and adapt to different user requirements.\n\nThe `makeConfigTemplate` function generates a default configuration object with pre-defined values. It takes an optional `config` parameter to override the default values. The returned object contains settings such as repository name, URL, output directory, LLM models, and various prompts for generating documentation.\n\nThe `init` function is an asynchronous function that takes an optional `config` parameter. It first checks if a configuration file already exists in the project directory. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf there is no existing configuration file or the user chooses to overwrite, the function prompts the user for the repository name, URL, and LLM models they have access to. These values are then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the `autodoc.config.json` file in the project directory. A success message is displayed, instructing the user to run `doc index` to get started.\n\nHere's an example of how the `init` function is used:\n\n```javascript\nimport { init } from './autodoc';\n\n(async () => {\n await init();\n})();\n```\n\nThis code imports the `init` function and calls it, initializing the `autodoc` project with the user's inputs and default values.\n## Questions: \n 1. **Question:** What is the purpose of the `makeConfigTemplate` function and what does it return?\n **Answer:** The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc project. It takes an optional `config` parameter of type `AutodocRepoConfig` and returns a new `AutodocRepoConfig` object with default values for each property, using the provided `config` values if available.\n\n2. **Question:** How does the `init` function work and what does it do with the user's input?\n **Answer:** The `init` function is an asynchronous function that initializes the Autodoc configuration by prompting the user for input using the `inquirer` package. It takes an optional `config` parameter of type `AutodocRepoConfig` and uses it as the default values for the prompts. After collecting the user's input, it creates a new configuration object using the `makeConfigTemplate` function and writes it to a file named `autodoc.config.json`.\n\n3. **Question:** What are the different LLM models available in the `llms` prompt and how are they used in the configuration?\n **Answer:** The `llms` prompt provides three choices for the user to select the LLM models they have access to: GPT-3.5 Turbo, GPT-3.5 Turbo and GPT-4 8K (Early Access), and GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The selected LLM models are stored in the `llms` property of the `AutodocRepoConfig` object, which can be used later in the project to determine which models to use for generating documentation.","metadata":{"source":".autodoc/docs/markdown/src/cli/commands/init/index.md"}}],["9",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/init)\n\nThe `index.ts` file in the `init` folder is responsible for initializing and configuring the `autodoc` project. It provides an essential function called `init` that creates a configuration file named `autodoc.config.json` with user inputs and default values. This configuration file is crucial for the project to function correctly and adapt to different user requirements.\n\nThe `makeConfigTemplate` function generates a default configuration object with pre-defined values. It takes an optional `config` parameter to override the default values. The returned object contains settings such as repository name, URL, output directory, LLM models, and various prompts for generating documentation.\n\nThe `init` function is an asynchronous function that takes an optional `config` parameter. It first checks if a configuration file already exists in the project directory. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf there is no existing configuration file or the user chooses to overwrite, the function prompts the user for the repository name, URL, and LLM models they have access to. These values are then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the `autodoc.config.json` file in the project directory. A success message is displayed, instructing the user to run `doc index` to get started.\n\nHere's an example of how the `init` function is used:\n\n```javascript\nimport { init } from './autodoc';\n\n(async () => {\n await init();\n})();\n```\n\nThis code imports the `init` function and calls it, initializing the `autodoc` project with the user's inputs and default values. The `init` function is a crucial part of the project, as it sets up the necessary configuration for the project to work correctly. It interacts with other parts of the project by providing the required settings and values, ensuring that the project can adapt to different user requirements and preferences.","metadata":{"source":".autodoc/docs/markdown/src/cli/commands/init/summary.md"}}],["10",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/query/createChatChain.ts)\n\nThis code defines a function `makeChain` that creates a chatbot for answering questions about a software project. The chatbot is built using the `ChatVectorDBQAChain` class, which combines two separate language models: a question generator and a document chain.\n\nThe question generator is an instance of the `LLMChain` class, which uses the OpenAIChat API to generate standalone questions based on a given conversation history. The `CONDENSE_PROMPT` template is used to format the input for the language model.\n\nThe document chain is created using the `loadQAChain` function, which takes an instance of the OpenAIChat API and a prompt template as input. The `makeQAPrompt` function generates this template, which instructs the language model to provide a conversational answer with hyperlinks to the project's GitHub repository. The answer should be tailored to the target audience and include code examples when appropriate.\n\nThe `makeChain` function takes the following parameters:\n\n- `projectName`: The name of the software project.\n- `repositoryUrl`: The URL of the project's GitHub repository.\n- `contentType`: The type of content the chatbot is trained on (e.g., code, documentation).\n- `chatPrompt`: Additional instructions for answering questions about the content.\n- `targetAudience`: The intended audience for the chatbot's answers (e.g., developers, users).\n- `vectorstore`: An instance of the `HNSWLib` class for storing and searching vectors.\n- `llms`: An array of language models (e.g., GPT-3, GPT-4).\n- `onTokenStream`: An optional callback function to handle streaming tokens.\n\nExample usage:\n\n```javascript\nconst chatbot = makeChain(\n \"autodoc\",\n \"https://github.com/autodoc/autodoc\",\n \"code\",\n \"\",\n \"developer\",\n vectorstore,\n [gpt3, gpt4],\n (token) => console.log(token)\n);\n```\n\nThis creates a chatbot that can answer questions about the \"autodoc\" project, using the provided language models and vector store.\n## Questions: \n 1. **Question:** What is the purpose of the `makeChain` function and what are its input parameters?\n **Answer:** The `makeChain` function is used to create a new `ChatVectorDBQAChain` instance, which is responsible for generating questions and answers based on the given input parameters. The input parameters include `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and an optional `onTokenStream` callback function.\n\n2. **Question:** What are the roles of `CONDENSE_PROMPT` and `QA_PROMPT` in the code?\n **Answer:** `CONDENSE_PROMPT` is a template for generating a standalone question from a given chat history and follow-up input. `QA_PROMPT` is a template for generating a conversational answer with hyperlinks back to GitHub, based on the given context and question. Both templates are used in the `LLMChain` and `loadQAChain` instances, respectively.\n\n3. **Question:** How does the `onTokenStream` callback function work and when is it used?\n **Answer:** The `onTokenStream` callback function is an optional parameter in the `makeChain` function. It is used to handle the streaming of tokens generated by the OpenAIChat instance. If provided, it will be called with each new token generated during the chat process, allowing developers to handle or process the tokens in real-time.","metadata":{"source":".autodoc/docs/markdown/src/cli/commands/query/createChatChain.md"}}],["11",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/query/index.ts)\n\nThis code defines a chatbot interface for the Autodoc project, which allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation.\n\nThe code starts by importing necessary libraries and setting up the `marked` library with a custom terminal renderer for displaying Markdown content. It then defines a `chatHistory` array to store the conversation history between the user and the chatbot.\n\nThe `displayWelcomeMessage` function is used to display a welcome message to the user when they start the chatbot. The `clearScreenAndMoveCursorToTop` function clears the terminal screen and moves the cursor to the top.\n\nThe main function, `query`, takes two arguments: `AutodocRepoConfig` and `AutodocUserConfig`. It initializes the `vectorStore` by loading pre-trained embeddings and creates a `chain` object using the `makeChain` function. This chain object is responsible for generating responses based on the user's input.\n\nThe `getQuestion` function uses the `inquirer` library to prompt the user for a question. The main loop of the chatbot starts by getting the user's question and continues until the user types 'exit'. Inside the loop, the code updates the spinner text to 'Thinking...' and calls the `chain` object with the user's question and chat history. The response is then displayed in Markdown format using the `marked` library.\n\nIf an error occurs during the process, the chatbot displays an error message and prompts the user for another question.\n\nExample usage:\n\n```javascript\nquery(repoConfig, userConfig);\n```\n\nThis chatbot interface can be used in the larger Autodoc project to help users navigate and understand the codebase more efficiently by providing a conversational interface for asking questions and receiving answers.\n## Questions: \n 1. **What is the purpose of the `query` function and what are its input parameters?**\n\n The `query` function is used to interact with the chatbot, taking user input and providing responses based on the given codebase. It takes two input parameters: an `AutodocRepoConfig` object containing information about the repository, and an `AutodocUserConfig` object containing user-specific configuration.\n\n2. **How does the `vectorStore` work and what is its role in the code?**\n\n The `vectorStore` is an instance of HNSWLib loaded with data from the specified output directory and using OpenAIEmbeddings. It is used to store and retrieve vector representations of the codebase, which are then used by the `makeChain` function to generate responses to user questions.\n\n3. **How does the chat history work and what is its purpose?**\n\n The `chatHistory` is an array of string pairs, where each pair represents a user question and the corresponding chatbot response. It is used to store the conversation history between the user and the chatbot, allowing the chatbot to provide context-aware responses based on previous interactions.","metadata":{"source":".autodoc/docs/markdown/src/cli/commands/query/index.md"}}],["12",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/query)\n\nThe `query` folder in the Autodoc project contains code for creating a chatbot interface that allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation.\n\nIn `createChatChain.ts`, the `makeChain` function is defined, which creates a chatbot using the `ChatVectorDBQAChain` class. This class combines two separate language models: a question generator and a document chain. The question generator is an instance of the `LLMChain` class, which uses the OpenAIChat API to generate standalone questions based on a given conversation history. The document chain is created using the `loadQAChain` function, which takes an instance of the OpenAIChat API and a prompt template as input.\n\nExample usage of `makeChain`:\n\n```javascript\nconst chatbot = makeChain(\n \"autodoc\",\n \"https://github.com/autodoc/autodoc\",\n \"code\",\n \"\",\n \"developer\",\n vectorstore,\n [gpt3, gpt4],\n (token) => console.log(token)\n);\n```\n\nIn `index.ts`, the main chatbot interface is defined. It starts by importing necessary libraries and setting up the `marked` library with a custom terminal renderer for displaying Markdown content. The main function, `query`, takes two arguments: `AutodocRepoConfig` and `AutodocUserConfig`. It initializes the `vectorStore` by loading pre-trained embeddings and creates a `chain` object using the `makeChain` function. This chain object is responsible for generating responses based on the user's input.\n\nThe main loop of the chatbot starts by getting the user's question and continues until the user types 'exit'. Inside the loop, the code updates the spinner text to 'Thinking...' and calls the `chain` object with the user's question and chat history. The response is then displayed in Markdown format using the `marked` library.\n\nExample usage of the chatbot interface:\n\n```javascript\nquery(repoConfig, userConfig);\n```\n\nThis chatbot interface can be used in the larger Autodoc project to help users navigate and understand the codebase more efficiently by providing a conversational interface for asking questions and receiving answers.","metadata":{"source":".autodoc/docs/markdown/src/cli/commands/query/summary.md"}}],["13",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands)\n\nThe code in the `src/cli/commands` folder is responsible for handling various command-line tasks in the Autodoc project. It contains several subfolders, each dedicated to a specific command or functionality, such as estimating costs, processing repositories, initializing the project, querying the chatbot, and managing user configurations.\n\nFor instance, the `estimate` subfolder contains a function that allows users to estimate the cost of indexing a given repository before actually processing it. This function takes an `AutodocRepoConfig` object as input and performs a dry run of the `processRepository` function. It then calculates the total estimated cost and displays it to the user. This helps users make informed decisions about whether to proceed with the indexing process or not.\n\n```javascript\nimport { estimate } from './autodoc/estimate';\n\nconst config = {\n // ...configuration options...\n};\n\nestimate(config);\n```\n\nThe `index` subfolder contains code for processing a given code repository, generating documentation in JSON and Markdown formats, and creating vector files for the documentation. It provides several functions and utilities to achieve these tasks, such as traversing the file system, calling language models, and converting JSON files to Markdown.\n\n```javascript\nimport autodoc from './autodoc';\n\nconst config = {\n // ...configuration options...\n};\n\nautodoc.index(config);\n```\n\nThe `init` subfolder is responsible for initializing and configuring the `autodoc` project. It provides an essential function called `init` that creates a configuration file named `autodoc.config.json` with user inputs and default values.\n\n```javascript\nimport { init } from './autodoc';\n\n(async () => {\n await init();\n})();\n```\n\nThe `query` subfolder contains code for creating a chatbot interface that allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation.\n\n```javascript\nquery(repoConfig, userConfig);\n```\n\nThe `user` subfolder is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs).\n\n```typescript\nasync function user(): Promise {\n // ...\n}\n```\n\nIn summary, the code in the `src/cli/commands` folder plays a crucial role in the Autodoc project by providing various command-line functionalities, such as estimating costs, processing repositories, initializing the project, querying the chatbot, and managing user configurations. These functionalities help developers to easily generate and maintain documentation for their projects, making it more accessible and understandable for other developers and users.","metadata":{"source":".autodoc/docs/markdown/src/cli/commands/summary.md"}}],["14",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/user/index.ts)\n\nThis code is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3.5 Turbo, GPT-4 8K, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the specified LLMs or default to GPT-3.5 Turbo if none are provided. This function is used to generate the initial configuration object for the user.\n\nThe `user` function is an asynchronous function that handles the user configuration process. It first checks if a user configuration file already exists. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user decides to continue or if no configuration file exists, the function proceeds to create the necessary directories for the configuration file. It then prompts the user to select the LLMs they have access to using the `inquirer` library. The user can choose from three options:\n\n1. GPT-3.5 Turbo\n2. GPT-3.5 Turbo, GPT-4 8K (Early Access)\n3. GPT-3.5 Turbo, GPT-4 8K (Early Access), GPT-4 32K (Early Access)\n\nAfter the user makes their selection, the new configuration object is created using the `makeConfigTemplate` function with the selected LLMs. The configuration object is then saved to the user configuration file in JSON format.\n\nFinally, the user is informed that the configuration has been saved and they can start querying by running the `doc q` command.\n## Questions: \n 1. **Question:** What is the purpose of the `makeConfigTemplate` function and what does it return?\n **Answer:** The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc user. It takes an optional `config` parameter of type `AutodocUserConfig` and returns a new configuration object with the `llms` property set to the provided value or a default value of `[LLMModels.GPT3]`.\n\n2. **Question:** How does the `user` function handle existing user configuration files?\n **Answer:** The `user` function checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the function prompts the user with a confirmation message to overwrite the existing configuration. If the user chooses not to overwrite, the process exits; otherwise, the function proceeds to create a new configuration.\n\n3. **Question:** What are the available choices for the LLMs in the `user` function, and how are they used to create the new configuration?\n **Answer:** The available choices for LLMs are GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The user can select one of these options, and the corresponding LLM models will be set as the value of the `llms` property in the new configuration object.","metadata":{"source":".autodoc/docs/markdown/src/cli/commands/user/index.md"}}],["15",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/user)\n\nThe `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3.5 Turbo, GPT-4 8K, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the specified LLMs or default to GPT-3.5 Turbo if none are provided. This function is used to generate the initial configuration object for the user.\n\n```typescript\nfunction makeConfigTemplate(llms: string[]): ConfigTemplate {\n // ...\n}\n```\n\nThe `user` function is an asynchronous function that handles the user configuration process. It first checks if a user configuration file already exists. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\n```typescript\nasync function user(): Promise {\n // ...\n}\n```\n\nIf the user decides to continue or if no configuration file exists, the function proceeds to create the necessary directories for the configuration file. It then prompts the user to select the LLMs they have access to using the `inquirer` library. The user can choose from three options:\n\n1. GPT-3.5 Turbo\n2. GPT-3.5 Turbo, GPT-4 8K (Early Access)\n3. GPT-3.5 Turbo, GPT-4 8K (Early Access), GPT-4 32K (Early Access)\n\nAfter the user makes their selection, the new configuration object is created using the `makeConfigTemplate` function with the selected LLMs. The configuration object is then saved to the user configuration file in JSON format.\n\n```typescript\nconst configTemplate = makeConfigTemplate(selectedLLMs);\nawait fs.promises.writeFile(configPath, JSON.stringify(configTemplate, null, 2));\n```\n\nFinally, the user is informed that the configuration has been saved and they can start querying by running the `doc q` command.\n\nThis code is essential for setting up the user's environment and preferences for the Autodoc project. It ensures that the user has the correct configuration file in place, which is necessary for the proper functioning of the project. The user configuration file is used by other parts of the project to determine which LLMs the user has access to and can query.\n\nFor example, when a user runs the `doc q` command, the project will read the user configuration file to determine which LLMs are available for querying. This ensures that the user only queries the LLMs they have access to, preventing any unauthorized access or usage.\n\nIn summary, the `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project, ensuring that the user has the correct configuration file in place, and allowing the user to select the LLMs they have access to. This is essential for the proper functioning of the project and for maintaining the user's preferences and access to different LLMs.","metadata":{"source":".autodoc/docs/markdown/src/cli/commands/user/summary.md"}}],["16",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/spinner.ts)\n\nThis code provides a utility for managing a command-line spinner using the `ora` library. The spinner is a visual indicator that displays a series of characters in a loop, giving the user feedback that a process is running in the background. The code exports several functions to control the spinner's behavior, such as updating the text, stopping the spinner, and displaying success, error, or informational messages.\n\nThe `spinner` object is created as a singleton to ensure that there is only one instance of the spinner at any given time. This prevents multiple spinners from being displayed simultaneously, which could cause confusion for the user. The spinner is configured to use the 'dots' style.\n\nThe `updateSpinnerText` function is used to update the spinner's text. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the given message. For example:\n\n```javascript\nupdateSpinnerText('Loading data...');\n```\n\nThe `stopSpinner` function stops the spinner if it is currently spinning:\n\n```javascript\nstopSpinner();\n```\n\nThe `spinnerError`, `spinnerSuccess`, and `spinnerInfo` functions are used to display error, success, and informational messages, respectively. These functions first check if the spinner is spinning and then call the appropriate `ora` method to display the message with the corresponding status symbol (e.g., a red cross for errors, a green checkmark for success, etc.):\n\n```javascript\nspinnerError('An error occurred');\nspinnerSuccess('Operation completed successfully');\nspinnerInfo('Please wait...');\n```\n\nIn the larger project, this utility can be used to provide a consistent and user-friendly interface for displaying progress and status messages during long-running tasks or processes.\n## Questions: \n 1. **What is the purpose of the `ora` package in this code?**\n\n The `ora` package is used to create a spinner in the terminal, providing a visual indication of a running process. In this code, it is used to create a singleton spinner with the 'dots' style.\n\n2. **What are the different states of the spinner and how are they updated?**\n\n The spinner can have different states such as spinning, stopped, failed, succeeded, and displaying information. The functions `updateSpinnerText`, `stopSpinner`, `spinnerError`, `spinnerSuccess`, and `spinnerInfo` are used to update the spinner's state and text accordingly.\n\n3. **How does the `updateSpinnerText` function work and when should it be used?**\n\n The `updateSpinnerText` function updates the spinner's text with the provided message. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the new message. This function should be used when you want to change the spinner's text while it is spinning or start it with a new message.","metadata":{"source":".autodoc/docs/markdown/src/cli/spinner.md"}}],["17",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli)\n\nThe `spinner.ts` file in the `.autodoc/docs/json/src/cli` folder provides a utility for managing a command-line spinner using the `ora` library. The spinner is a visual indicator that displays a series of characters in a loop, giving the user feedback that a process is running in the background. The code exports several functions to control the spinner's behavior, such as updating the text, stopping the spinner, and displaying success, error, or informational messages.\n\nThe `spinner` object is created as a singleton to ensure that there is only one instance of the spinner at any given time. This prevents multiple spinners from being displayed simultaneously, which could cause confusion for the user. The spinner is configured to use the 'dots' style.\n\nThe `updateSpinnerText` function is used to update the spinner's text. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the given message. For example:\n\n```javascript\nupdateSpinnerText('Loading data...');\n```\n\nThe `stopSpinner` function stops the spinner if it is currently spinning:\n\n```javascript\nstopSpinner();\n```\n\nThe `spinnerError`, `spinnerSuccess`, and `spinnerInfo` functions are used to display error, success, and informational messages, respectively. These functions first check if the spinner is spinning and then call the appropriate `ora` method to display the message with the corresponding status symbol (e.g., a red cross for errors, a green checkmark for success, etc.):\n\n```javascript\nspinnerError('An error occurred');\nspinnerSuccess('Operation completed successfully');\nspinnerInfo('Please wait...');\n```\n\nIn the larger project, this utility can be used to provide a consistent and user-friendly interface for displaying progress and status messages during long-running tasks or processes.","metadata":{"source":".autodoc/docs/markdown/src/cli/summary.md"}}],["18",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/utils/APIRateLimit.ts)\n\nThe `APIRateLimit` class in this code snippet is designed to manage and limit the number of concurrent API calls made by the application. This is useful in situations where the API being called has a rate limit or when the application needs to control the number of simultaneous requests to avoid overloading the server.\n\nThe class has a constructor that takes an optional `maxConcurrentCalls` parameter, which defaults to 50. This parameter determines the maximum number of API calls that can be made concurrently.\n\nThe main method of this class is `callApi(apiFunction: () => Promise): Promise`. This method takes a function `apiFunction` that returns a promise and wraps it in a rate-limited execution. The method returns a promise that resolves with the result of the API call or rejects with an error if the call fails.\n\nWhen `callApi` is called, it adds the `executeCall` function to the `queue`. The `executeCall` function is responsible for executing the API call, resolving or rejecting the promise, and managing the `inProgress` counter. After adding the `executeCall` function to the queue, the code checks if there are available slots for concurrent calls by comparing `inProgress` with `maxConcurrentCalls`. If there are available slots, it calls the `dequeueAndExecute` method.\n\nThe `dequeueAndExecute` method is responsible for executing the queued API calls while ensuring that the number of concurrent calls does not exceed the `maxConcurrentCalls` limit. It dequeues the next API call from the queue and executes it if there are available slots for concurrent calls.\n\nHere's an example of how this class can be used in the larger project:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\n\nasync function fetchData(id) {\n // Simulate an API call\n return new Promise((resolve) => setTimeout(() => resolve(`Data for ${id}`), 1000));\n}\n\nasync function getData(id) {\n return apiRateLimiter.callApi(() => fetchData(id));\n}\n\n// Usage\ngetData(1).then(console.log); // Fetches data for ID 1, rate-limited\n```\n\nIn this example, the `APIRateLimit` class is used to limit the number of concurrent calls to the `fetchData` function, which simulates an API call.\n## Questions: \n 1. **What is the purpose of the `APIRateLimit` class?**\n\n The `APIRateLimit` class is designed to manage and limit the number of concurrent API calls to a specified maximum, preventing the application from overwhelming the API with too many requests at once.\n\n2. **How does the `callApi` method work and what is its return type?**\n\n The `callApi` method takes an `apiFunction` as an argument, which is a function that returns a Promise. It adds the API call to a queue and manages the execution of queued calls based on the available slots for concurrent calls. The method returns a Promise of type `T`, where `T` is the expected return type of the `apiFunction`.\n\n3. **How does the `dequeueAndExecute` method work?**\n\n The `dequeueAndExecute` method is responsible for executing the queued API calls. It checks if there are any calls in the queue and if there are available slots for concurrent calls. If both conditions are met, it dequeues the next call from the queue and executes it. This method is called whenever a new API call is added to the queue or when an in-progress call is completed.","metadata":{"source":".autodoc/docs/markdown/src/cli/utils/APIRateLimit.md"}}],["19",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/utils/FileUtil.ts)\n\nThis code provides utility functions for handling file and folder paths in the autodoc project. The main purpose of these functions is to generate file names and GitHub URLs for files and folders.\n\n1. `getFileName(input: string, delimiter = '.', extension = '.md'): string`: This function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns a new file name with the given extension. If the delimiter is not found in the input string, the function appends the extension to the input string. If the delimiter is found, the function replaces the part after the last delimiter with the extension. For example:\n\n ```javascript\n getFileName(\"example.txt\"); // returns \"example.md\"\n getFileName(\"example\"); // returns \"example.md\"\n ```\n\n2. `githubFileUrl(githubRoot: string, inputRoot: string, filePath: string, linkHosted: boolean): string`: This function generates a GitHub URL for a file. It takes the GitHub root URL, the input root path, the file path, and a boolean flag `linkHosted`. If `linkHosted` is true, the function returns a URL pointing to the hosted version of the file. If `linkHosted` is false, the function returns a URL pointing to the file in the GitHub repository. For example:\n\n ```javascript\n githubFileUrl(\"https://github.com/user/repo\", \"/input\", \"/input/example.md\", true); // returns \"https://github.com/user/repo/example.md\"\n githubFileUrl(\"https://github.com/user/repo\", \"/input\", \"/input/example.md\", false); // returns \"https://github.com/user/repo/blob/master/example.md\"\n ```\n\n3. `githubFolderUrl(githubRoot: string, inputRoot: string, folderPath: string, linkHosted: boolean): string`: This function is similar to `githubFileUrl`, but it generates a GitHub URL for a folder instead of a file. If `linkHosted` is true, the function returns a URL pointing to the hosted version of the folder. If `linkHosted` is false, the function returns a URL pointing to the folder in the GitHub repository. For example:\n\n ```javascript\n githubFolderUrl(\"https://github.com/user/repo\", \"/input\", \"/input/folder\", true); // returns \"https://github.com/user/repo/folder\"\n githubFolderUrl(\"https://github.com/user/repo\", \"/input\", \"/input/folder\", false); // returns \"https://github.com/user/repo/tree/master/folder\"\n ```\n\nThese utility functions can be used in the autodoc project to generate file names and URLs for documentation files and folders, making it easier to manage and navigate the documentation structure.\n## Questions: \n 1. **What does the `getFileName` function do?**\n\n The `getFileName` function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns the input string with the specified extension, replacing the part after the last occurrence of the delimiter if it exists.\n\n2. **What is the purpose of the `githubFileUrl` and `githubFolderUrl` functions?**\n\n Both `githubFileUrl` and `githubFolderUrl` functions are used to generate URLs for files and folders, respectively, in a GitHub repository. They take a `githubRoot`, `inputRoot`, a `filePath` or `folderPath`, and a `linkHosted` boolean flag. If `linkHosted` is true, the generated URL will point to the hosted version of the file or folder; otherwise, it will point to the file or folder in the GitHub repository.\n\n3. **Why is the `inputRoot.length - 1` used in the `substring` method for both `githubFileUrl` and `githubFolderUrl` functions?**\n\n The `inputRoot.length - 1` is used to remove the `inputRoot` part from the `filePath` or `folderPath` when generating the final URL. This ensures that the generated URL only contains the relevant path relative to the GitHub repository root.","metadata":{"source":".autodoc/docs/markdown/src/cli/utils/FileUtil.md"}}],["20",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/utils/LLMUtil.ts)\n\nThis code defines and manages different language models (LLMs) and their associated costs for a project. It imports the `OpenAIChat` class from the `langchain/llms` module and the `LLMModelDetails` and `LLMModels` types from the `../../types.js` file.\n\nThe `models` object contains three LLMs: GPT3, GPT4, and GPT432k. Each model has a set of properties, such as `name`, `inputCostPer1KTokens`, `outputCostPer1KTokens`, `maxLength`, and an instance of `OpenAIChat` with specific configurations. The `inputTokens`, `outputTokens`, `succeeded`, `failed`, and `total` properties are initialized to 0.\n\n```javascript\n{\n name: LLMModels.GPT3,\n inputCostPer1KTokens: 0.002,\n outputCostPer1KTokens: 0.002,\n maxLength: 3050,\n llm: new OpenAIChat({ ... }),\n inputTokens: 0,\n outputTokens: 0,\n succeeded: 0,\n failed: 0,\n total: 0,\n}\n```\n\nThe `printModelDetails` function takes an array of `LLMModelDetails` and prints a summary table to the console. It calculates the total cost for each model based on the number of input and output tokens and their respective costs per 1,000 tokens. It also calculates the total file count, succeeded, failed, tokens, and cost across all models.\n\nThe `totalIndexCostEstimate` function calculates the total cost for all models in the input array. It uses the same cost calculation as in `printModelDetails` but returns the total cost as a number.\n\nThese functions can be used in the larger project to manage and analyze the usage and costs of different language models. For example, the `printModelDetails` function can provide a summary of the project's LLM usage, while the `totalIndexCostEstimate` function can help estimate the overall cost of using these models.\n## Questions: \n 1. **Question**: What is the purpose of the `models` object and what are the different models available?\n **Answer**: The `models` object is a record that maps the available LLMModels (GPT3, GPT4, and GPT432k) to their respective details, such as name, input and output costs, maxLength, and an instance of OpenAIChat with the corresponding model.\n\n2. **Question**: How does the `printModelDetails` function work and what information does it display?\n **Answer**: The `printModelDetails` function takes an array of LLMModelDetails and generates an output object containing the model name, file count, succeeded, failed, tokens, and cost. It then calculates the totals for each property and displays the information in a console table.\n\n3. **Question**: What is the purpose of the `totalIndexCostEstimate` function and how does it calculate the total cost?\n **Answer**: The `totalIndexCostEstimate` function calculates the total cost of indexing the given models by iterating through the models array and summing up the input and output costs per 1K tokens for each model.","metadata":{"source":".autodoc/docs/markdown/src/cli/utils/LLMUtil.md"}}],["21",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/utils/WaitUtil.ts)\n\nThe code in this file provides two utility functions, `wait` and `forTrue`, which are designed to help manage asynchronous operations in the larger project. Both functions return a `Promise`, which is a JavaScript object that represents the eventual completion (or failure) of an asynchronous operation and its resulting value.\n\n### wait function\n\nThe `wait` function takes two arguments: `timeoutMs`, which is the number of milliseconds to wait before resolving the promise, and `value`, which is an optional value to be returned when the promise resolves. The function creates a new `Promise` and uses `setTimeout` to resolve it with the given `value` after the specified `timeoutMs` has passed.\n\nExample usage:\n\n```javascript\n// Wait for 2 seconds and then log \"Hello, world!\"\nwait(2000, \"Hello, world!\").then(console.log);\n```\n\n### forTrue function\n\nThe `forTrue` function takes a single argument, `fn`, which is a function that returns a boolean value. The purpose of this function is to repeatedly check if the given function `fn` returns `true`. If it does, the promise resolves with `true`. If the function does not return `true` after 200 checks, the promise is rejected.\n\nThe function uses `setInterval` to repeatedly call the given function `fn` every 50 milliseconds. If `fn` returns `true`, the interval is cleared, and the promise is resolved. If the function has been called 200 times without returning `true`, the promise is rejected.\n\nExample usage:\n\n```javascript\n// Check if a certain element is visible on the page\nconst isElementVisible = () => document.querySelector(\"#my-element\").offsetParent !== null;\n\n// Wait for the element to become visible, then log \"Element is visible!\"\nforTrue(isElementVisible).then(() => console.log(\"Element is visible!\"));\n```\n\nIn summary, these utility functions help manage asynchronous operations by providing a way to wait for a certain amount of time or for a specific condition to be met. They can be used in various parts of the larger project to handle timing and conditional logic in an asynchronous manner.\n## Questions: \n 1. **What is the purpose of the `wait` function?**\n\n The `wait` function is an asynchronous utility function that resolves a promise after a specified timeout in milliseconds. It can be used to introduce a delay in the execution of asynchronous code.\n\n2. **How does the `forTrue` function work and what is its use case?**\n\n The `forTrue` function takes a function `fn` as an argument, which returns a boolean value. It repeatedly checks the result of `fn` every 50 milliseconds until it returns `true` or the maximum number of checks (200) is reached. This function can be used to wait for a specific condition to be met before proceeding with the execution of asynchronous code.\n\n3. **Is there any error handling or customization for the `forTrue` function, such as customizing the interval or maximum number of checks?**\n\n Currently, there is no error handling or customization options for the `forTrue` function. The interval is hardcoded to 50 milliseconds, and the maximum number of checks is hardcoded to 200. To add customization, additional parameters could be added to the function signature and used in the implementation.","metadata":{"source":".autodoc/docs/markdown/src/cli/utils/WaitUtil.md"}}],["22",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/utils)\n\nThe code in the `.autodoc/docs/json/src/cli/utils` folder provides utility functions and classes that help manage various aspects of the autodoc project, such as rate-limiting API calls, handling file and folder paths, managing language models, and traversing file systems.\n\n`APIRateLimit.ts` contains the `APIRateLimit` class, which is designed to manage and limit the number of concurrent API calls made by the application. This is useful when the API being called has a rate limit or when the application needs to control the number of simultaneous requests to avoid overloading the server. For example:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\nasync function getData(id) {\n return apiRateLimiter.callApi(() => fetchData(id));\n}\ngetData(1).then(console.log); // Fetches data for ID 1, rate-limited\n```\n\n`FileUtil.ts` provides utility functions for handling file and folder paths, such as generating file names and GitHub URLs for files and folders. These functions can be used to manage and navigate the documentation structure. For example:\n\n```javascript\ngetFileName(\"example.txt\"); // returns \"example.md\"\ngithubFileUrl(\"https://github.com/user/repo\", \"/input\", \"/input/example.md\", true); // returns \"https://github.com/user/repo/example.md\"\n```\n\n`LLMUtil.ts` defines and manages different language models (LLMs) and their associated costs for a project. It provides functions like `printModelDetails` and `totalIndexCostEstimate` to manage and analyze the usage and costs of different language models. For example, the `printModelDetails` function can provide a summary of the project's LLM usage, while the `totalIndexCostEstimate` function can help estimate the overall cost of using these models.\n\n`WaitUtil.ts` provides two utility functions, `wait` and `forTrue`, which help manage asynchronous operations in the larger project. They can be used in various parts of the project to handle timing and conditional logic in an asynchronous manner. For example:\n\n```javascript\nwait(2000, \"Hello, world!\").then(console.log); // Waits for 2 seconds and then logs \"Hello, world!\"\nforTrue(isElementVisible).then(() => console.log(\"Element is visible!\")); // Waits for an element to become visible, then logs \"Element is visible!\"\n```\n\n`traverseFileSystem.ts` contains the `traverseFileSystem` function, which recursively traverses a given file system, processes folders and files, and filters out ignored files based on provided patterns. It is designed to be used for processing and generating documentation for a given project. For example:\n\n```javascript\nconst params = {\n inputPath: './myProject',\n projectName: 'My Project',\n ignore: ['node_modules/**', '.git/**'],\n processFile: async (fileInfo) => {\n // Process the file, e.g., generate documentation\n },\n processFolder: async (folderInfo) => {\n // Process the folder, e.g., create a folder in the output directory\n },\n};\ntraverseFileSystem(params);\n```\n\nIn summary, the code in this folder provides various utility functions and classes that help manage different aspects of the autodoc project, making it easier to handle tasks such as rate-limiting, file and folder management, language model management, asynchronous operations, and file system traversal.","metadata":{"source":".autodoc/docs/markdown/src/cli/utils/summary.md"}}],["23",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/utils/traverseFileSystem.ts)\n\nThe `traverseFileSystem` function in this code is an asynchronous function that recursively traverses a given file system, processes folders and files, and filters out ignored files based on provided patterns. It is designed to be used in the larger project for processing and generating documentation for a given project.\n\nThe function takes an object of type `TraverseFileSystemParams` as its input, which contains the following properties:\n\n- `inputPath`: The root folder path to start traversing.\n- `projectName`: The name of the project being documented.\n- `processFile`: An optional callback function to process files.\n- `processFolder`: An optional callback function to process folders.\n- `ignore`: An array of patterns to ignore files and folders.\n- `filePrompt`: An optional prompt for processing files.\n- `folderPrompt`: An optional prompt for processing folders.\n- `contentType`: The type of content being processed.\n- `targetAudience`: The target audience for the documentation.\n- `linkHosted`: A flag indicating if the documentation should be linked to a hosted version.\n\nThe function first checks if the provided `inputPath` exists. If not, it logs an error message and returns. It then defines a helper function `shouldIgnore` that checks if a given file or folder should be ignored based on the `ignore` patterns.\n\nThe main logic of the function is implemented in the `dfs` (depth-first search) function, which recursively traverses the file system. It reads the contents of the current folder, filters out ignored files and folders, and processes them accordingly. If an entry is a directory, it calls `dfs` recursively and then calls the `processFolder` callback if provided. If an entry is a file and is a text file, it calls the `processFile` callback if provided.\n\nHere's an example of how this function might be used in the larger project:\n\n```javascript\nimport { traverseFileSystem } from './autodoc';\n\nconst params = {\n inputPath: './myProject',\n projectName: 'My Project',\n ignore: ['node_modules/**', '.git/**'],\n processFile: async (fileInfo) => {\n // Process the file, e.g., generate documentation\n },\n processFolder: async (folderInfo) => {\n // Process the folder, e.g., create a folder in the output directory\n },\n};\n\ntraverseFileSystem(params);\n```\n\nThis example would traverse the `myProject` folder, ignoring any files and folders within `node_modules` and `.git`, and process the remaining files and folders using the provided callback functions.\n## Questions: \n 1. **What is the purpose of the `traverseFileSystem` function?**\n\n The `traverseFileSystem` function is an asynchronous function that traverses a given file system, processes files and folders based on the provided parameters, and ignores files and folders that match the specified ignore patterns.\n\n2. **How does the `shouldIgnore` function work?**\n\n The `shouldIgnore` function takes a file or folder name as input and returns a boolean value indicating whether the file or folder should be ignored based on the provided ignore patterns. It uses the `minimatch` library to check if the file or folder name matches any of the ignore patterns.\n\n3. **What is the role of the `dfs` function inside `traverseFileSystem`?**\n\n The `dfs` function is an asynchronous function that performs a depth-first search on the file system starting from the given `currentPath`. It processes folders and files based on the provided parameters and recursively calls itself for each subdirectory.","metadata":{"source":".autodoc/docs/markdown/src/cli/utils/traverseFileSystem.md"}}],["24",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/const.ts)\n\nThe code in this file is responsible for managing the user configuration file for the Autodoc project. It imports two Node.js built-in modules, `path` and `os`, which are used to handle file paths and operating system-related utility functions, respectively.\n\nThe `userConfigFileName` constant is defined as `'autodoc.user.json'`. This constant represents the name of the user configuration file that will be used by the Autodoc project.\n\nThe `userConfigFilePath` constant is created using the `path.resolve()` function, which resolves a sequence of paths into an absolute path. It takes three arguments:\n\n1. `os.homedir()`: This function returns the current user's home directory. It ensures that the user configuration file is stored in the user's home directory, making it user-specific.\n2. `'./.config/autodoc/'`: This string specifies the subdirectory within the user's home directory where the configuration file will be stored. The `.config` directory is a common location for storing configuration files on Unix-based systems, and the `autodoc` subdirectory is used to keep the Autodoc configuration files organized.\n3. `userConfigFileName`: This constant is used as the file name for the user configuration file.\n\nThe `userConfigFilePath` constant will store the absolute path to the user configuration file, which can be used by other parts of the Autodoc project to read or write user-specific settings.\n\nIn summary, this code is responsible for defining the location and name of the user configuration file for the Autodoc project. It ensures that the configuration file is stored in a user-specific directory and follows a standard naming convention. This allows the Autodoc project to easily manage user-specific settings and preferences.\n## Questions: \n 1. **What is the purpose of the `userConfigFileName` and `userConfigFilePath` constants?**\n\n The `userConfigFileName` constant defines the name of the user configuration file for the autodoc project, while the `userConfigFilePath` constant defines the absolute path to this file, which is located in the user's home directory under the `.config/autodoc/` folder.\n\n2. **Why are the `node:path` and `node:os` modules imported?**\n\n The `node:path` module is imported to provide utilities for working with file and directory paths, such as the `path.resolve()` function used to construct the `userConfigFilePath`. The `node:os` module is imported to provide operating system-related utility methods, such as `os.homedir()` which returns the current user's home directory.\n\n3. **Is this code compatible with different operating systems?**\n\n Yes, this code is compatible with different operating systems. The `os.homedir()` function from the `node:os` module returns the correct home directory path for the current user, regardless of the operating system. Additionally, the `path.resolve()` function from the `node:path` module handles path separators and other OS-specific details, ensuring the correct file path is generated.","metadata":{"source":".autodoc/docs/markdown/src/const.md"}}],["25",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/index.ts)\n\nThe code is a CLI (Command Line Interface) tool for the Autodoc project, which helps in generating documentation for a codebase. It uses the `commander` package to define and manage commands, and `inquirer` for interactive prompts. The main commands supported are `init`, `estimate`, `index`, `user`, and `q`.\n\n1. `init`: Initializes the repository by creating an `autodoc.config.json` file in the current directory. If the file already exists, it uses the existing configuration.\n ```bash\n autodoc init\n ```\n\n2. `estimate`: Estimates the cost of running the `index` command on the repository. It requires the `autodoc.config.json` file to be present.\n ```bash\n autodoc estimate\n ```\n\n3. `index`: Traverses the codebase, writes documentation using LLM (Language Model), and creates a locally stored index. It prompts the user to confirm before starting the indexing process.\n ```bash\n autodoc index\n ```\n\n4. `user`: Sets the Autodoc user configuration. If a user configuration file exists, it uses the existing configuration; otherwise, it creates a new one.\n ```bash\n autodoc user\n ```\n\n5. `q`: Queries an Autodoc index. It requires both `autodoc.config.json` and user configuration files to be present.\n ```bash\n autodoc q\n ```\n\nThe code also handles unhandled promise rejections by logging the error stack, showing an error spinner, stopping the spinner, and exiting with an error code.\n\nOverall, this CLI tool simplifies the process of generating documentation for a codebase by providing an easy-to-use interface for managing configurations and running the Autodoc project's core functionalities.\n## Questions: \n 1. **Question:** What is the purpose of the `autodoc.config.json` file and how is it used in the code?\n **Answer:** The `autodoc.config.json` file is used to store the configuration for the Autodoc repository. It is read and parsed in various commands like `init`, `estimate`, `index`, and `q` to provide the necessary configuration for each command's execution.\n\n2. **Question:** How does the `estimate` command work and what does it do?\n **Answer:** The `estimate` command reads the `autodoc.config.json` file, parses it into a configuration object, and then calls the `estimate` function with the configuration. The purpose of this command is to estimate the cost of running the `index` command on the repository.\n\n3. **Question:** What is the purpose of the `user` command and how does it handle user configuration?\n **Answer:** The `user` command is used to set the Autodoc user configuration. It reads the user configuration file specified by `userConfigFilePath`, parses it into a configuration object, and then calls the `user` function with the configuration. If the configuration file is not found, it calls the `user` function without any configuration, allowing the user to set up their configuration.","metadata":{"source":".autodoc/docs/markdown/src/index.md"}}],["26",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/langchain/hnswlib.ts)\n\nThe `HNSWLib` class in this code is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. It extends the `SaveableVectorStore` class and provides methods for adding documents, searching for similar documents, and saving/loading the index.\n\nThe constructor takes an `Embeddings` object and an `HNSWLibArgs` object as arguments. The `Embeddings` object is used to convert text documents into numerical vectors, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing document metadata.\n\nThe `addDocuments` method takes an array of `Document` objects, converts their text content into numerical vectors using the `Embeddings` object, and adds the vectors to the HNSW index. The `addVectors` method is responsible for initializing the index, resizing it if necessary, and adding the vectors and their corresponding metadata to the `InMemoryDocstore`.\n\nThe `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents in the index along with their similarity scores. It checks if the query vector has the correct dimensions and if `k` is within the valid range before performing the search.\n\nThe `save` and `load` methods allow the HNSW index and its associated metadata to be saved to and loaded from a specified directory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of text strings or `Document` objects, respectively.\n\nExample usage:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst hnswLib = await HNSWLib.fromTexts(texts, metadatas, embeddings);\n\nconst queryVector = await embeddings.embedText(\"example query\");\nconst similarDocuments = await hnswLib.similaritySearchVectorWithScore(queryVector, 5);\n```\n\nIn the larger project, this class can be used to efficiently store and search for similar documents based on their embeddings, which can be useful for tasks such as document clustering, nearest neighbor search, and recommendation systems.\n## Questions: \n 1. **Question:** What is the purpose of the `HNSWLib` class and how does it relate to the `SaveableVectorStore` class?\n **Answer:** The `HNSWLib` class is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. It extends the `SaveableVectorStore` class, which provides a base class for vector stores that can be saved and loaded from disk.\n\n2. **Question:** How does the `addDocuments` method work and what is its purpose?\n **Answer:** The `addDocuments` method takes an array of `Document` objects, extracts their `pageContent`, and embeds them into vectors using the `embedDocuments` method from the `embeddings` object. It then adds these vectors and the corresponding documents to the HNSW index and the `docstore` respectively.\n\n3. **Question:** How does the `similaritySearchVectorWithScore` method work and what does it return?\n **Answer:** The `similaritySearchVectorWithScore` method takes a query vector and a number `k` as input. It checks if the query vector has the same length as the number of dimensions and if `k` is not greater than the number of elements in the index. It then performs a k-nearest neighbors search on the HNSW index using the query vector and returns an array of `[Document, number]` tuples, where each tuple contains a document from the `docstore` and its corresponding distance score to the query vector.","metadata":{"source":".autodoc/docs/markdown/src/langchain/hnswlib.md"}}],["27",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/langchain)\n\nThe `hnswlib.ts` file in the `.autodoc/docs/json/src/langchain` folder contains the `HNSWLib` class, which is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. This class is designed to efficiently store and search for similar documents based on their embeddings, making it useful for tasks such as document clustering, nearest neighbor search, and recommendation systems.\n\nThe `HNSWLib` class extends the `SaveableVectorStore` class and provides methods for adding documents, searching for similar documents, and saving/loading the index. It takes an `Embeddings` object and an `HNSWLibArgs` object as arguments in its constructor. The `Embeddings` object is responsible for converting text documents into numerical vectors, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing document metadata.\n\nThe `addDocuments` method accepts an array of `Document` objects, converts their text content into numerical vectors using the `Embeddings` object, and adds the vectors to the HNSW index. The `addVectors` method initializes the index, resizes it if necessary, and adds the vectors and their corresponding metadata to the `InMemoryDocstore`.\n\nThe `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents in the index along with their similarity scores. It checks if the query vector has the correct dimensions and if `k` is within the valid range before performing the search.\n\nThe `save` and `load` methods allow the HNSW index and its associated metadata to be saved to and loaded from a specified directory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of text strings or `Document` objects, respectively.\n\nHere's an example of how this code might be used:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst hnswLib = await HNSWLib.fromTexts(texts, metadatas, embeddings);\n\nconst queryVector = await embeddings.embedText(\"example query\");\nconst similarDocuments = await hnswLib.similaritySearchVectorWithScore(queryVector, 5);\n```\n\nIn the larger project, the `HNSWLib` class can be integrated with other components to build efficient and scalable systems for document similarity search, clustering, and recommendations based on text embeddings.","metadata":{"source":".autodoc/docs/markdown/src/langchain/summary.md"}}],["28",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src)\n\nThe `.autodoc/docs/json/src` folder contains the core components of the Autodoc project, which aims to automatically generate documentation for a given code repository using OpenAI's language models (LLMs). The main files in this folder are `const.ts`, `index.ts`, and `types.ts`.\n\n`const.ts` manages the user configuration file for the Autodoc project. It defines the location and name of the user configuration file, ensuring that it is stored in a user-specific directory and follows a standard naming convention. This allows the Autodoc project to easily manage user-specific settings and preferences.\n\n`index.ts` is a CLI (Command Line Interface) tool for the Autodoc project, which simplifies the process of generating documentation for a codebase. It provides an easy-to-use interface for managing configurations and running the Autodoc project's core functionalities. The main commands supported are `init`, `estimate`, `index`, `user`, and `q`. For example:\n\n```bash\nautodoc init\nautodoc estimate\nautodoc index\nautodoc user\nautodoc q\n```\n\n`types.ts` defines the types and interfaces for the Autodoc project, providing the foundation for processing code repositories and generating documentation using OpenAI's language models. It includes types such as `AutodocUserConfig`, `AutodocRepoConfig`, `FileSummary`, `FolderSummary`, and more.\n\nThe `cli` subfolder contains the `spinner.ts` file, which provides a utility for managing a command-line spinner using the `ora` library. This utility can be used to provide a consistent and user-friendly interface for displaying progress and status messages during long-running tasks or processes. For example:\n\n```javascript\nupdateSpinnerText('Loading data...');\nstopSpinner();\nspinnerError('An error occurred');\nspinnerSuccess('Operation completed successfully');\nspinnerInfo('Please wait...');\n```\n\nThe `langchain` subfolder contains the `hnswlib.ts` file, which implements a vector store using the Hierarchical Navigable Small World (HNSW) algorithm. This class is designed to efficiently store and search for similar documents based on their embeddings, making it useful for tasks such as document clustering, nearest neighbor search, and recommendation systems. For example:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst hnswLib = await HNSWLib.fromTexts(texts, metadatas, embeddings);\n\nconst queryVector = await embeddings.embedText(\"example query\");\nconst similarDocuments = await hnswLib.similaritySearchVectorWithScore(queryVector, 5);\n```\n\nIn summary, the code in this folder provides the core components and utilities for the Autodoc project, enabling the automatic generation of documentation for code repositories using OpenAI's language models. The CLI tool simplifies the process, while the types and interfaces lay the foundation for processing and generating documentation. The additional utilities, such as the spinner and HNSWLib, enhance the user experience and provide efficient search capabilities.","metadata":{"source":".autodoc/docs/markdown/src/summary.md"}}],["29",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src/types.ts)\n\nThis code defines the types and interfaces for the `autodoc` project, which aims to automatically generate documentation for a given code repository. The project uses OpenAI's language models (LLMs) to process and generate summaries, questions, and other relevant information for files and folders within the repository.\n\nThe code starts by importing `OpenAIChat` from the `langchain/llms` package. It then defines several types and interfaces that are used throughout the project:\n\n- `AutodocUserConfig`: Represents the user configuration for the autodoc project, including the LLM models to be used.\n- `AutodocRepoConfig`: Represents the configuration for a specific repository, including its name, URL, root directory, output directory, LLM models, and other settings.\n- `FileSummary` and `FolderSummary`: Represent the summaries and questions generated for files and folders, respectively.\n- `ProcessFileParams`, `ProcessFolderParams`, and `TraverseFileSystemParams`: Define the parameters for processing files, folders, and traversing the file system, respectively.\n- `ProcessFile` and `ProcessFolder`: Define the function types for processing files and folders, respectively.\n- `LLMModels`: Enumerates the available LLM models, such as GPT-3.5-turbo, GPT-4, and GPT-4-32k.\n- `LLMModelDetails`: Represents the details of an LLM model, including its name, cost per 1K tokens, maximum length, and other statistics.\n\nFor example, when using this code in the larger project, you might define a `ProcessFile` function that takes a `ProcessFileParams` object as input and generates a summary and questions for the file using the specified LLM model. Similarly, you could define a `ProcessFolder` function that processes all files and subfolders within a folder, generating summaries and questions for each.\n\nThe `TraverseFileSystemParams` type allows you to configure how the file system is traversed, including specifying which files and folders to ignore, and what prompts to use for generating summaries and questions.\n\nOverall, this code provides the foundation for the `autodoc` project by defining the types and interfaces needed to process code repositories and generate documentation using OpenAI's language models.\n## Questions: \n 1. **Question:** What is the purpose of the `LLMModels` enum and how is it used in the code?\n **Answer:** The `LLMModels` enum defines the available language models for the autodoc project. It is used in the `AutodocUserConfig` and `AutodocRepoConfig` types to specify which language models should be used for processing files and folders.\n\n2. **Question:** What are the `ProcessFile` and `ProcessFolder` types and how are they used in the code?\n **Answer:** `ProcessFile` and `ProcessFolder` are types for functions that process a file or a folder, respectively. They are used as optional parameters in the `TraverseFileSystemParams` type, allowing developers to provide custom processing functions when traversing the file system.\n\n3. **Question:** What is the purpose of the `TraverseFileSystemParams` type and how is it used in the code?\n **Answer:** The `TraverseFileSystemParams` type defines the parameters required for traversing the file system. It is used to pass configuration options, such as input path, project name, custom processing functions, and other settings, to a function that will traverse the file system and process files and folders accordingly.","metadata":{"source":".autodoc/docs/markdown/src/types.md"}}],["30",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/tsconfig.json)\n\nThis code is a configuration file for the TypeScript compiler in a project. The purpose of this configuration is to define various options and settings that the TypeScript compiler should use when transpiling TypeScript code into JavaScript. This is important for ensuring that the compiled output is consistent and compatible with the intended runtime environment.\n\nHere's a brief explanation of the key options set in this configuration:\n\n- `\"rootDir\": \"src\"`: Specifies the root directory containing the TypeScript source files. This tells the compiler where to look for the input files.\n- `\"outDir\": \"dist\"`: Specifies the output directory for the compiled JavaScript files. This is where the transpiled code will be saved.\n- `\"strict\": true`: Enables strict type checking, which enforces stronger type safety and helps catch potential issues during development.\n- `\"target\": \"es2020\"`: Sets the target ECMAScript version for the compiled output. In this case, the output will be compatible with ECMAScript 2020 (ES11) features.\n- `\"module\": \"ES2020\"`: Specifies the module system to use in the compiled output. This setting is aligned with the target ECMAScript version.\n- `\"sourceMap\": true`: Generates source map files alongside the compiled output. This helps with debugging by mapping the compiled code back to the original TypeScript source.\n- `\"esModuleInterop\": true` and `\"allowSyntheticDefaultImports\": true`: These options enable better compatibility with different module systems and allow for more flexible import statements.\n- `\"moduleResolution\": \"node\"`: Sets the module resolution strategy to Node.js-style, which is the most common approach for resolving module imports in JavaScript projects.\n- `\"declaration\": true`: Generates TypeScript declaration files (`.d.ts`) alongside the compiled output. These files provide type information for the compiled code, which can be useful for other TypeScript projects that depend on this one.\n- `\"skipLibCheck\": true`: Skips type checking of declaration files, which can speed up the compilation process.\n\nIn the larger project, this configuration file ensures that the TypeScript compiler produces consistent and compatible JavaScript output, making it easier to integrate the compiled code with other parts of the project or with external dependencies.\n## Questions: \n 1. **What is the purpose of the `rootDir` and `outDir` options in the configuration?**\n\n The `rootDir` option specifies the root folder of the source files, while the `outDir` option specifies the output directory for the compiled files.\n\n2. **What does the `strict` option do in the configuration?**\n\n The `strict` option enables a set of strict type-checking options in the TypeScript compiler, ensuring a higher level of type safety in the code.\n\n3. **What is the significance of the `target` and `module` options in the configuration?**\n\n The `target` option sets the ECMAScript target version for the compiled JavaScript output, while the `module` option specifies the module system to be used in the generated code. In this case, both are set to \"es2020\", indicating that the output will be ECMAScript 2020 compliant.","metadata":{"source":".autodoc/docs/markdown/tsconfig.md"}}]] \ No newline at end of file +[["0",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\cli\\commands\\estimate\\index.ts)\n\nThe `estimate` function in this code is responsible for providing an estimated cost of processing a given repository using the Autodoc project. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe function starts by constructing the path to the JSON output directory, which will be used to store the intermediate results of the processing. It then updates the spinner text to indicate that the cost estimation is in progress.\n\nNext, the `processRepository` function is called with the provided configuration options and a `true` flag to indicate that this is a dry run. This means that the repository will not actually be processed, but the function will return the details of what would happen if it were processed. This is used to calculate the estimated cost of processing the repository.\n\nOnce the dry run is complete, the spinner is updated to show success, and the results are printed using the `printModelDetails` function. The total estimated cost is then calculated using the `totalIndexCostEstimate` function, which takes the values of the `runDetails` object as input.\n\nFinally, the estimated cost is displayed in the console using the `chalk.redBright` function to format the text in a red color. The message also includes a disclaimer that the actual cost may vary and recommends setting a limit in the user's OpenAI account to prevent unexpected charges.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './path/to/this/file';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example would estimate the cost of processing the \"my-repo\" repository with the specified configuration options.\n## Questions: \n 1. **What is the purpose of the `estimate` function?**\n\n The `estimate` function is used to perform a dry run of the `processRepository` command to get an estimated price for indexing the given repository. It then prints the model details and the total estimated cost.\n\n2. **What are the parameters passed to the `processRepository` function?**\n\n The `processRepository` function is called with an object containing the following properties: `name`, `repositoryUrl`, `root`, `output`, `llms`, `ignore`, `filePrompt`, `folderPrompt`, `chatPrompt`, `contentType`, `targetAudience`, and `linkHosted`. Additionally, a second argument `true` is passed to indicate that it's a dry run.\n\n3. **How is the total estimated cost calculated and displayed?**\n\n The total estimated cost is calculated using the `totalIndexCostEstimate` function, which takes an array of values from the `runDetails` object. The cost is then displayed using `console.log` with `chalk.redBright` for formatting, showing the cost with two decimal places and a note that the actual cost may vary.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\commands\\estimate\\index.md"}}],["1",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\estimate)\n\nThe `estimate` function in `index.ts` is a crucial part of the Autodoc project, as it provides an estimated cost of processing a given repository. It takes an `AutodocRepoConfig` object as input, containing various configuration options such as repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe function begins by constructing the path to the JSON output directory, which stores intermediate results of the processing. It then updates the spinner text to indicate that cost estimation is in progress. The `processRepository` function is called with the provided configuration options and a `true` flag, signifying a dry run. This dry run returns the details of what would happen if the repository were processed, which is used to calculate the estimated cost.\n\nUpon completion of the dry run, the spinner is updated to show success, and the results are printed using the `printModelDetails` function. The total estimated cost is calculated using the `totalIndexCostEstimate` function, which takes the values of the `runDetails` object as input.\n\nFinally, the estimated cost is displayed in the console using the `chalk.redBright` function to format the text in red. The message also includes a disclaimer that the actual cost may vary and recommends setting a limit in the user's OpenAI account to prevent unexpected charges.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './path/to/this/file';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example would estimate the cost of processing the \"my-repo\" repository with the specified configuration options.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\commands\\estimate\\summary.md"}}],["2",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\convertJsonToMarkdown.ts)\n\nThe `convertJsonToMarkdown` function in this code is responsible for converting JSON files containing documentation information into Markdown files. This function is part of the larger Autodoc project, which aims to automate the process of generating documentation for code repositories.\n\nThe function takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the project name, input and output directories, and other settings related to the documentation generation process.\n\nThe code first counts the number of files in the project by traversing the file system using the `traverseFileSystem` utility function. This is done to provide a progress update to the user via the `updateSpinnerText` function.\n\nNext, the `processFile` function is defined, which is responsible for reading the content of each JSON file, parsing it, and converting it into a Markdown format. The function checks if the file has a summary, and if so, it generates the Markdown content with a link to the code on GitHub, the summary, and any questions if present. The output Markdown file is then saved in the specified output directory.\n\nFinally, the `traverseFileSystem` function is called again, this time with the `processFile` function as an argument. This allows the code to process each JSON file in the project and convert it into a Markdown file. Once the process is complete, a success message is displayed to the user using the `spinnerSuccess` function.\n\nExample usage:\n\n```javascript\nconvertJsonToMarkdown({\n name: \"myProject\",\n root: \"./input\",\n output: \"./output\",\n filePrompt: true,\n folderPrompt: true,\n contentType: \"code\",\n targetAudience: \"developers\",\n linkHosted: \"https://github.com/user/myProject\",\n});\n```\n\nThis will convert all JSON files in the `./input` directory into Markdown files and save them in the `./output` directory.\n## Questions: \n 1. **Question:** What is the purpose of the `convertJsonToMarkdown` function and what are the expected inputs?\n **Answer:** The `convertJsonToMarkdown` function is used to convert JSON files to Markdown files for each code file in the project. It takes an `AutodocRepoConfig` object as input, which contains various properties like projectName, root, output, filePrompt, folderPrompt, contentType, targetAudience, and linkHosted.\n\n2. **Question:** How does the `traverseFileSystem` function work and what is its role in this code?\n **Answer:** The `traverseFileSystem` function is a utility function that recursively traverses the file system, starting from the inputPath, and processes each file using the provided `processFile` function. In this code, it is used twice: first to count the number of files in the project, and then to create Markdown files for each code file in the project.\n\n3. **Question:** How are the output directories and Markdown files created, and what is the structure of the generated Markdown content?\n **Answer:** The output directories are created using the `fs.mkdir` function with the `recursive: true` option. The Markdown files are created using the `fs.writeFile` function. The structure of the generated Markdown content includes a link to view the code on GitHub, the summary, and optionally, a list of questions if they exist.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\commands\\index\\convertJsonToMarkdown.md"}}],["3",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\createVectorStore.ts)\n\nThe code in this file is responsible for processing a directory of text files, splitting the text into chunks, and creating a vector store using the HNSWLib library and OpenAIEmbeddings. This vector store can be used for efficient similarity search and retrieval of documents in the larger project.\n\nThe `processFile` function reads a file's content and creates a `Document` object with the content and metadata (source file path). It returns a Promise that resolves to the created Document.\n\nThe `processDirectory` function is a recursive function that processes a directory and its subdirectories. It reads the files in the directory, and for each file, it checks if it's a directory or a regular file. If it's a directory, the function calls itself with the new directory path. If it's a file, it calls the `processFile` function to create a Document object. The function returns an array of Document objects.\n\nThe `RepoLoader` class extends the `BaseDocumentLoader` class and has a constructor that takes a file path as an argument. It has a `load` method that calls the `processDirectory` function with the given file path and returns the array of Document objects.\n\nThe `createVectorStore` function is an async function that takes an `AutodocRepoConfig` object as an argument, which contains the root directory and output file path. It creates a `RepoLoader` instance with the root directory and loads the documents using the `load` method. It then creates a `RecursiveCharacterTextSplitter` instance with a specified chunk size and chunk overlap and splits the documents into chunks. Finally, it creates a vector store using the HNSWLib library and OpenAIEmbeddings with the processed documents and saves the vector store to the output file path.\n\nExample usage:\n\n```javascript\nconst config = {\n root: './data/documents',\n output: './data/vector_store',\n};\n\ncreateVectorStore(config).then(() => {\n console.log('Vector store created successfully');\n});\n```\n## Questions: \n 1. **Question:** What is the purpose of the `processFile` function and what does it return?\n **Answer:** The `processFile` function is an asynchronous function that reads the content of a file given its file path, creates a `Document` object with the file contents and metadata (source file path), and returns a Promise that resolves to the created `Document` object.\n\n2. **Question:** How does the `processDirectory` function work and what does it return?\n **Answer:** The `processDirectory` function is an asynchronous function that takes a directory path as input, reads all the files and subdirectories within it, and processes them recursively. It returns a Promise that resolves to an array of `Document` objects created from the files in the directory and its subdirectories.\n\n3. **Question:** What is the purpose of the `createVectorStore` function and how does it work?\n **Answer:** The `createVectorStore` function is an asynchronous function that takes an `AutodocRepoConfig` object as input, which contains the root directory path and output file path. The function loads all the documents from the root directory using the `RepoLoader`, splits the text into chunks using the `RecursiveCharacterTextSplitter`, creates a vector store from the documents using the `HNSWLib` and `OpenAIEmbeddings`, and saves the vector store to the specified output file.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\commands\\index\\createVectorStore.md"}}],["4",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\index.ts)\n\nThe code in this file is responsible for processing a given repository and generating documentation in JSON, Markdown, and vector formats. It exports a single function `index` that takes an `AutodocRepoConfig` object as its argument, which contains various configuration options for processing the repository.\n\nThe `index` function performs three main tasks:\n\n1. **Process the repository**: It traverses the repository, calls the LLMS (Language Learning Management System) for each file, and creates JSON files with the results. This is done using the `processRepository` function, which takes the same configuration options as the `index` function. The JSON files are stored in the `output/docs/json/` directory.\n\n ```javascript\n updateSpinnerText('Processing repository...');\n await processRepository({ /* configuration options */ });\n spinnerSuccess();\n ```\n\n2. **Create Markdown files**: It converts the generated JSON files into Markdown files using the `convertJsonToMarkdown` function. This function also takes the same configuration options as the `index` function. The Markdown files are stored in the `output/docs/markdown/` directory.\n\n ```javascript\n updateSpinnerText('Creating markdown files...');\n await convertJsonToMarkdown({ /* configuration options */ });\n spinnerSuccess();\n ```\n\n3. **Create vector files**: It creates vector files from the generated Markdown files using the `createVectorStore` function. This function also takes the same configuration options as the `index` function. The vector files are stored in the `output/docs/data/` directory.\n\n ```javascript\n updateSpinnerText('Create vector files...');\n await createVectorStore({ /* configuration options */ });\n spinnerSuccess();\n ```\n\nThroughout the execution of these tasks, the code uses `updateSpinnerText` and `spinnerSuccess` functions to provide visual feedback on the progress of the tasks.\n\nIn the larger project, this code would be used to automatically generate documentation for a given repository based on the provided configuration options. The generated documentation can then be used for various purposes, such as displaying it on a website or analyzing the content for specific insights.\n## Questions: \n 1. **What does the `index` function do in this code?**\n\n The `index` function is the main entry point for the autodoc project. It takes an `AutodocRepoConfig` object as input and performs three main tasks: processing the repository and creating JSON files, converting JSON files to markdown files, and creating vector files.\n\n2. **What is the purpose of the `processRepository`, `convertJsonToMarkdown`, and `createVectorStore` functions?**\n\n The `processRepository` function traverses the repository, calls LLMS for each file, and creates JSON files with the results. The `convertJsonToMarkdown` function creates markdown files from the generated JSON files. The `createVectorStore` function creates vector files from the markdown files.\n\n3. **What are the different types of prompts (`filePrompt`, `folderPrompt`, `chatPrompt`) used for in this code?**\n\n These prompts are likely used to interact with the user during the processing of the repository. The `filePrompt` might be used to ask the user for input regarding specific files, the `folderPrompt` for input regarding folders, and the `chatPrompt` for general input or feedback during the processing.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\commands\\index\\index.md"}}],["5",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\processRepository.ts)\n\nThe `processRepository` function in this code is responsible for generating summaries and questions for code files and folders in a given repository. It takes an `AutodocRepoConfig` object as input, which contains information about the project, repository URL, input and output paths, language models, and other configurations. An optional `dryRun` parameter can be provided to skip actual API calls and file writing.\n\nThe function starts by initializing the encoding and rate limit for API calls. It then defines two main helper functions: `processFile` and `processFolder`. The `processFile` function is responsible for processing individual code files. It reads the file content, calculates a checksum, and checks if reindexing is needed. If reindexing is required, it creates prompts for summaries and questions, selects the appropriate language model based on the input length, and calls the language model API to generate the summaries and questions. The results are then saved to a JSON file in the output directory.\n\nThe `processFolder` function is responsible for processing folders. It reads the folder content, calculates a checksum, and checks if reindexing is needed. If reindexing is required, it reads the summaries and questions of all files and subfolders in the folder, calls the language model API to generate a summary for the folder, and saves the result to a `summary.json` file in the folder.\n\nThe main function then counts the number of files and folders in the project and processes them using the `traverseFileSystem` utility function. It processes all files first, followed by all folders. Finally, it returns the language model usage statistics.\n\nThe `calculateChecksum` function calculates the checksum of a list of file contents, while the `reindexCheck` function checks if reindexing is needed by comparing the new and old checksums of a file or folder.\n## Questions: \n 1. **Question:** What is the purpose of the `processRepository` function and what are its inputs and outputs?\n **Answer:** The `processRepository` function processes a given code repository, generating summaries and questions for each file and folder within the repository. It takes an `AutodocRepoConfig` object and an optional `dryRun` boolean as inputs. The function returns a `Promise` that resolves to an object containing the models used during processing.\n\n2. **Question:** How does the `calculateChecksum` function work and what is its purpose?\n **Answer:** The `calculateChecksum` function takes an array of file contents as input and calculates a checksum for each file using the MD5 hashing algorithm. It then concatenates all the checksums and calculates a final checksum using MD5 again. The purpose of this function is to generate a unique identifier for the contents of the files, which can be used to determine if the files have changed and need to be reprocessed.\n\n3. **Question:** How does the `reindexCheck` function work and when is it used?\n **Answer:** The `reindexCheck` function checks if a summary.json file exists in the given file or folder path and compares the stored checksum with the new checksum to determine if the file or folder needs to be reindexed. It is used in the `processFile` and `processFolder` functions to decide whether to regenerate summaries and questions for a file or folder based on changes in their contents.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\commands\\index\\processRepository.md"}}],["6",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\prompts.ts)\n\nThis code defines three utility functions that generate prompts for documentation experts working on a project. These functions are used to create documentation for code files and folders within a project. The generated prompts are in markdown format and include specific instructions for the documentation expert.\n\n1. `createCodeFileSummary`: This function generates a prompt for creating a summary of a code file. It takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `filePrompt`. The function returns a markdown formatted string that includes the file's content and a custom prompt for the documentation expert.\n\nExample usage:\n```javascript\nconst prompt = createCodeFileSummary('path/to/file.js', 'MyProject', 'const x = 10;', 'JavaScript', 'Write a detailed technical explanation of this code.');\n```\n\n2. `createCodeQuestions`: This function generates a prompt for creating a list of questions and answers about a code file. It takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `targetAudience`. The function returns a markdown formatted string that includes the file's content and a custom prompt for the documentation expert to provide questions and answers.\n\nExample usage:\n```javascript\nconst prompt = createCodeQuestions('path/to/file.js', 'MyProject', 'const x = 10;', 'JavaScript', 'beginner');\n```\n\n3. `folderSummaryPrompt`: This function generates a prompt for creating a summary of a folder containing code files and subfolders. It takes six parameters: `folderPath`, `projectName`, `files`, `folders`, `contentType`, and `folderPrompt`. The `files` parameter is an array of `FileSummary` objects, and the `folders` parameter is an array of `FolderSummary` objects. The function returns a markdown formatted string that includes a list of files and folders with their summaries and a custom prompt for the documentation expert.\n\nExample usage:\n```javascript\nconst prompt = folderSummaryPrompt('path/to/folder', 'MyProject', fileSummaries, folderSummaries, 'JavaScript', 'Write a detailed technical explanation of this folder structure.');\n```\n\nThese functions can be used in the larger project to generate documentation tasks for experts, ensuring consistent formatting and instructions across different parts of the project.\n## Questions: \n 1. **What is the purpose of the `createCodeFileSummary` function?**\n\n The `createCodeFileSummary` function generates a string template for a code file summary prompt, which includes the file path, project name, file contents, content type, and a file prompt.\n\n2. **How does the `createCodeQuestions` function differ from the `createCodeFileSummary` function?**\n\n The `createCodeQuestions` function generates a string template for a code documentation prompt that asks for 3 questions and their answers, while the `createCodeFileSummary` function generates a string template for a code file summary prompt.\n\n3. **What is the role of the `folderSummaryPrompt` function?**\n\n The `folderSummaryPrompt` function generates a string template for a folder summary prompt, which includes the folder path, project name, lists of files and folders with their summaries, content type, and a folder prompt.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\commands\\index\\prompts.md"}}],["7",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\index)\n\nThe code in this folder is responsible for processing a given repository and generating documentation in JSON, Markdown, and vector formats. It consists of several functions and utilities that work together to automate the documentation generation process.\n\nThe main function, `index`, takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository. It performs three main tasks:\n\n1. **Process the repository**: It calls the `processRepository` function to traverse the repository, generate summaries and questions for code files and folders using the LLMS (Language Learning Management System), and create JSON files with the results. These JSON files are stored in the `output/docs/json/` directory.\n\n2. **Create Markdown files**: It uses the `convertJsonToMarkdown` function to convert the generated JSON files into Markdown files. These Markdown files are stored in the `output/docs/markdown/` directory.\n\n3. **Create vector files**: It calls the `createVectorStore` function to create vector files from the generated Markdown files. These vector files are stored in the `output/docs/data/` directory.\n\nThroughout the execution of these tasks, the code provides visual feedback on the progress of the tasks using `updateSpinnerText` and `spinnerSuccess` functions.\n\nHere's an example of how this code might be used:\n\n```javascript\nindex({\n name: \"myProject\",\n root: \"./input\",\n output: \"./output\",\n filePrompt: true,\n folderPrompt: true,\n contentType: \"code\",\n targetAudience: \"developers\",\n linkHosted: \"https://github.com/user/myProject\",\n});\n```\n\nThis will process the repository located at `./input`, generate documentation in JSON, Markdown, and vector formats, and save the results in the `./output` directory.\n\nThe `prompts.ts` file contains utility functions that generate prompts for documentation experts. These functions create markdown formatted strings with specific instructions for the documentation expert, ensuring consistent formatting and instructions across different parts of the project.\n\nIn summary, the code in this folder automates the process of generating documentation for a given repository based on the provided configuration options. The generated documentation can be used for various purposes, such as displaying it on a website or analyzing the content for specific insights.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\commands\\index\\summary.md"}}],["8",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\cli\\commands\\init\\index.ts)\n\nThis code is responsible for initializing the configuration of the Autodoc project. It provides a template for the configuration and prompts the user to input necessary information to set up the project. The main functionality is exposed through the `init` function, which is an asynchronous function that takes an optional `AutodocRepoConfig` object as an argument.\n\nThe `makeConfigTemplate` function creates a default configuration object with pre-defined values for various properties. It takes an optional `config` parameter and returns a new `AutodocRepoConfig` object with the provided values or default values if not provided.\n\nThe `init` function first checks if an `autodoc.config.json` file already exists in the project root. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nNext, the user is prompted to enter the name of their repository, the GitHub URL of their repository, and the LLMs they have access to. The LLMs are language models used for generating documentation. The user can choose between GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access).\n\nAfter the user provides the necessary information, a new configuration object is created using the `makeConfigTemplate` function with the user's input. The new configuration is then written to the `autodoc.config.json` file in the project root.\n\nFinally, a success message is displayed, instructing the user to run `doc index` to get started with the Autodoc project.\n\nExample usage:\n\n```javascript\nimport { init } from './path/to/this/file';\n\n// Initialize the configuration with default values\nawait init();\n\n// Initialize the configuration with custom values\nawait init({\n name: 'My Custom Repository',\n repositoryUrl: 'https://github.com/user/repo',\n});\n```\n## Questions: \n 1. **What is the purpose of the `makeConfigTemplate` function?**\n\n The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc project. It takes an optional `config` parameter of type `AutodocRepoConfig` and returns a new configuration object with default values for various properties.\n\n2. **How does the `init` function work and when is it called?**\n\n The `init` function is an asynchronous function that initializes the Autodoc configuration by creating an `autodoc.config.json` file in the specified location. It takes an optional `config` parameter of type `AutodocRepoConfig` and prompts the user for input to set the configuration values. It is called when the user wants to set up the Autodoc configuration for their project.\n\n3. **What is the purpose of the `inquirer.prompt` calls in the `init` function?**\n\n The `inquirer.prompt` calls are used to interactively prompt the user for input to set the configuration values for the Autodoc project. The user is asked for the repository name, repository URL, and the LLMs they have access to. The input is then used to create a new configuration object and write it to the `autodoc.config.json` file.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\commands\\init\\index.md"}}],["9",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\init)\n\nThe `index.ts` file in the `.autodoc\\docs\\json\\src\\cli\\commands\\init` folder is responsible for initializing the configuration of the Autodoc project. It provides a template for the configuration and prompts the user to input necessary information to set up the project. The main functionality is exposed through the `init` function, which is an asynchronous function that takes an optional `AutodocRepoConfig` object as an argument.\n\nThe `makeConfigTemplate` function creates a default configuration object with pre-defined values for various properties. It takes an optional `config` parameter and returns a new `AutodocRepoConfig` object with the provided values or default values if not provided.\n\nThe `init` function first checks if an `autodoc.config.json` file already exists in the project root. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nNext, the user is prompted to enter the name of their repository, the GitHub URL of their repository, and the LLMs they have access to. The LLMs are language models used for generating documentation. The user can choose between GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access).\n\nAfter the user provides the necessary information, a new configuration object is created using the `makeConfigTemplate` function with the user's input. The new configuration is then written to the `autodoc.config.json` file in the project root.\n\nFinally, a success message is displayed, instructing the user to run `doc index` to get started with the Autodoc project.\n\nExample usage:\n\n```javascript\nimport { init } from './path/to/this/file';\n\n// Initialize the configuration with default values\nawait init();\n\n// Initialize the configuration with custom values\nawait init({\n name: 'My Custom Repository',\n repositoryUrl: 'https://github.com/user/repo',\n});\n```\n\nThis code is essential for setting up the Autodoc project, as it creates the necessary configuration file and gathers user input to customize the project. It works in conjunction with other parts of the project, such as the CLI and the documentation generation process, which rely on the configuration file to function correctly.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\commands\\init\\summary.md"}}],["10",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\cli\\commands\\query\\createChatChain.ts)\n\nThis code defines a function `makeChain` that creates a chatbot for answering questions about a software project called `projectName`. The chatbot is trained on the content of the project, which is located at `repositoryUrl`. The content type of the project is specified by the `contentType` parameter. The chatbot is designed to provide conversational answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate. The target audience for the chatbot is specified by the `targetAudience` parameter.\n\nThe `makeChain` function takes several parameters:\n\n- `projectName`: The name of the software project.\n- `repositoryUrl`: The URL of the project's repository.\n- `contentType`: The type of content the chatbot is trained on.\n- `chatPrompt`: Additional instructions for answering questions about the content type.\n- `targetAudience`: The intended audience for the chatbot's answers.\n- `vectorstore`: An instance of HNSWLib for efficient nearest neighbor search.\n- `llms`: An array of LLMModels, which are language models used for generating answers.\n- `onTokenStream`: An optional callback function that is called when a new token is generated by the language model.\n\nThe `makeChain` function first creates a question generator using the `LLMChain` class. This generator is responsible for rephrasing follow-up questions to be standalone questions. It uses the `CONDENSE_PROMPT` template, which is defined at the beginning of the code.\n\nNext, the function creates a `QA_PROMPT` template using the `makeQAPrompt` function. This template is used to generate answers to the questions in a conversational manner, with hyperlinks back to GitHub and code examples where appropriate.\n\nFinally, the function creates and returns a new instance of the `ChatVectorDBQAChain` class, which combines the question generator and the document chain to create a chatbot that can answer questions about the software project. The chatbot uses the `vectorstore` for efficient nearest neighbor search and the `llms` language models for generating answers. If the `onTokenStream` callback is provided, it will be called when a new token is generated by the language model.\n## Questions: \n 1. **Question:** What is the purpose of the `makeChain` function and what are its input parameters?\n\n **Answer:** The `makeChain` function is used to create a `ChatVectorDBQAChain` instance, which is responsible for generating questions and answers based on the given input parameters. The input parameters include `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and an optional `onTokenStream` function.\n\n2. **Question:** What are the roles of `CONDENSE_PROMPT` and `QA_PROMPT` in this code?\n\n **Answer:** `CONDENSE_PROMPT` is a template for generating standalone questions from a given chat history and follow-up question. `QA_PROMPT` is a template for generating conversational answers with hyperlinks to GitHub, based on the provided context and question. Both templates are used in the `LLMChain` and `loadQAChain` instances, respectively.\n\n3. **Question:** How does the `onTokenStream` function work and when is it used?\n\n **Answer:** The `onTokenStream` function is an optional callback that can be provided to the `makeChain` function. It is used to handle the streaming of tokens generated by the OpenAIChat instance. If provided, it will be called with each new token generated during the chat process.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\commands\\query\\createChatChain.md"}}],["11",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\cli\\commands\\query\\index.ts)\n\nThis code defines a chatbot interface for the Autodoc project, which allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a combination of the `inquirer` library for user input, `marked` and `marked-terminal` for rendering Markdown output, and the `langchain` library for handling natural language processing tasks.\n\nThe `query` function is the main entry point for the chatbot. It takes two arguments: an `AutodocRepoConfig` object containing information about the code repository, and an `AutodocUserConfig` object containing user-specific settings. The function initializes a vector store using the `HNSWLib` and `OpenAIEmbeddings` classes, and creates a chat chain using the `makeChain` function.\n\nThe chatbot interface is displayed using the `displayWelcomeMessage` function, which prints a welcome message to the console. The `getQuestion` function is used to prompt the user for a question using the `inquirer` library. The chatbot then enters a loop, where it processes the user's question, generates a response using the chat chain, and displays the response as Markdown in the terminal.\n\nIf an error occurs during the processing of a question, the chatbot will display an error message and continue to prompt the user for a new question. The loop continues until the user types 'exit', at which point the chatbot terminates.\n\nHere's an example of how the `query` function might be used:\n\n```javascript\nimport { query } from './autodoc';\n\nconst repoConfig = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n output: 'path/to/output',\n contentType: 'code',\n chatPrompt: 'Ask me anything about MyProject',\n targetAudience: 'developers',\n};\n\nconst userConfig = {\n llms: 'path/to/llms',\n};\n\nquery(repoConfig, userConfig);\n```\n\nThis example would initialize the chatbot with the specified repository and user configurations, and start the chatbot interface for the user to ask questions about the \"MyProject\" codebase.\n## Questions: \n 1. **What is the purpose of the `query` function in this code?**\n\n The `query` function is responsible for handling user interactions with the chatbot. It takes in an AutodocRepoConfig object and an AutodocUserConfig object, sets up the necessary data structures, and then enters a loop where it prompts the user for questions, processes them, and displays the results.\n\n2. **How does the code handle rendering Markdown text in the terminal?**\n\n The code uses the `marked` library along with a custom `TerminalRenderer` to render Markdown text in the terminal. The `marked` library is configured with the custom renderer using `marked.setOptions({ renderer: new TerminalRenderer() });`.\n\n3. **What is the purpose of the `chatHistory` variable and how is it used?**\n\n The `chatHistory` variable is an array that stores the history of questions and answers in the chat session. It is used to keep track of the conversation between the user and the chatbot. When a new question is asked, the chat history is passed to the `chain.call()` function, and the new question and its corresponding answer are added to the `chatHistory` array.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\commands\\query\\index.md"}}],["12",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\query)\n\nThe `query` folder in the Autodoc project contains code for creating a chatbot that can answer questions about a specific software project in a conversational manner. The chatbot is trained on the content of the project and provides answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate.\n\nThe main entry point for the chatbot is the `query` function in `index.ts`. It takes two arguments: an `AutodocRepoConfig` object containing information about the code repository, and an `AutodocUserConfig` object containing user-specific settings. The function initializes a vector store and creates a chat chain using the `makeChain` function from `createChatChain.ts`.\n\nHere's an example of how the `query` function might be used:\n\n```javascript\nimport { query } from './autodoc';\n\nconst repoConfig = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n output: 'path/to/output',\n contentType: 'code',\n chatPrompt: 'Ask me anything about MyProject',\n targetAudience: 'developers',\n};\n\nconst userConfig = {\n llms: 'path/to/llms',\n};\n\nquery(repoConfig, userConfig);\n```\n\nThis example initializes the chatbot with the specified repository and user configurations and starts the chatbot interface for the user to ask questions about the \"MyProject\" codebase.\n\nThe `createChatChain.ts` file defines the `makeChain` function, which creates a chatbot for answering questions about a software project. The chatbot is designed to provide conversational answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate. The target audience for the chatbot is specified by the `targetAudience` parameter.\n\nThe `makeChain` function takes several parameters, such as `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and `onTokenStream`. It first creates a question generator using the `LLMChain` class, then creates a `QA_PROMPT` template using the `makeQAPrompt` function, and finally creates and returns a new instance of the `ChatVectorDBQAChain` class, which combines the question generator and the document chain to create a chatbot that can answer questions about the software project.\n\nIn summary, the code in the `query` folder is responsible for creating a chatbot that can answer questions about a specific software project in a conversational manner. The chatbot uses a combination of natural language processing techniques and efficient nearest neighbor search to generate accurate and relevant answers for the user.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\commands\\query\\summary.md"}}],["13",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands)\n\nThe code in the `.autodoc\\docs\\json\\src\\cli\\commands` folder is responsible for various tasks related to the Autodoc project, such as initializing the configuration, processing repositories, generating documentation, and creating a chatbot for answering questions about a specific software project. The folder contains several subfolders, each with a specific purpose.\n\n### estimate\n\nThe `estimate` function provides an estimated cost of processing a given repository. It takes an `AutodocRepoConfig` object as input and performs a dry run of the repository processing to calculate the estimated cost. Example usage:\n\n```javascript\nimport { estimate } from './path/to/this/file';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\n### index\n\nThe code in this folder processes a given repository and generates documentation in JSON, Markdown, and vector formats. It takes an `AutodocRepoConfig` object as input and performs three main tasks: processing the repository, creating Markdown files, and creating vector files. Example usage:\n\n```javascript\nindex({\n name: \"myProject\",\n root: \"./input\",\n output: \"./output\",\n filePrompt: true,\n folderPrompt: true,\n contentType: \"code\",\n targetAudience: \"developers\",\n linkHosted: \"https://github.com/user/myProject\",\n});\n```\n\n### init\n\nThe `init` function initializes the configuration of the Autodoc project. It prompts the user to input necessary information to set up the project and creates the `autodoc.config.json` file in the project root. Example usage:\n\n```javascript\nimport { init } from './path/to/this/file';\n\n// Initialize the configuration with default values\nawait init();\n\n// Initialize the configuration with custom values\nawait init({\n name: 'My Custom Repository',\n repositoryUrl: 'https://github.com/user/repo',\n});\n```\n\n### query\n\nThe `query` folder contains code for creating a chatbot that can answer questions about a specific software project. The main entry point is the `query` function, which takes an `AutodocRepoConfig` object and an `AutodocUserConfig` object as input. Example usage:\n\n```javascript\nimport { query } from './autodoc';\n\nconst repoConfig = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n output: 'path/to/output',\n contentType: 'code',\n chatPrompt: 'Ask me anything about MyProject',\n targetAudience: 'developers',\n};\n\nconst userConfig = {\n llms: 'path/to/llms',\n};\n\nquery(repoConfig, userConfig);\n```\n\n### user\n\nThe `user` folder manages the user configuration for the Autodoc project. It allows users to create, update, and save their configuration file, which stores information about their access to different Language Learning Models (LLMs). Example usage:\n\n```javascript\nimport { user } from './path/to/this/file';\n\n// Create a new user configuration with default settings\nawait user();\n\n// Update the user configuration with a custom config object\nawait user({ llms: [LLMModels.GPT3, LLMModels.GPT4] });\n```\n\nIn summary, the code in this folder is essential for various tasks related to the Autodoc project, such as initializing the configuration, processing repositories, generating documentation, and creating a chatbot for answering questions about a specific software project.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\commands\\summary.md"}}],["14",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\cli\\commands\\user\\index.ts)\n\nThis code is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3, GPT-4, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the provided `config` parameter or with GPT-3 as the default LLM. This function is used to generate a new configuration object when needed.\n\nThe main function, `user`, is an asynchronous function that takes an optional `config` parameter. It first checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user configuration file does not exist, the code attempts to create the necessary directories for the file. If there's an error during this process, it logs the error and exits with a non-zero status code.\n\nNext, the user is prompted to select which LLMs they have access to. The available options are GPT-3.5 Turbo, GPT-3.5 Turbo with GPT-4 8K (Early Access), and GPT-3.5 Turbo with GPT-4 8K and GPT-4 32K (Early Access). The user's selection is then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the user configuration file in JSON format. A success message is displayed to the user, indicating that the configuration has been saved and they can start querying using the `doc q` command.\n\nExample usage:\n\n```javascript\nimport { user } from './path/to/this/file';\n\n// Create a new user configuration with default settings\nawait user();\n\n// Update the user configuration with a custom config object\nawait user({ llms: [LLMModels.GPT3, LLMModels.GPT4] });\n```\n## Questions: \n 1. **What is the purpose of the `makeConfigTemplate` function?**\n\n The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc user. It takes an optional `config` parameter and returns an object with a `llms` property, which is an array of LLM models.\n\n2. **How does the `user` function handle existing user configuration files?**\n\n The `user` function checks if a user configuration file already exists using `fsSync.existsSync`. If it does, the user is prompted with a confirmation message to overwrite the existing configuration. If the user chooses not to overwrite, the process exits with a status code of 0.\n\n3. **What are the available choices for LLM models in the `user` function?**\n\n The available choices for LLM models are GPT-3.5 Turbo, GPT-3.5 Turbo and GPT-4 8K (Early Access), and GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The user can select one of these options, and the selected value is stored in the `llms` property of the new configuration object.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\commands\\user\\index.md"}}],["15",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\user)\n\nThe `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project. It allows users to create, update, and save their configuration file, which stores information about their access to different Language Learning Models (LLMs) such as GPT-3, GPT-4, and GPT-4 32K.\n\nThe `makeConfigTemplate` function creates a default configuration object with either the provided `config` parameter or GPT-3 as the default LLM. This function is useful for generating a new configuration object when needed.\n\nThe main function, `user`, is an asynchronous function that takes an optional `config` parameter. It first checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user configuration file does not exist, the code attempts to create the necessary directories for the file. If there's an error during this process, it logs the error and exits with a non-zero status code.\n\nNext, the user is prompted to select which LLMs they have access to. The available options are GPT-3.5 Turbo, GPT-3.5 Turbo with GPT-4 8K (Early Access), and GPT-3.5 Turbo with GPT-4 8K and GPT-4 32K (Early Access). The user's selection is then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the user configuration file in JSON format. A success message is displayed to the user, indicating that the configuration has been saved and they can start querying using the `doc q` command.\n\nThis code is essential for the Autodoc project as it allows users to manage their access to different LLMs and store their preferences in a configuration file. This configuration file can then be used by other parts of the project to determine which LLMs the user has access to and tailor the querying process accordingly.\n\nExample usage:\n\n```javascript\nimport { user } from './path/to/this/file';\n\n// Create a new user configuration with default settings\nawait user();\n\n// Update the user configuration with a custom config object\nawait user({ llms: [LLMModels.GPT3, LLMModels.GPT4] });\n```\n\nIn summary, the `index.ts` file in the `user` folder is a crucial part of the Autodoc project, allowing users to manage their LLM access and preferences. This configuration is then used by other parts of the project to provide a tailored experience based on the user's access to different LLMs.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\commands\\user\\summary.md"}}],["16",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\cli\\spinner.ts)\n\nThis code is responsible for managing a spinner, which is a visual element that indicates a process is running in the background. The spinner is created using the `ora` library, which provides a simple and customizable way to create spinners for command-line interfaces.\n\nThe code starts by importing the `ora` library and creating a singleton spinner instance with the 'dots' style. This ensures that there will only be one spinner active at any given time.\n\nThere are several functions exported by this module to interact with the spinner:\n\n1. `updateSpinnerText(message: string)`: This function updates the spinner's text with the provided message. If the spinner is already spinning, it simply updates the text; otherwise, it starts the spinner with the new message.\n\n Example usage:\n ```javascript\n updateSpinnerText('Loading data...');\n ```\n\n2. `stopSpinner()`: This function stops the spinner if it is currently spinning.\n\n Example usage:\n ```javascript\n stopSpinner();\n ```\n\n3. `spinnerError(message?: string)`: This function stops the spinner and marks it as failed with an optional error message. It only takes effect if the spinner is currently spinning.\n\n Example usage:\n ```javascript\n spinnerError('Failed to load data');\n ```\n\n4. `spinnerSuccess(message?: string)`: This function stops the spinner and marks it as successful with an optional success message. It only takes effect if the spinner is currently spinning.\n\n Example usage:\n ```javascript\n spinnerSuccess('Data loaded successfully');\n ```\n\n5. `spinnerInfo(message: string)`: This function displays an informational message without affecting the spinner's state.\n\n Example usage:\n ```javascript\n spinnerInfo('Connecting to server...');\n ```\n\nIn the larger project, this module can be used to provide visual feedback to users when a background process is running, such as loading data, connecting to a server, or performing a complex calculation. By using the exported functions, developers can easily update the spinner's text, stop it, or change its state to indicate success, failure, or display informational messages.\n## Questions: \n 1. **What is the purpose of the `ora` package in this code?**\n\n The `ora` package is used to create a spinner in the command line interface, providing a visual indication of a running process. In this code, it is used to create a singleton spinner with the 'dots' style.\n\n2. **How does the `updateSpinnerText` function work?**\n\n The `updateSpinnerText` function takes a message as an input and updates the spinner's text with the given message. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the new message.\n\n3. **What are the differences between `spinnerError`, `spinnerSuccess`, and `spinnerInfo` functions?**\n\n These functions are used to update the spinner's state and message based on the outcome of a process. `spinnerError` is called when there is an error, and it stops the spinner with a failure message. `spinnerSuccess` is called when the process is successful, and it stops the spinner with a success message. `spinnerInfo` is used to display an informational message without stopping the spinner.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\spinner.md"}}],["17",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli)\n\nThe code in the `spinner.ts` file, located in the `.autodoc\\docs\\json\\src\\cli` folder, is responsible for managing a spinner, a visual element that indicates a background process is running. The spinner is created using the `ora` library, which provides a simple and customizable way to create spinners for command-line interfaces.\n\nThe module exports several functions to interact with the spinner:\n\n1. `updateSpinnerText(message: string)`: Updates the spinner's text with the provided message. If the spinner is already spinning, it simply updates the text; otherwise, it starts the spinner with the new message.\n\n Example usage:\n ```javascript\n updateSpinnerText('Loading data...');\n ```\n\n2. `stopSpinner()`: Stops the spinner if it is currently spinning.\n\n Example usage:\n ```javascript\n stopSpinner();\n ```\n\n3. `spinnerError(message?: string)`: Stops the spinner and marks it as failed with an optional error message. It only takes effect if the spinner is currently spinning.\n\n Example usage:\n ```javascript\n spinnerError('Failed to load data');\n ```\n\n4. `spinnerSuccess(message?: string)`: Stops the spinner and marks it as successful with an optional success message. It only takes effect if the spinner is currently spinning.\n\n Example usage:\n ```javascript\n spinnerSuccess('Data loaded successfully');\n ```\n\n5. `spinnerInfo(message: string)`: Displays an informational message without affecting the spinner's state.\n\n Example usage:\n ```javascript\n spinnerInfo('Connecting to server...');\n ```\n\nIn the larger project, this module can be used to provide visual feedback to users when a background process is running, such as loading data, connecting to a server, or performing a complex calculation. By using the exported functions, developers can easily update the spinner's text, stop it, or change its state to indicate success, failure, or display informational messages.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\summary.md"}}],["18",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\cli\\utils\\APIRateLimit.ts)\n\nThe `APIRateLimit` class in this code snippet is designed to manage and limit the number of concurrent API calls made by the application. This is useful in situations where the API being called has a rate limit or when the application needs to prevent overwhelming the server with too many requests at once.\n\nThe class constructor takes an optional parameter `maxConcurrentCalls`, which defaults to 50, to set the maximum number of concurrent API calls allowed. It maintains a queue of API calls and keeps track of the number of calls in progress.\n\nThe main method of this class is `callApi(apiFunction: () => Promise): Promise`. It takes a function `apiFunction` that returns a promise and wraps it in a new promise. The purpose of this wrapping is to control the execution of the API calls and ensure that they do not exceed the specified rate limit.\n\nWhen `callApi` is called, the provided `apiFunction` is added to the queue and the `dequeueAndExecute` method is triggered if there are available slots for concurrent calls. The `dequeueAndExecute` method checks if there are any API calls in the queue and if the number of in-progress calls is below the maximum limit. If both conditions are met, it dequeues the next API call and executes it.\n\nThe `executeCall` function inside `callApi` is responsible for actually calling the API function, resolving or rejecting the promise based on the result, and updating the number of in-progress calls. Once an API call is completed, the `dequeueAndExecute` method is called again to process any remaining calls in the queue.\n\nHere's an example of how this class can be used in the larger project:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\n\nasync function fetchSomeData(id) {\n // Call the API using the rate limiter\n const result = await apiRateLimiter.callApi(() => fetch(`https://api.example.com/data/${id}`));\n return result;\n}\n```\n\nIn this example, the `APIRateLimit` class is used to limit the number of concurrent calls to the `fetch` function, ensuring that no more than 10 calls are made at once.\n## Questions: \n 1. **What is the purpose of the `APIRateLimit` class?**\n\n The `APIRateLimit` class is designed to manage and limit the number of concurrent API calls to a specified maximum, preventing the application from overwhelming the API with too many requests at once.\n\n2. **How does the `callApi` method work and what is its return type?**\n\n The `callApi` method takes an `apiFunction` as an argument, which is a function that returns a Promise. It adds the API call to a queue and executes it when there are available slots for concurrent calls. The method returns a Promise of type `T`, where `T` is the expected return type of the `apiFunction`.\n\n3. **How can the maximum number of concurrent calls be configured?**\n\n The maximum number of concurrent calls can be configured by passing a value to the `maxConcurrentCalls` parameter in the constructor of the `APIRateLimit` class. If no value is provided, the default value is set to 50.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\utils\\APIRateLimit.md"}}],["19",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\cli\\utils\\FileUtil.ts)\n\nThis code provides utility functions for handling file and folder paths in the autodoc project. The main purpose of these functions is to generate file names and GitHub URLs for documentation files.\n\n1. `getFileName(input, delimiter, extension)`: This function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns a new string with the given extension. If the delimiter is found in the input string, the function removes the part of the string after the last occurrence of the delimiter and appends the extension. If the delimiter is not found, the function simply appends the extension to the input string. This function can be used to generate file names for documentation files with the desired extension.\n\n Example usage:\n\n ```\n getFileName('example.txt'); // returns 'example.md'\n getFileName('example', '_', '.html'); // returns 'example.html'\n ```\n\n2. `githubFileUrl(githubRoot, inputRoot, filePath, linkHosted)`: This function generates a GitHub URL for a file. It takes the GitHub repository root URL, the input root folder path, the file path, and a boolean flag indicating whether the URL should be for the hosted version of the file or the source code. It returns a string with the generated URL.\n\n Example usage:\n\n ```\n githubFileUrl('https://github.com/user/repo', '/input', '/input/example.md', true);\n // returns 'https://github.com/user/repo/example.md'\n ```\n\n3. `githubFolderUrl(githubRoot, inputRoot, folderPath, linkHosted)`: This function is similar to `githubFileUrl`, but it generates a GitHub URL for a folder instead of a file. It takes the same arguments as `githubFileUrl` and returns a string with the generated URL.\n\n Example usage:\n\n ```\n githubFolderUrl('https://github.com/user/repo', '/input', '/input/folder', true);\n // returns 'https://github.com/user/repo/folder'\n ```\n\nThese utility functions can be used throughout the autodoc project to generate file names and GitHub URLs for documentation files and folders, ensuring consistent naming and URL generation across the project.\n## Questions: \n 1. **What is the purpose of the `getFileName` function?**\n\n The `getFileName` function takes an input string, an optional delimiter, and an optional extension, and returns a new string with the given extension. If the delimiter is not found in the input string, the extension is simply appended to the input string. If the delimiter is found, the input string is sliced up to the last delimiter index and the extension is appended.\n\n2. **What are the differences between the `githubFileUrl` and `githubFolderUrl` functions?**\n\n Both functions take the same parameters: `githubRoot`, `inputRoot`, a path (either `filePath` or `folderPath`), and a `linkHosted` boolean. The main difference is in the returned URL: `githubFileUrl` returns a URL pointing to a file in the GitHub repository, while `githubFolderUrl` returns a URL pointing to a folder in the GitHub repository. The URL structure differs slightly, with `/blob/master/` for files and `/tree/master/` for folders.\n\n3. **What is the purpose of the `linkHosted` parameter in the `githubFileUrl` and `githubFolderUrl` functions?**\n\n The `linkHosted` parameter is a boolean that determines whether the returned URL should point to the hosted version of the file or folder on GitHub Pages (if `true`) or to the file or folder within the GitHub repository itself (if `false`). Depending on the value of `linkHosted`, the functions will return different URL structures.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\utils\\FileUtil.md"}}],["20",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\cli\\utils\\LLMUtil.ts)\n\nThis code defines and manages different language models (LLMs) and their associated costs for a project that utilizes OpenAI's GPT models. It imports the `OpenAIChat` class from the `langchain/llms` module and the `LLMModelDetails` and `LLMModels` types from the `../../types.js` file.\n\nThe `models` object contains three LLMs: GPT3, GPT4, and GPT432k. Each model has its own properties, such as `name`, `inputCostPer1KTokens`, `outputCostPer1KTokens`, `maxLength`, and an instance of the `OpenAIChat` class with the respective model name and API key. Additionally, each model has counters for input tokens, output tokens, succeeded, failed, and total files processed.\n\nThe `printModelDetails` function takes an array of `LLMModelDetails` and prints a summary table to the console. It calculates the total cost for each model based on the input and output tokens and their respective costs per 1,000 tokens. It also calculates the total file count, succeeded, failed, tokens, and cost across all models.\n\nThe `totalIndexCostEstimate` function calculates the total cost of indexing all models in the input array. It uses the same cost calculation as in `printModelDetails` but returns the total cost as a number.\n\nThese functions can be used in the larger project to manage and analyze the usage and costs of different LLMs. For example, the `printModelDetails` function can be called to display a summary of the models' usage and costs:\n\n```javascript\nimport { models, printModelDetails } from './path/to/this/file';\n\n// Process files with models...\n// Update models' properties...\n\nprintModelDetails(Object.values(models));\n```\n\nAnd the `totalIndexCostEstimate` function can be used to estimate the total cost of indexing all models:\n\n```javascript\nimport { models, totalIndexCostEstimate } from './path/to/this/file';\n\n// Process files with models...\n// Update models' properties...\n\nconst totalCost = totalIndexCostEstimate(Object.values(models));\nconsole.log(`Total cost: ${totalCost}`);\n```\n## Questions: \n 1. **Question:** What is the purpose of the `models` object and how are the different GPT models being used?\n **Answer:** The `models` object is a record that maps different GPT models (GPT3, GPT4, and GPT432k) to their respective details, such as cost per tokens, maximum length, and an instance of `OpenAIChat` with the corresponding model configuration.\n\n2. **Question:** How does the `printModelDetails` function work and what information does it display?\n **Answer:** The `printModelDetails` function takes an array of `LLMModelDetails` as input, processes the information for each model, and then prints a summary table to the console. The table includes the model name, file count, succeeded and failed counts, total tokens, and cost.\n\n3. **Question:** What is the purpose of the `totalIndexCostEstimate` function and how is it calculating the total cost?\n **Answer:** The `totalIndexCostEstimate` function calculates the total cost of processing the given models by iterating through the input `models` array and summing up the costs based on the input and output tokens and their respective costs per 1K tokens.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\utils\\LLMUtil.md"}}],["21",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\utils)\n\nThe `.autodoc\\docs\\json\\src\\cli\\utils` folder contains utility functions and classes that assist in managing API rate limits, handling file and folder paths, managing language models, traversing file systems, and controlling asynchronous operations. These utilities can be used throughout the autodoc project to ensure consistent behavior and improve code organization.\n\n`APIRateLimit.ts` provides the `APIRateLimit` class, which manages and limits the number of concurrent API calls made by the application. This is useful when working with rate-limited APIs or preventing server overload. Example usage:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\nasync function fetchSomeData(id) {\n const result = await apiRateLimiter.callApi(() => fetch(`https://api.example.com/data/${id}`));\n return result;\n}\n```\n\n`FileUtil.ts` offers utility functions for generating file names and GitHub URLs for documentation files. These functions ensure consistent naming and URL generation across the project. Example usage:\n\n```javascript\ngetFileName('example.txt'); // returns 'example.md'\ngithubFileUrl('https://github.com/user/repo', '/input', '/input/example.md', true); // returns 'https://github.com/user/repo/example.md'\n```\n\n`LLMUtil.ts` defines and manages different language models (LLMs) and their associated costs for a project utilizing OpenAI's GPT models. Functions like `printModelDetails` and `totalIndexCostEstimate` can be used to manage and analyze the usage and costs of different LLMs. Example usage:\n\n```javascript\nimport { models, printModelDetails } from './path/to/this/file';\nprintModelDetails(Object.values(models));\nconst totalCost = totalIndexCostEstimate(Object.values(models));\nconsole.log(`Total cost: ${totalCost}`);\n```\n\n`traverseFileSystem.ts` contains the `traverseFileSystem` function, which recursively traverses a given file system, processing files and folders based on provided parameters. This is useful for generating documentation or performing tasks that require processing files and folders in a directory structure. Example usage:\n\n```javascript\nawait traverseFileSystem({\n inputPath: './src',\n projectName: 'myProject',\n processFile: (params) => { /* Process file logic */ },\n processFolder: (params) => { /* Process folder logic */ },\n ignore: ['node_modules/**', '.git/**'],\n});\n```\n\n`WaitUtil.ts` provides two utility functions, `wait` and `forTrue`, which help manage asynchronous operations by introducing delays and waiting for specific conditions to be met. These functions can be used to control the flow of asynchronous code execution. Example usage:\n\n```javascript\nasync function delayedEcho() {\n console.log(\"Start\");\n await wait(1000, \"Hello\");\n console.log(\"End\");\n}\n\nasync function waitForCondition() {\n console.log(\"Waiting for condition...\");\n await forTrue(() => condition);\n console.log(\"Condition met!\");\n}\n```\n\nIn summary, the utilities in this folder enhance the autodoc project by providing consistent behavior, improving code organization, and managing various aspects of the project, such as API rate limits, file and folder paths, language models, file system traversal, and asynchronous operations.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\utils\\summary.md"}}],["22",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\cli\\utils\\traverseFileSystem.ts)\n\nThe `traverseFileSystem` function in this code is an asynchronous function that recursively traverses a given file system, processing files and folders based on the provided parameters. It is designed to be used in the larger project for generating documentation or performing other tasks that require processing files and folders in a directory structure.\n\nThe function takes an object of type `TraverseFileSystemParams` as its input, which contains various properties to control the traversal and processing behavior. These properties include:\n\n- `inputPath`: The root path to start the traversal from.\n- `projectName`: The name of the project being processed.\n- `processFile`: An optional callback function to process a file.\n- `processFolder`: An optional callback function to process a folder.\n- `ignore`: An array of patterns to ignore during traversal.\n- `filePrompt`, `folderPrompt`: Optional prompts for user interaction.\n- `contentType`, `targetAudience`, `linkHosted`: Additional metadata for processing.\n\nThe function first checks if the provided `inputPath` exists using `fs.access`. If the path does not exist, it logs an error message and returns. It then defines a helper function `shouldIgnore` that checks if a given file or folder should be ignored based on the `ignore` patterns.\n\nThe main logic of the function is implemented in the `dfs` (depth-first search) function, which is called recursively to traverse the file system. It reads the contents of the current directory using `fs.readdir`, filters out ignored items, and processes the remaining items.\n\nFor each item, if it is a directory, the `dfs` function is called recursively, and the `processFolder` callback is invoked if provided. If it is a file and its content is text (checked using `isText`), the `processFile` callback is invoked if provided.\n\nThe traversal is performed using `Promise.all` to process items concurrently, improving performance. If an error occurs during traversal, it is logged and rethrown.\n\nHere's an example of how this function might be used in the larger project:\n\n```javascript\nawait traverseFileSystem({\n inputPath: './src',\n projectName: 'myProject',\n processFile: (params) => {\n // Process file logic here\n },\n processFolder: (params) => {\n // Process folder logic here\n },\n ignore: ['node_modules/**', '.git/**'],\n});\n```\n## Questions: \n 1. **What is the purpose of the `traverseFileSystem` function?**\n\n The `traverseFileSystem` function is an asynchronous function that traverses a given file system, processes folders and files based on the provided parameters, and ignores files and folders based on the given ignore patterns.\n\n2. **How does the `shouldIgnore` function work?**\n\n The `shouldIgnore` function takes a file name as input and returns a boolean value indicating whether the file should be ignored or not. It checks if the file name matches any of the ignore patterns provided in the `ignore` parameter using the `minimatch` library.\n\n3. **What is the role of the `dfs` function inside `traverseFileSystem`?**\n\n The `dfs` function is an asynchronous function that performs a depth-first search on the file system starting from the given `currentPath`. It processes folders and files based on the provided parameters and recursively calls itself for each subdirectory found.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\utils\\traverseFileSystem.md"}}],["23",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\cli\\utils\\WaitUtil.ts)\n\nThe code in this file provides two utility functions, `wait` and `forTrue`, which are designed to help manage asynchronous operations in the larger project. Both functions return a `Promise`, making them suitable for use with `async/await` syntax.\n\n### wait\n\nThe `wait` function takes two arguments: `timeoutMs`, a number representing the desired waiting time in milliseconds, and an optional `value` that defaults to `null`. It returns a `Promise` that resolves with the provided `value` after the specified `timeoutMs` has elapsed. This function can be used to introduce a delay in the execution of asynchronous code.\n\nExample usage:\n\n```javascript\nasync function delayedEcho() {\n console.log(\"Start\");\n await wait(1000, \"Hello\");\n console.log(\"End\");\n}\n\ndelayedEcho(); // Output: Start -> (1 second delay) -> End\n```\n\n### forTrue\n\nThe `forTrue` function takes a single argument, `fn`, which is a function that returns a boolean value. It returns a `Promise` that resolves with `true` when the provided function `fn` returns `true`. The function `fn` is checked every 50 milliseconds, up to a maximum of 200 times (i.e., 10 seconds). If `fn` does not return `true` within this time, the `Promise` is rejected.\n\nThis function can be used to wait for a specific condition to be met before continuing the execution of asynchronous code.\n\nExample usage:\n\n```javascript\nlet condition = false;\n\nsetTimeout(() => {\n condition = true;\n}, 3000);\n\nasync function waitForCondition() {\n console.log(\"Waiting for condition...\");\n await forTrue(() => condition);\n console.log(\"Condition met!\");\n}\n\nwaitForCondition(); // Output: Waiting for condition... -> (3 second delay) -> Condition met!\n```\n\nIn summary, this file provides two utility functions that help manage asynchronous operations by introducing delays and waiting for specific conditions to be met. These functions can be used in the larger project to control the flow of asynchronous code execution.\n## Questions: \n 1. **What is the purpose of the `wait` function?**\n\n The `wait` function is an asynchronous utility function that resolves a promise after a specified timeout in milliseconds, optionally returning a value when the promise is resolved.\n\n2. **How does the `forTrue` function work?**\n\n The `forTrue` function takes a function `fn` as an argument, which should return a boolean value. It checks the result of `fn` every 50 milliseconds and resolves the promise when `fn` returns `true`. If `fn` does not return `true` after 200 attempts, the promise is rejected.\n\n3. **What is the use case for the `forTrue` function?**\n\n The `forTrue` function can be used to wait for a certain condition to be met before proceeding with the execution of the code. This can be useful in situations where you need to wait for an asynchronous operation to complete or a specific state to be reached before continuing.","metadata":{"source":".autodoc\\docs\\markdown\\src\\cli\\utils\\WaitUtil.md"}}],["24",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\const.ts)\n\nThe code in this file is responsible for managing the user configuration file for the autodoc project. It imports two Node.js built-in modules, `path` and `os`, which are used to handle file paths and operating system-related utility functions, respectively.\n\nThe `userConfigFileName` constant is defined as `'autodoc.user.json'`, which represents the name of the user configuration file. This file is expected to store user-specific settings for the autodoc project in JSON format.\n\nThe `userConfigFilePath` constant is created using the `path.resolve()` function, which combines the provided arguments into an absolute file path. The `os.homedir()` function is used to get the current user's home directory, and `./.config/autodoc/` is appended to it as the folder where the user configuration file should be stored. Finally, the `userConfigFileName` constant is appended to the path, resulting in the complete file path for the user configuration file.\n\nBy exporting both `userConfigFileName` and `userConfigFilePath`, other parts of the autodoc project can easily access and use these constants to read or write user-specific settings. For example, when the autodoc application starts, it can read the user configuration file from the specified path, and apply the settings accordingly.\n\nHere's a code example of how these constants might be used in another part of the autodoc project:\n\n```javascript\nimport { userConfigFilePath } from './path/to/this/file';\n\n// Read user configuration from the file\nconst userConfig = JSON.parse(fs.readFileSync(userConfigFilePath, 'utf-8'));\n\n// Apply user settings\napplyUserSettings(userConfig);\n```\n\nIn summary, this code is responsible for defining the name and file path of the user configuration file for the autodoc project, allowing other parts of the project to easily access and manage user-specific settings.\n## Questions: \n 1. **What is the purpose of the `userConfigFileName` and `userConfigFilePath` constants?**\n\n The `userConfigFileName` constant defines the name of the user configuration file for the autodoc project, while the `userConfigFilePath` constant defines the absolute path to this file, which is located in the user's home directory under the `.config/autodoc/` folder.\n\n2. **Why are the `node:path` and `node:os` modules being imported?**\n\n The `node:path` module is imported to provide utilities for working with file and directory paths, such as resolving the absolute path to the user configuration file. The `node:os` module is imported to provide operating system-related utility methods, such as getting the user's home directory.\n\n3. **Is this code compatible with different operating systems?**\n\n Yes, this code is compatible with different operating systems. The `os.homedir()` method returns the home directory of the current user, which is platform-specific, and the `path.resolve()` method takes care of handling the correct path separators for the current operating system.","metadata":{"source":".autodoc\\docs\\markdown\\src\\const.md"}}],["25",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\index.ts)\n\nThis code is the main entry point for the Autodoc CLI tool, which provides a set of commands to help developers automatically generate documentation for their codebase. The tool uses the `commander` library to define and handle commands, and `inquirer` for interactive prompts.\n\nThe available commands are:\n\n1. `init`: Initializes the repository by creating an `autodoc.config.json` file in the current directory. If the file already exists, it uses the existing configuration.\n ```bash\n autodoc init\n ```\n\n2. `estimate`: Estimates the cost of running the `index` command on the repository. It requires the `autodoc.config.json` file to be present.\n ```bash\n autodoc estimate\n ```\n\n3. `index`: Traverses the codebase, writes documentation using LLM, and creates a locally stored index. Before starting the indexing process, it prompts the user for confirmation. It requires the `autodoc.config.json` file to be present.\n ```bash\n autodoc index\n ```\n\n4. `user`: Sets the Autodoc user configuration. If a user configuration file exists, it uses the existing configuration.\n ```bash\n autodoc user\n ```\n\n5. `q`: Queries an Autodoc index. It requires both the `autodoc.config.json` and user configuration files to be present.\n ```bash\n autodoc q\n ```\n\nThe code also listens for unhandled promise rejections and handles them gracefully by showing an error spinner, stopping the spinner, and exiting with an error code.\n\nIn the larger project, this CLI tool serves as the primary interface for users to interact with Autodoc, allowing them to easily generate and manage documentation for their codebase.\n## Questions: \n 1. **What is the purpose of the Autodoc CLI Tool?**\n\n The Autodoc CLI Tool is designed to help developers automatically generate documentation for their codebase by traversing the code, writing docs via LLM, and creating a locally stored index.\n\n2. **How does the `estimate` command work and what does it return?**\n\n The `estimate` command reads the `autodoc.config.json` file and estimates the cost of running the `index` command on the repository. It provides an estimation of the resources required to generate the documentation.\n\n3. **What is the role of the `user` command and how does it interact with the user configuration file?**\n\n The `user` command is responsible for setting the Autodoc user configuration. It reads the user configuration file (if it exists) and allows the user to update or create a new configuration. This configuration is then used in other commands, such as the `query` command, to interact with the Autodoc index.","metadata":{"source":".autodoc\\docs\\markdown\\src\\index.md"}}],["26",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\langchain\\hnswlib.ts)\n\nThe `HNSWLib` class in this code is a specialized vector store that uses the Hierarchical Navigable Small World (HNSW) algorithm for efficient similarity search. It is built on top of the `hnswlib-node` library and extends the `SaveableVectorStore` class. The main purpose of this class is to store and search for documents based on their embeddings, which are high-dimensional vectors representing the documents' content.\n\nThe constructor of the `HNSWLib` class takes an `Embeddings` object and an `HNSWLibArgs` object as arguments. The `Embeddings` object is used to convert documents into their corresponding vector representations, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing the documents.\n\nThe `addDocuments` method takes an array of `Document` objects, converts them into embeddings using the `Embeddings` object, and adds them to the HNSW index. The `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents along with their similarity scores.\n\nThe `save` and `load` methods allow for persisting the HNSW index, document store, and configuration options to disk and loading them back into memory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of texts or documents, respectively.\n\nHere's an example of how to use the `HNSWLib` class:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst args = { space: 'cosine' };\nconst hnswLib = new HNSWLib(embeddings, args);\n\n// Add documents to the index\nawait hnswLib.addDocuments(documents);\n\n// Perform a similarity search\nconst queryVector = /* ... */;\nconst k = 10;\nconst results = await hnswLib.similaritySearchVectorWithScore(queryVector, k);\n```\n\nIn the larger project, the `HNSWLib` class can be used to efficiently store and search for documents based on their content similarity, which can be useful for tasks such as document clustering, recommendation systems, or information retrieval.\n## Questions: \n 1. **Question**: What is the purpose of the `HNSWLib` class and how does it relate to the `SaveableVectorStore` class?\n **Answer**: The `HNSWLib` class is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. It extends the `SaveableVectorStore` class, which provides a base class for vector stores that can be saved and loaded from disk.\n\n2. **Question**: How does the `addDocuments` method work and what is its purpose?\n **Answer**: The `addDocuments` method takes an array of `Document` objects, extracts their `pageContent`, and embeds them using the provided `Embeddings` instance. It then adds the resulting vectors and documents to the HNSW index and the `InMemoryDocstore`, respectively.\n\n3. **Question**: How does the `similaritySearchVectorWithScore` method work and what does it return?\n **Answer**: The `similaritySearchVectorWithScore` method takes a query vector and a number `k` as input, and searches for the `k` most similar vectors in the HNSW index. It returns an array of tuples, where each tuple contains a `Document` object and its corresponding similarity score to the query vector.","metadata":{"source":".autodoc\\docs\\markdown\\src\\langchain\\hnswlib.md"}}],["27",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\langchain)\n\nThe `hnswlib.ts` file in the `.autodoc\\docs\\json\\src\\langchain` folder contains the `HNSWLib` class, which is a specialized vector store utilizing the Hierarchical Navigable Small World (HNSW) algorithm for efficient similarity search. This class is built on top of the `hnswlib-node` library and extends the `SaveableVectorStore` class. Its primary purpose is to store and search for documents based on their embeddings, which are high-dimensional vectors representing the documents' content.\n\nThe `HNSWLib` class constructor takes an `Embeddings` object and an `HNSWLibArgs` object as arguments. The `Embeddings` object is responsible for converting documents into their corresponding vector representations, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing the documents.\n\nThe `addDocuments` method accepts an array of `Document` objects, converts them into embeddings using the `Embeddings` object, and adds them to the HNSW index. The `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents along with their similarity scores.\n\nThe `save` and `load` methods enable persisting the HNSW index, document store, and configuration options to disk and loading them back into memory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of texts or documents, respectively.\n\nIn the larger project, the `HNSWLib` class can be employed to efficiently store and search for documents based on their content similarity, which can be beneficial for tasks such as document clustering, recommendation systems, or information retrieval.\n\nHere's an example of how to use the `HNSWLib` class:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst args = { space: 'cosine' };\nconst hnswLib = new HNSWLib(embeddings, args);\n\n// Add documents to the index\nawait hnswLib.addDocuments(documents);\n\n// Perform a similarity search\nconst queryVector = /* ... */;\nconst k = 10;\nconst results = await hnswLib.similaritySearchVectorWithScore(queryVector, k);\n```\n\nThis code snippet demonstrates how to create an `HNSWLib` instance, add documents to the index, and perform a similarity search. The results can then be used for various purposes, such as finding related documents or generating recommendations based on content similarity.","metadata":{"source":".autodoc\\docs\\markdown\\src\\langchain\\summary.md"}}],["28",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src)\n\nThe `.autodoc\\docs\\json\\src` folder contains the core components of the autodoc project, which is designed to automatically generate documentation for a given code repository using OpenAI's language models (LLMs). The folder consists of three main files: `const.ts`, `index.ts`, and `types.ts`, as well as two subfolders: `cli` and `langchain`.\n\n`const.ts` defines the name and file path of the user configuration file for the autodoc project. This file stores user-specific settings in JSON format. Other parts of the project can easily access and use these constants to read or write user-specific settings. For example:\n\n```javascript\nimport { userConfigFilePath } from './path/to/this/file';\n\n// Read user configuration from the file\nconst userConfig = JSON.parse(fs.readFileSync(userConfigFilePath, 'utf-8'));\n\n// Apply user settings\napplyUserSettings(userConfig);\n```\n\n`index.ts` serves as the main entry point for the Autodoc CLI tool, providing a set of commands for developers to generate and manage documentation for their codebase. The available commands include `init`, `estimate`, `index`, `user`, and `q`. The CLI tool uses the `commander` library for command handling and `inquirer` for interactive prompts.\n\n`types.ts` defines the types and interfaces for the autodoc project, such as `AutodocUserConfig`, `AutodocRepoConfig`, `FileSummary`, `FolderSummary`, and more. These types are used to configure and run the autodoc tool, allowing users to generate documentation for their code repositories using OpenAI's LLMs.\n\nThe `cli` subfolder contains the `spinner.ts` file, which manages a spinner for visual feedback during background processes. It exports functions like `updateSpinnerText`, `stopSpinner`, `spinnerError`, `spinnerSuccess`, and `spinnerInfo` for easy interaction with the spinner.\n\nThe `langchain` subfolder contains the `hnswlib.ts` file, which provides the `HNSWLib` class for efficient similarity search using the Hierarchical Navigable Small World (HNSW) algorithm. This class is used to store and search for documents based on their embeddings, which are high-dimensional vectors representing the documents' content. Example usage:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst args = { space: 'cosine' };\nconst hnswLib = new HNSWLib(embeddings, args);\n\n// Add documents to the index\nawait hnswLib.addDocuments(documents);\n\n// Perform a similarity search\nconst queryVector = /* ... */;\nconst k = 10;\nconst results = await hnswLib.similaritySearchVectorWithScore(queryVector, k);\n```\n\nIn summary, the code in this folder is responsible for the core functionality of the autodoc project, including user configuration management, CLI tool commands, type definitions, spinner management, and efficient similarity search using the HNSW algorithm.","metadata":{"source":".autodoc\\docs\\markdown\\src\\summary.md"}}],["29",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/src\\types.ts)\n\nThis code defines the types and interfaces for the `autodoc` project, which aims to automatically generate documentation for a given code repository. The project uses OpenAI's language models (LLMs) to process and generate summaries, questions, and other relevant information for files and folders in the repository.\n\nThe `AutodocUserConfig` and `AutodocRepoConfig` types define the configuration options for the user and repository, respectively. These include settings such as the LLM models to use, repository URL, output directory, and content type.\n\n`FileSummary` and `FolderSummary` types represent the generated summaries for files and folders, including their paths, URLs, and checksums. The `ProcessFileParams` and `ProcessFolderParams` types define the parameters required for processing files and folders, such as the file or folder name, path, and content type.\n\n`ProcessFile` and `ProcessFolder` are function types that take the respective parameters and return a promise. These functions are responsible for processing the files and folders, generating summaries, and updating the documentation.\n\n`TraverseFileSystemParams` type defines the parameters for traversing the file system, including the input path, project name, and optional `processFile` and `processFolder` functions. It also includes settings for ignoring certain files or folders and content type preferences.\n\nThe `LLMModels` enum lists the available language models, such as GPT-3.5 Turbo, GPT-4, and GPT-4 32k. The `LLMModelDetails` type provides information about each model, including the cost per 1K tokens, maximum length, and success/failure statistics.\n\nIn the larger project, these types and interfaces would be used to configure and run the `autodoc` tool, allowing users to automatically generate documentation for their code repositories using OpenAI's language models. For example, a user could provide an `AutodocRepoConfig` object to configure the tool, and then use the `TraverseFileSystem` function to process the repository and generate the documentation.\n## Questions: \n 1. **What is the purpose of the `AutodocUserConfig` and `AutodocRepoConfig` types?**\n\n The `AutodocUserConfig` type is used to define the user configuration for the autodoc project, which includes an array of LLMModels. The `AutodocRepoConfig` type is used to define the repository configuration for the autodoc project, which includes various properties such as name, repository URL, root, output, LLMModels, and more.\n\n2. **What are the different LLMModels available in the `LLMModels` enum?**\n\n The `LLMModels` enum lists the available language models for the autodoc project. Currently, there are three models: GPT3 (gpt-3.5-turbo), GPT4 (gpt-4), and GPT432k (gpt-4-32k).\n\n3. **What is the purpose of the `ProcessFile` and `ProcessFolder` types?**\n\n The `ProcessFile` type is a function type that takes a `ProcessFileParams` object as input and returns a Promise. It is used to process a single file in the autodoc project. The `ProcessFolder` type is a function type that takes a `ProcessFolderParams` object as input and returns a Promise. It is used to process a folder in the autodoc project.","metadata":{"source":".autodoc\\docs\\markdown\\src\\types.md"}}],["30",{"pageContent":"[View code on GitHub](https://github.com/context-labs/autodoc/tsconfig.json)\n\nThe code provided is a configuration file for the TypeScript compiler in a project. It specifies various options that control how the TypeScript compiler should process the source code and generate the output JavaScript files. This configuration file is typically named `tsconfig.json` and is placed at the root of a TypeScript project.\n\nThe `compilerOptions` object contains several key-value pairs that define the behavior of the TypeScript compiler:\n\n- `rootDir`: Specifies the root directory of the source files. In this case, it is set to \"src\", meaning that the source files are located in the \"src\" folder.\n- `outDir`: Specifies the output directory for the compiled JavaScript files. In this case, it is set to \"dist\", meaning that the compiled files will be placed in the \"dist\" folder.\n- `strict`: Enables strict type checking, which helps catch potential issues in the code.\n- `target`: Specifies the ECMAScript target version for the output JavaScript files. In this case, it is set to \"es2020\", meaning that the output files will be compatible with ECMAScript 2020 features.\n- `module`: Specifies the module system to be used. In this case, it is set to \"ES2020\", meaning that the output files will use the ECMAScript 2020 module system.\n- `sourceMap`: Generates source map files, which help in debugging the compiled code by mapping it back to the original TypeScript source files.\n- `esModuleInterop`: Enables compatibility with ECMAScript modules for importing CommonJS modules.\n- `moduleResolution`: Specifies the module resolution strategy. In this case, it is set to \"node\", meaning that the Node.js module resolution algorithm will be used.\n- `allowSyntheticDefaultImports`: Allows default imports from modules with no default export.\n- `declaration`: Generates TypeScript declaration files (`.d.ts`) alongside the compiled JavaScript files, which can be useful for other projects that depend on this one.\n- `skipLibCheck`: Skips type checking of declaration files, which can speed up the compilation process.\n\nOverall, this configuration file helps ensure that the TypeScript compiler processes the source code according to the specified options, resulting in compiled JavaScript files that are compatible with the desired ECMAScript version and module system, while also providing useful features like source maps and strict type checking.\n## Questions: \n 1. **What is the purpose of the `rootDir` and `outDir` options in the configuration?**\n\n The `rootDir` option specifies the root directory of the input files, while the `outDir` option specifies the output directory for the compiled files.\n\n2. **What does the `strict` option do in the configuration?**\n\n The `strict` option enables a wide range of type checking behavior that results in stronger guarantees of program correctness.\n\n3. **What is the significance of the `target` and `module` options in the configuration?**\n\n The `target` option specifies the ECMAScript target version for the output code, and the `module` option specifies the module system used in the output code. In this case, both are set to \"es2020\", which means the output code will be compatible with ECMAScript 2020 features and module system.","metadata":{"source":".autodoc\\docs\\markdown\\tsconfig.md"}}]] \ No newline at end of file diff --git a/.autodoc/docs/data/hnswlib.index b/.autodoc/docs/data/hnswlib.index index 617b3e4..e0b61c6 100644 Binary files a/.autodoc/docs/data/hnswlib.index and b/.autodoc/docs/data/hnswlib.index differ diff --git a/.autodoc/docs/json/src/cli/commands/estimate/index.json b/.autodoc/docs/json/src/cli/commands/estimate/index.json index 01fb65d..c388997 100644 --- a/.autodoc/docs/json/src/cli/commands/estimate/index.json +++ b/.autodoc/docs/json/src/cli/commands/estimate/index.json @@ -1,7 +1,8 @@ { "fileName": "index.ts", - "filePath": "src/cli/commands/estimate/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/estimate/index.ts", - "summary": "The `estimate` function in this code file is responsible for providing an estimated cost of indexing a given repository using the AutodocRepoConfig configuration. This function is particularly useful for users who want to get an idea of the cost involved in processing their repository before actually running the process.\n\nThe function takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe main steps involved in the function are:\n\n1. Set the output path for the JSON files generated during the process.\n2. Update the spinner text to display \"Estimating cost...\".\n3. Perform a dry run of the `processRepository` function with the given configuration options. The dry run does not actually process the repository but instead returns the details of the models that would be processed.\n4. Stop the spinner once the dry run is complete.\n5. Print the details of the models obtained from the dry run using the `printModelDetails` utility function.\n6. Calculate the total estimated cost using the `totalIndexCostEstimate` utility function.\n7. Display the estimated cost in a user-friendly format using the `chalk` library.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './autodoc/estimate';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output/',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example demonstrates how a user can call the `estimate` function with a specific configuration to get an estimated cost for processing their repository.", - "questions": "1. **What is the purpose of the `estimate` function and what parameters does it accept?**\n\n The `estimate` function is used to estimate the cost of processing a repository for indexing. It accepts an `AutodocRepoConfig` object as a parameter, which contains various configuration options such as repository URL, output path, and other settings.\n\n2. **How does the `estimate` function calculate the cost estimate?**\n\n The `estimate` function performs a dry run of the `processRepository` command to get the estimated price for indexing the repository. It then uses the `totalIndexCostEstimate` function to calculate the total cost based on the returned run details.\n\n3. **What is the purpose of the `printModelDetails` function and how is it used in the `estimate` function?**\n\n The `printModelDetails` function is used to display the details of the models used in the estimation process. In the `estimate` function, it is called with the values of the `runDetails` object to print the model details before displaying the total cost estimate." + "filePath": "src\\cli\\commands\\estimate\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\estimate\\index.ts", + "summary": "The `estimate` function in this code is responsible for providing an estimated cost of processing a given repository using the Autodoc project. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe function starts by constructing the path to the JSON output directory, which will be used to store the intermediate results of the processing. It then updates the spinner text to indicate that the cost estimation is in progress.\n\nNext, the `processRepository` function is called with the provided configuration options and a `true` flag to indicate that this is a dry run. This means that the repository will not actually be processed, but the function will return the details of what would happen if it were processed. This is used to calculate the estimated cost of processing the repository.\n\nOnce the dry run is complete, the spinner is updated to show success, and the results are printed using the `printModelDetails` function. The total estimated cost is then calculated using the `totalIndexCostEstimate` function, which takes the values of the `runDetails` object as input.\n\nFinally, the estimated cost is displayed in the console using the `chalk.redBright` function to format the text in a red color. The message also includes a disclaimer that the actual cost may vary and recommends setting a limit in the user's OpenAI account to prevent unexpected charges.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './path/to/this/file';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example would estimate the cost of processing the \"my-repo\" repository with the specified configuration options.", + "questions": "1. **What is the purpose of the `estimate` function?**\n\n The `estimate` function is used to perform a dry run of the `processRepository` command to get an estimated price for indexing the given repository. It then prints the model details and the total estimated cost.\n\n2. **What are the parameters passed to the `processRepository` function?**\n\n The `processRepository` function is called with an object containing the following properties: `name`, `repositoryUrl`, `root`, `output`, `llms`, `ignore`, `filePrompt`, `folderPrompt`, `chatPrompt`, `contentType`, `targetAudience`, and `linkHosted`. Additionally, a second argument `true` is passed to indicate that it's a dry run.\n\n3. **How is the total estimated cost calculated and displayed?**\n\n The total estimated cost is calculated using the `totalIndexCostEstimate` function, which takes an array of values from the `runDetails` object. The cost is then displayed using `console.log` with `chalk.redBright` for formatting, showing the cost with two decimal places and a note that the actual cost may vary.", + "checksum": "2b0b3903432ae423bbc597d04b052ecb" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/commands/estimate/summary.json b/.autodoc/docs/json/src/cli/commands/estimate/summary.json index 2a640a1..e88bd06 100644 --- a/.autodoc/docs/json/src/cli/commands/estimate/summary.json +++ b/.autodoc/docs/json/src/cli/commands/estimate/summary.json @@ -1,17 +1,19 @@ { "folderName": "estimate", - "folderPath": ".autodoc/docs/json/src/cli/commands/estimate", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/estimate", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\estimate", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\estimate", "files": [ { "fileName": "index.ts", - "filePath": "src/cli/commands/estimate/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/estimate/index.ts", - "summary": "The `estimate` function in this code file is responsible for providing an estimated cost of indexing a given repository using the AutodocRepoConfig configuration. This function is particularly useful for users who want to get an idea of the cost involved in processing their repository before actually running the process.\n\nThe function takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe main steps involved in the function are:\n\n1. Set the output path for the JSON files generated during the process.\n2. Update the spinner text to display \"Estimating cost...\".\n3. Perform a dry run of the `processRepository` function with the given configuration options. The dry run does not actually process the repository but instead returns the details of the models that would be processed.\n4. Stop the spinner once the dry run is complete.\n5. Print the details of the models obtained from the dry run using the `printModelDetails` utility function.\n6. Calculate the total estimated cost using the `totalIndexCostEstimate` utility function.\n7. Display the estimated cost in a user-friendly format using the `chalk` library.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './autodoc/estimate';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output/',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example demonstrates how a user can call the `estimate` function with a specific configuration to get an estimated cost for processing their repository.", - "questions": "1. **What is the purpose of the `estimate` function and what parameters does it accept?**\n\n The `estimate` function is used to estimate the cost of processing a repository for indexing. It accepts an `AutodocRepoConfig` object as a parameter, which contains various configuration options such as repository URL, output path, and other settings.\n\n2. **How does the `estimate` function calculate the cost estimate?**\n\n The `estimate` function performs a dry run of the `processRepository` command to get the estimated price for indexing the repository. It then uses the `totalIndexCostEstimate` function to calculate the total cost based on the returned run details.\n\n3. **What is the purpose of the `printModelDetails` function and how is it used in the `estimate` function?**\n\n The `printModelDetails` function is used to display the details of the models used in the estimation process. In the `estimate` function, it is called with the values of the `runDetails` object to print the model details before displaying the total cost estimate." + "filePath": "src\\cli\\commands\\estimate\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\estimate\\index.ts", + "summary": "The `estimate` function in this code is responsible for providing an estimated cost of processing a given repository using the Autodoc project. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe function starts by constructing the path to the JSON output directory, which will be used to store the intermediate results of the processing. It then updates the spinner text to indicate that the cost estimation is in progress.\n\nNext, the `processRepository` function is called with the provided configuration options and a `true` flag to indicate that this is a dry run. This means that the repository will not actually be processed, but the function will return the details of what would happen if it were processed. This is used to calculate the estimated cost of processing the repository.\n\nOnce the dry run is complete, the spinner is updated to show success, and the results are printed using the `printModelDetails` function. The total estimated cost is then calculated using the `totalIndexCostEstimate` function, which takes the values of the `runDetails` object as input.\n\nFinally, the estimated cost is displayed in the console using the `chalk.redBright` function to format the text in a red color. The message also includes a disclaimer that the actual cost may vary and recommends setting a limit in the user's OpenAI account to prevent unexpected charges.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './path/to/this/file';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example would estimate the cost of processing the \"my-repo\" repository with the specified configuration options.", + "questions": "1. **What is the purpose of the `estimate` function?**\n\n The `estimate` function is used to perform a dry run of the `processRepository` command to get an estimated price for indexing the given repository. It then prints the model details and the total estimated cost.\n\n2. **What are the parameters passed to the `processRepository` function?**\n\n The `processRepository` function is called with an object containing the following properties: `name`, `repositoryUrl`, `root`, `output`, `llms`, `ignore`, `filePrompt`, `folderPrompt`, `chatPrompt`, `contentType`, `targetAudience`, and `linkHosted`. Additionally, a second argument `true` is passed to indicate that it's a dry run.\n\n3. **How is the total estimated cost calculated and displayed?**\n\n The total estimated cost is calculated using the `totalIndexCostEstimate` function, which takes an array of values from the `runDetails` object. The cost is then displayed using `console.log` with `chalk.redBright` for formatting, showing the cost with two decimal places and a note that the actual cost may vary.", + "checksum": "2b0b3903432ae423bbc597d04b052ecb" } ], "folders": [], - "summary": "The `estimate` function in `index.ts` is a crucial part of the Autodoc project, as it allows users to estimate the cost of indexing a given repository before actually processing it. This function takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository.\n\nThe main steps involved in the `estimate` function are:\n\n1. Setting the output path for the JSON files generated during the process.\n2. Updating the spinner text to display \"Estimating cost...\".\n3. Performing a dry run of the `processRepository` function with the given configuration options. The dry run does not actually process the repository but instead returns the details of the models that would be processed.\n4. Stopping the spinner once the dry run is complete.\n5. Printing the details of the models obtained from the dry run using the `printModelDetails` utility function.\n6. Calculating the total estimated cost using the `totalIndexCostEstimate` utility function.\n7. Displaying the estimated cost in a user-friendly format using the `chalk` library.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './autodoc/estimate';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output/',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example demonstrates how a user can call the `estimate` function with a specific configuration to get an estimated cost for processing their repository. The function is designed to work seamlessly with other parts of the Autodoc project, such as the `processRepository` function, which is responsible for the actual processing of the repository.\n\nBy providing an estimated cost upfront, the `estimate` function helps users make informed decisions about whether to proceed with the indexing process or not. This can be particularly useful for users with large repositories or those who are working within a budget. Overall, the `estimate` function is an essential tool for users looking to leverage the power of Autodoc while managing their costs effectively.", - "questions": "" + "summary": "The `estimate` function in `index.ts` is a crucial part of the Autodoc project, as it provides an estimated cost of processing a given repository. It takes an `AutodocRepoConfig` object as input, containing various configuration options such as repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe function begins by constructing the path to the JSON output directory, which stores intermediate results of the processing. It then updates the spinner text to indicate that cost estimation is in progress. The `processRepository` function is called with the provided configuration options and a `true` flag, signifying a dry run. This dry run returns the details of what would happen if the repository were processed, which is used to calculate the estimated cost.\n\nUpon completion of the dry run, the spinner is updated to show success, and the results are printed using the `printModelDetails` function. The total estimated cost is calculated using the `totalIndexCostEstimate` function, which takes the values of the `runDetails` object as input.\n\nFinally, the estimated cost is displayed in the console using the `chalk.redBright` function to format the text in red. The message also includes a disclaimer that the actual cost may vary and recommends setting a limit in the user's OpenAI account to prevent unexpected charges.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './path/to/this/file';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example would estimate the cost of processing the \"my-repo\" repository with the specified configuration options.", + "questions": "", + "checksum": "4b8fd2b2abaec4959873fc3396c414d8" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/commands/index/convertJsonToMarkdown.json b/.autodoc/docs/json/src/cli/commands/index/convertJsonToMarkdown.json index 8d9d783..e0676ac 100644 --- a/.autodoc/docs/json/src/cli/commands/index/convertJsonToMarkdown.json +++ b/.autodoc/docs/json/src/cli/commands/index/convertJsonToMarkdown.json @@ -1,7 +1,8 @@ { "fileName": "convertJsonToMarkdown.ts", - "filePath": "src/cli/commands/index/convertJsonToMarkdown.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/convertJsonToMarkdown.ts", - "summary": "The `convertJsonToMarkdown` function in this code is responsible for converting JSON files containing documentation information into Markdown files. This is done in two main steps: counting the number of files in the project and creating Markdown files for each code file in the project.\n\nFirst, the function uses the `traverseFileSystem` utility to count the number of files in the project. It takes an `AutodocRepoConfig` object as input, which contains information about the project, such as its name, root directory, output directory, and other configuration options. The `traverseFileSystem` utility is called with a `processFile` function that increments the `files` counter for each file encountered.\n\n```javascript\nawait traverseFileSystem({\n inputPath: inputRoot,\n projectName,\n processFile: () => {\n files++;\n return Promise.resolve();\n },\n ignore: [],\n filePrompt,\n folderPrompt,\n contentType,\n targetAudience,\n linkHosted,\n});\n```\n\nNext, the function defines another `processFile` function that reads the content of each JSON file, converts it to a Markdown format, and writes the output to a new Markdown file in the specified output directory. It first checks if the content exists, and if not, it returns early. It then creates the output directory if it doesn't exist, and parses the JSON content into either a `FolderSummary` or a `FileSummary` object, depending on the file name.\n\nThe function then constructs the Markdown content by including a link to the code on GitHub, the summary, and any questions if they exist. Finally, it writes the Markdown content to the output file with the `.md` extension.\n\n```javascript\nconst outputPath = getFileName(markdownFilePath, '.', '.md');\nawait fs.writeFile(outputPath, markdown, 'utf-8');\n```\n\nThe `convertJsonToMarkdown` function is then called again with the new `processFile` function to create the Markdown files for each code file in the project.\n\n```javascript\nawait traverseFileSystem({\n inputPath: inputRoot,\n projectName,\n processFile,\n ignore: [],\n filePrompt,\n folderPrompt,\n contentType,\n targetAudience,\n linkHosted,\n});\n```\n\nIn summary, this code is responsible for converting JSON files containing documentation information into Markdown files, which can be used in the larger Autodoc project to generate documentation for code repositories.", - "questions": "1. **What is the purpose of the `convertJsonToMarkdown` function?**\n\n The `convertJsonToMarkdown` function is responsible for converting JSON files containing summaries and questions about code files in a project into Markdown files. It traverses the file system, reads the JSON files, and creates corresponding Markdown files with the provided information.\n\n2. **How does the `traverseFileSystem` function work and what are its parameters?**\n\n The `traverseFileSystem` function is a utility function that recursively traverses the file system starting from a given input path. It takes an object as a parameter with properties such as `inputPath`, `projectName`, `processFile`, `ignore`, `filePrompt`, `folderPrompt`, `contentType`, `targetAudience`, and `linkHosted`. The function processes each file using the provided `processFile` callback and can be configured to ignore certain files or folders.\n\n3. **What is the purpose of the `processFile` function inside `convertJsonToMarkdown`?**\n\n The `processFile` function is a callback function that is passed to the `traverseFileSystem` function. It is responsible for reading the content of a JSON file, parsing it, and creating a corresponding Markdown file with the summary and questions. It also handles creating the output directory if it doesn't exist and writing the Markdown content to the output file." + "filePath": "src\\cli\\commands\\index\\convertJsonToMarkdown.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\convertJsonToMarkdown.ts", + "summary": "The `convertJsonToMarkdown` function in this code is responsible for converting JSON files containing documentation information into Markdown files. This function is part of the larger Autodoc project, which aims to automate the process of generating documentation for code repositories.\n\nThe function takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the project name, input and output directories, and other settings related to the documentation generation process.\n\nThe code first counts the number of files in the project by traversing the file system using the `traverseFileSystem` utility function. This is done to provide a progress update to the user via the `updateSpinnerText` function.\n\nNext, the `processFile` function is defined, which is responsible for reading the content of each JSON file, parsing it, and converting it into a Markdown format. The function checks if the file has a summary, and if so, it generates the Markdown content with a link to the code on GitHub, the summary, and any questions if present. The output Markdown file is then saved in the specified output directory.\n\nFinally, the `traverseFileSystem` function is called again, this time with the `processFile` function as an argument. This allows the code to process each JSON file in the project and convert it into a Markdown file. Once the process is complete, a success message is displayed to the user using the `spinnerSuccess` function.\n\nExample usage:\n\n```javascript\nconvertJsonToMarkdown({\n name: \"myProject\",\n root: \"./input\",\n output: \"./output\",\n filePrompt: true,\n folderPrompt: true,\n contentType: \"code\",\n targetAudience: \"developers\",\n linkHosted: \"https://github.com/user/myProject\",\n});\n```\n\nThis will convert all JSON files in the `./input` directory into Markdown files and save them in the `./output` directory.", + "questions": "1. **Question:** What is the purpose of the `convertJsonToMarkdown` function and what are the expected inputs?\n **Answer:** The `convertJsonToMarkdown` function is used to convert JSON files to Markdown files for each code file in the project. It takes an `AutodocRepoConfig` object as input, which contains various properties like projectName, root, output, filePrompt, folderPrompt, contentType, targetAudience, and linkHosted.\n\n2. **Question:** How does the `traverseFileSystem` function work and what is its role in this code?\n **Answer:** The `traverseFileSystem` function is a utility function that recursively traverses the file system, starting from the inputPath, and processes each file using the provided `processFile` function. In this code, it is used twice: first to count the number of files in the project, and then to create Markdown files for each code file in the project.\n\n3. **Question:** How are the output directories and Markdown files created, and what is the structure of the generated Markdown content?\n **Answer:** The output directories are created using the `fs.mkdir` function with the `recursive: true` option. The Markdown files are created using the `fs.writeFile` function. The structure of the generated Markdown content includes a link to view the code on GitHub, the summary, and optionally, a list of questions if they exist.", + "checksum": "79c860becf47b9882441682f0213d534" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/commands/index/createVectorStore.json b/.autodoc/docs/json/src/cli/commands/index/createVectorStore.json index ab4f379..300717a 100644 --- a/.autodoc/docs/json/src/cli/commands/index/createVectorStore.json +++ b/.autodoc/docs/json/src/cli/commands/index/createVectorStore.json @@ -1,7 +1,8 @@ { "fileName": "createVectorStore.ts", - "filePath": "src/cli/commands/index/createVectorStore.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/createVectorStore.ts", - "summary": "The code in this file is responsible for processing a directory of text files, splitting the text into chunks, and creating a vector store using the HNSWLib library and OpenAIEmbeddings.\n\nThe `processFile` function takes a file path as input and returns a Promise that resolves to a Document object. It reads the file contents and creates a Document object with the file contents as `pageContent` and the file path as metadata.\n\nThe `processDirectory` function takes a directory path as input and returns a Promise that resolves to an array of Document objects. It reads the files in the directory and calls `processFile` for each file. If a file is a directory, it calls `processDirectory` recursively. The function accumulates all the Document objects in an array and returns it.\n\nThe `RepoLoader` class extends the `BaseDocumentLoader` class and has a constructor that takes a file path as input. It has a `load` method that calls the `processDirectory` function with the file path and returns the resulting array of Document objects.\n\nThe `createVectorStore` function is an async function that takes an AutodocRepoConfig object as input, which contains the root directory and output file path. It creates a RepoLoader instance with the root directory, loads the raw documents, and splits them into chunks using the `RecursiveCharacterTextSplitter` class. It then creates a vector store using the HNSWLib library and OpenAIEmbeddings, and saves the vector store to the output file path.\n\nExample usage:\n\n```javascript\nconst config = {\n root: './data/documents',\n output: './data/vector_store',\n};\n\ncreateVectorStore(config).then(() => {\n console.log('Vector store created successfully');\n});\n```\n\nThis code snippet would process all the text files in the `./data/documents` directory, split the text into chunks, create a vector store using the HNSWLib library and OpenAIEmbeddings, and save the vector store to the `./data/vector_store` file.", - "questions": "1. **Question:** What is the purpose of the `processFile` function and how does it handle errors?\n **Answer:** The `processFile` function reads the content of a file and creates a `Document` object with the file contents and metadata. If there is an error while reading the file, it rejects the promise with the error.\n\n2. **Question:** How does the `processDirectory` function handle nested directories and files?\n **Answer:** The `processDirectory` function iterates through the files in a directory. If it encounters a subdirectory, it calls itself recursively to process the subdirectory. If it encounters a file, it processes the file using the `processFile` function and adds the resulting `Document` object to the `docs` array.\n\n3. **Question:** What is the purpose of the `createVectorStore` function and how does it use the `RepoLoader` class?\n **Answer:** The `createVectorStore` function is responsible for creating a vector store from a given repository. It uses the `RepoLoader` class to load all the documents from the repository, splits the text into chunks using the `RecursiveCharacterTextSplitter`, and then creates a vector store using the `HNSWLib.fromDocuments` method with the `OpenAIEmbeddings`. Finally, it saves the vector store to the specified output path." + "filePath": "src\\cli\\commands\\index\\createVectorStore.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\createVectorStore.ts", + "summary": "The code in this file is responsible for processing a directory of text files, splitting the text into chunks, and creating a vector store using the HNSWLib library and OpenAIEmbeddings. This vector store can be used for efficient similarity search and retrieval of documents in the larger project.\n\nThe `processFile` function reads a file's content and creates a `Document` object with the content and metadata (source file path). It returns a Promise that resolves to the created Document.\n\nThe `processDirectory` function is a recursive function that processes a directory and its subdirectories. It reads the files in the directory, and for each file, it checks if it's a directory or a regular file. If it's a directory, the function calls itself with the new directory path. If it's a file, it calls the `processFile` function to create a Document object. The function returns an array of Document objects.\n\nThe `RepoLoader` class extends the `BaseDocumentLoader` class and has a constructor that takes a file path as an argument. It has a `load` method that calls the `processDirectory` function with the given file path and returns the array of Document objects.\n\nThe `createVectorStore` function is an async function that takes an `AutodocRepoConfig` object as an argument, which contains the root directory and output file path. It creates a `RepoLoader` instance with the root directory and loads the documents using the `load` method. It then creates a `RecursiveCharacterTextSplitter` instance with a specified chunk size and chunk overlap and splits the documents into chunks. Finally, it creates a vector store using the HNSWLib library and OpenAIEmbeddings with the processed documents and saves the vector store to the output file path.\n\nExample usage:\n\n```javascript\nconst config = {\n root: './data/documents',\n output: './data/vector_store',\n};\n\ncreateVectorStore(config).then(() => {\n console.log('Vector store created successfully');\n});\n```", + "questions": "1. **Question:** What is the purpose of the `processFile` function and what does it return?\n **Answer:** The `processFile` function is an asynchronous function that reads the content of a file given its file path, creates a `Document` object with the file contents and metadata (source file path), and returns a Promise that resolves to the created `Document` object.\n\n2. **Question:** How does the `processDirectory` function work and what does it return?\n **Answer:** The `processDirectory` function is an asynchronous function that takes a directory path as input, reads all the files and subdirectories within it, and processes them recursively. It returns a Promise that resolves to an array of `Document` objects created from the files in the directory and its subdirectories.\n\n3. **Question:** What is the purpose of the `createVectorStore` function and how does it work?\n **Answer:** The `createVectorStore` function is an asynchronous function that takes an `AutodocRepoConfig` object as input, which contains the root directory path and output file path. The function loads all the documents from the root directory using the `RepoLoader`, splits the text into chunks using the `RecursiveCharacterTextSplitter`, creates a vector store from the documents using the `HNSWLib` and `OpenAIEmbeddings`, and saves the vector store to the specified output file.", + "checksum": "a3409c4340753a867c72eebef7626fb9" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/commands/index/index.json b/.autodoc/docs/json/src/cli/commands/index/index.json index c524105..46652c2 100644 --- a/.autodoc/docs/json/src/cli/commands/index/index.json +++ b/.autodoc/docs/json/src/cli/commands/index/index.json @@ -1,7 +1,8 @@ { "fileName": "index.ts", - "filePath": "src/cli/commands/index/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/index.ts", - "summary": "The code in this file is responsible for processing a given repository and generating documentation in JSON and Markdown formats, as well as creating vector files for the documentation. It exports a single function `index` that takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository.\n\nThe `index` function performs the following steps:\n\n1. Define the paths for JSON, Markdown, and data output directories within the `output` folder.\n\n2. Process the repository by traversing its files, calling the LLMS (Language Learning Management System) for each file, and creating JSON files with the results. This is done using the `processRepository` function, which takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step.\n\n3. Convert the generated JSON files into Markdown format using the `convertJsonToMarkdown` function. This function also takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step, and a success message is displayed upon completion.\n\n4. Create vector files for the generated Markdown documentation using the `createVectorStore` function. This function also takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step, and a success message is displayed upon completion.\n\nHere's an example of how this code might be used in the larger project:\n\n```javascript\nimport autodoc from './autodoc';\n\nconst config = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n root: './src',\n output: './output',\n llms: 'https://llms.example.com',\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'text',\n targetAudience: 'developers',\n linkHosted: 'https://myproject-docs.example.com',\n};\n\nautodoc.index(config);\n```\n\nThis example would process the `MyProject` repository, generate JSON and Markdown documentation, and create vector files for the documentation, all while providing progress updates through spinner text.", - "questions": "1. **What is the purpose of the `index` function in this code?**\n\n The `index` function is the main entry point for the autodoc project. It processes a given repository, converts the JSON files to markdown, and creates vector files based on the provided configuration options.\n\n2. **What are the different steps involved in processing the repository?**\n\n The processing of the repository involves three main steps: (1) traversing the repository and calling LLMS for each file to create JSON files with the results, (2) converting the JSON files to markdown files, and (3) creating vector files from the markdown files.\n\n3. **What is the role of the `AutodocRepoConfig` type?**\n\n The `AutodocRepoConfig` type is used to define the shape of the configuration object that is passed to the `index` function. It specifies the properties and their types that are required for the function to process the repository, convert JSON to markdown, and create vector files." + "filePath": "src\\cli\\commands\\index\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\index.ts", + "summary": "The code in this file is responsible for processing a given repository and generating documentation in JSON, Markdown, and vector formats. It exports a single function `index` that takes an `AutodocRepoConfig` object as its argument, which contains various configuration options for processing the repository.\n\nThe `index` function performs three main tasks:\n\n1. **Process the repository**: It traverses the repository, calls the LLMS (Language Learning Management System) for each file, and creates JSON files with the results. This is done using the `processRepository` function, which takes the same configuration options as the `index` function. The JSON files are stored in the `output/docs/json/` directory.\n\n ```javascript\n updateSpinnerText('Processing repository...');\n await processRepository({ /* configuration options */ });\n spinnerSuccess();\n ```\n\n2. **Create Markdown files**: It converts the generated JSON files into Markdown files using the `convertJsonToMarkdown` function. This function also takes the same configuration options as the `index` function. The Markdown files are stored in the `output/docs/markdown/` directory.\n\n ```javascript\n updateSpinnerText('Creating markdown files...');\n await convertJsonToMarkdown({ /* configuration options */ });\n spinnerSuccess();\n ```\n\n3. **Create vector files**: It creates vector files from the generated Markdown files using the `createVectorStore` function. This function also takes the same configuration options as the `index` function. The vector files are stored in the `output/docs/data/` directory.\n\n ```javascript\n updateSpinnerText('Create vector files...');\n await createVectorStore({ /* configuration options */ });\n spinnerSuccess();\n ```\n\nThroughout the execution of these tasks, the code uses `updateSpinnerText` and `spinnerSuccess` functions to provide visual feedback on the progress of the tasks.\n\nIn the larger project, this code would be used to automatically generate documentation for a given repository based on the provided configuration options. The generated documentation can then be used for various purposes, such as displaying it on a website or analyzing the content for specific insights.", + "questions": "1. **What does the `index` function do in this code?**\n\n The `index` function is the main entry point for the autodoc project. It takes an `AutodocRepoConfig` object as input and performs three main tasks: processing the repository and creating JSON files, converting JSON files to markdown files, and creating vector files.\n\n2. **What is the purpose of the `processRepository`, `convertJsonToMarkdown`, and `createVectorStore` functions?**\n\n The `processRepository` function traverses the repository, calls LLMS for each file, and creates JSON files with the results. The `convertJsonToMarkdown` function creates markdown files from the generated JSON files. The `createVectorStore` function creates vector files from the markdown files.\n\n3. **What are the different types of prompts (`filePrompt`, `folderPrompt`, `chatPrompt`) used for in this code?**\n\n These prompts are likely used to interact with the user during the processing of the repository. The `filePrompt` might be used to ask the user for input regarding specific files, the `folderPrompt` for input regarding folders, and the `chatPrompt` for general input or feedback during the processing.", + "checksum": "4060b1affae5a6c385cda308b3cd1750" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/commands/index/processRepository.json b/.autodoc/docs/json/src/cli/commands/index/processRepository.json index 5ff7f39..339cc2a 100644 --- a/.autodoc/docs/json/src/cli/commands/index/processRepository.json +++ b/.autodoc/docs/json/src/cli/commands/index/processRepository.json @@ -1,7 +1,8 @@ { "fileName": "processRepository.ts", - "filePath": "src/cli/commands/index/processRepository.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/processRepository.ts", - "summary": "The `processRepository` function in this code is responsible for processing a given code repository and generating summaries and questions for each file and folder within the repository. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository URL, input and output paths, language models to use, and other settings.\n\nThe function starts by initializing an `APIRateLimit` instance to limit the number of API calls made to the language models. It then defines several helper functions, such as `callLLM` for making API calls, `isModel` for checking if a given model is valid, `processFile` for processing individual files, and `processFolder` for processing folders.\n\nThe `processFile` function reads the content of a file, generates prompts for summaries and questions using the `createCodeFileSummary` and `createCodeQuestions` functions, and selects the best language model to use based on the token length of the prompts. It then calls the language model API to generate the summaries and questions, and saves the results as JSON files in the output directory.\n\nThe `processFolder` function reads the contents of a folder, filters out ignored files, and processes each file and subfolder within the folder. It then generates a summary prompt using the `folderSummaryPrompt` function and calls the language model API to generate a summary for the folder. The folder summary, along with the summaries and questions of its files and subfolders, is saved as a JSON file in the output directory.\n\nThe main part of the `processRepository` function first counts the number of files and folders in the input directory using the `filesAndFolders` function. It then processes each file and folder using the `traverseFileSystem` function, which calls the `processFile` and `processFolder` functions for each file and folder encountered. Finally, the function returns the language models used during processing.\n\nExample usage of the `processRepository` function:\n\n```javascript\nconst autodocConfig = {\n name: 'myProject',\n repositoryUrl: 'https://github.com/user/myProject',\n root: 'src',\n output: 'output',\n llms: [LLMModels.GPT3, LLMModels.GPT4],\n ignore: ['.git', 'node_modules'],\n filePrompt: 'Explain this code file',\n folderPrompt: 'Summarize this folder',\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nprocessRepository(autodocConfig).then((models) => {\n console.log('Processing complete');\n});\n```\n\nThis code would process the `src` directory of the `myProject` repository, generating summaries and questions for each file and folder, and saving the results in the `output` directory.", - "questions": "1. **Question:** What is the purpose of the `processRepository` function and what are its input parameters?\n **Answer:** The `processRepository` function is responsible for processing a code repository by generating summaries and questions for each file and folder in the project. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the project name, repository URL, input and output paths, language models, and other settings. Additionally, it accepts an optional `dryRun` parameter, which, if set to true, will not save the generated summaries and questions to disk.\n\n2. **Question:** How does the code determine the best language model to use for generating summaries and questions?\n **Answer:** The code checks the maximum token length of each available language model (GPT3, GPT4, and GPT432k) and compares it with the token length of the prompts (summary and questions). It selects the first model that can handle the maximum token length and is included in the `llms` array provided in the configuration.\n\n3. **Question:** How does the code handle traversing the file system and processing files and folders?\n **Answer:** The code uses the `traverseFileSystem` utility function to traverse the file system. It takes an object with various configuration options, including the input path, project name, and callbacks for processing files and folders. The `processFile` and `processFolder` functions are passed as callbacks to handle the processing of files and folders, respectively." + "filePath": "src\\cli\\commands\\index\\processRepository.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\processRepository.ts", + "summary": "The `processRepository` function in this code is responsible for generating summaries and questions for code files and folders in a given repository. It takes an `AutodocRepoConfig` object as input, which contains information about the project, repository URL, input and output paths, language models, and other configurations. An optional `dryRun` parameter can be provided to skip actual API calls and file writing.\n\nThe function starts by initializing the encoding and rate limit for API calls. It then defines two main helper functions: `processFile` and `processFolder`. The `processFile` function is responsible for processing individual code files. It reads the file content, calculates a checksum, and checks if reindexing is needed. If reindexing is required, it creates prompts for summaries and questions, selects the appropriate language model based on the input length, and calls the language model API to generate the summaries and questions. The results are then saved to a JSON file in the output directory.\n\nThe `processFolder` function is responsible for processing folders. It reads the folder content, calculates a checksum, and checks if reindexing is needed. If reindexing is required, it reads the summaries and questions of all files and subfolders in the folder, calls the language model API to generate a summary for the folder, and saves the result to a `summary.json` file in the folder.\n\nThe main function then counts the number of files and folders in the project and processes them using the `traverseFileSystem` utility function. It processes all files first, followed by all folders. Finally, it returns the language model usage statistics.\n\nThe `calculateChecksum` function calculates the checksum of a list of file contents, while the `reindexCheck` function checks if reindexing is needed by comparing the new and old checksums of a file or folder.", + "questions": "1. **Question:** What is the purpose of the `processRepository` function and what are its inputs and outputs?\n **Answer:** The `processRepository` function processes a given code repository, generating summaries and questions for each file and folder within the repository. It takes an `AutodocRepoConfig` object and an optional `dryRun` boolean as inputs. The function returns a `Promise` that resolves to an object containing the models used during processing.\n\n2. **Question:** How does the `calculateChecksum` function work and what is its purpose?\n **Answer:** The `calculateChecksum` function takes an array of file contents as input and calculates a checksum for each file using the MD5 hashing algorithm. It then concatenates all the checksums and calculates a final checksum using MD5 again. The purpose of this function is to generate a unique identifier for the contents of the files, which can be used to determine if the files have changed and need to be reprocessed.\n\n3. **Question:** How does the `reindexCheck` function work and when is it used?\n **Answer:** The `reindexCheck` function checks if a summary.json file exists in the given file or folder path and compares the stored checksum with the new checksum to determine if the file or folder needs to be reindexed. It is used in the `processFile` and `processFolder` functions to decide whether to regenerate summaries and questions for a file or folder based on changes in their contents.", + "checksum": "5b3ae9ffad1d4b4a22c6f7fd66bbde6f" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/commands/index/prompts.json b/.autodoc/docs/json/src/cli/commands/index/prompts.json index 3c89ae9..4b43d3c 100644 --- a/.autodoc/docs/json/src/cli/commands/index/prompts.json +++ b/.autodoc/docs/json/src/cli/commands/index/prompts.json @@ -1,7 +1,8 @@ { "fileName": "prompts.ts", - "filePath": "src/cli/commands/index/prompts.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/prompts.ts", - "summary": "The code in this file provides three functions that generate prompts for documentation experts to create summaries and answer questions about code files and folders in a project. These functions are likely used in the larger autodoc project to automate the process of generating documentation for code files and folders.\n\n1. `createCodeFileSummary`: This function takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `filePrompt`. It returns a formatted string prompt for a documentation expert to write a summary of the code file. The prompt includes the file path, project name, content type, and a custom file prompt. For example:\n\n```javascript\ncreateCodeFileSummary('src/example.js', 'autodoc', 'console.log(\"Hello, World!\");', 'JavaScript', 'Write a detailed technical explanation of what this code does.');\n```\n\n2. `createCodeQuestions`: This function takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `targetAudience`. It returns a formatted string prompt for a documentation expert to generate three questions and answers that a target audience might have about the code file. The prompt includes the file path, project name, content type, and target audience. For example:\n\n```javascript\ncreateCodeQuestions('src/example.js', 'autodoc', 'console.log(\"Hello, World!\");', 'JavaScript', 'beginner');\n```\n\n3. `folderSummaryPrompt`: This function takes six parameters: `folderPath`, `projectName`, `files`, `folders`, `contentType`, and `folderPrompt`. It returns a formatted string prompt for a documentation expert to write a summary of the folder and its contents. The prompt includes the folder path, project name, content type, a list of files and their summaries, a list of subfolders and their summaries, and a custom folder prompt. For example:\n\n```javascript\nfolderSummaryPrompt('src/', 'autodoc', [{fileName: 'example.js', summary: 'A simple example file'}], [{folderName: 'utils', summary: 'Utility functions'}], 'JavaScript', 'Write a detailed technical explanation of the folder structure and contents.');\n```\n\nThese functions can be used in the autodoc project to generate prompts for documentation experts, helping to streamline the process of creating documentation for code files and folders.", - "questions": "1. **Question:** What is the purpose of the `createCodeFileSummary` function?\n **Answer:** The `createCodeFileSummary` function generates a string template for a code file summary prompt, which includes the file path, project name, file contents, content type, and a file prompt.\n\n2. **Question:** How does the `createCodeQuestions` function differ from the `createCodeFileSummary` function?\n **Answer:** The `createCodeQuestions` function generates a string template for a code documentation prompt that asks for 3 questions and their answers, while the `createCodeFileSummary` function generates a string template for a code file summary prompt.\n\n3. **Question:** What is the purpose of the `folderSummaryPrompt` function and what parameters does it take?\n **Answer:** The `folderSummaryPrompt` function generates a string template for a folder summary prompt, which includes the folder path, project name, files, folders, content type, and a folder prompt. It takes parameters such as folderPath, projectName, files, folders, contentType, and folderPrompt." + "filePath": "src\\cli\\commands\\index\\prompts.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\prompts.ts", + "summary": "This code defines three utility functions that generate prompts for documentation experts working on a project. These functions are used to create documentation for code files and folders within a project. The generated prompts are in markdown format and include specific instructions for the documentation expert.\n\n1. `createCodeFileSummary`: This function generates a prompt for creating a summary of a code file. It takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `filePrompt`. The function returns a markdown formatted string that includes the file's content and a custom prompt for the documentation expert.\n\nExample usage:\n```javascript\nconst prompt = createCodeFileSummary('path/to/file.js', 'MyProject', 'const x = 10;', 'JavaScript', 'Write a detailed technical explanation of this code.');\n```\n\n2. `createCodeQuestions`: This function generates a prompt for creating a list of questions and answers about a code file. It takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `targetAudience`. The function returns a markdown formatted string that includes the file's content and a custom prompt for the documentation expert to provide questions and answers.\n\nExample usage:\n```javascript\nconst prompt = createCodeQuestions('path/to/file.js', 'MyProject', 'const x = 10;', 'JavaScript', 'beginner');\n```\n\n3. `folderSummaryPrompt`: This function generates a prompt for creating a summary of a folder containing code files and subfolders. It takes six parameters: `folderPath`, `projectName`, `files`, `folders`, `contentType`, and `folderPrompt`. The `files` parameter is an array of `FileSummary` objects, and the `folders` parameter is an array of `FolderSummary` objects. The function returns a markdown formatted string that includes a list of files and folders with their summaries and a custom prompt for the documentation expert.\n\nExample usage:\n```javascript\nconst prompt = folderSummaryPrompt('path/to/folder', 'MyProject', fileSummaries, folderSummaries, 'JavaScript', 'Write a detailed technical explanation of this folder structure.');\n```\n\nThese functions can be used in the larger project to generate documentation tasks for experts, ensuring consistent formatting and instructions across different parts of the project.", + "questions": "1. **What is the purpose of the `createCodeFileSummary` function?**\n\n The `createCodeFileSummary` function generates a string template for a code file summary prompt, which includes the file path, project name, file contents, content type, and a file prompt.\n\n2. **How does the `createCodeQuestions` function differ from the `createCodeFileSummary` function?**\n\n The `createCodeQuestions` function generates a string template for a code documentation prompt that asks for 3 questions and their answers, while the `createCodeFileSummary` function generates a string template for a code file summary prompt.\n\n3. **What is the role of the `folderSummaryPrompt` function?**\n\n The `folderSummaryPrompt` function generates a string template for a folder summary prompt, which includes the folder path, project name, lists of files and folders with their summaries, content type, and a folder prompt.", + "checksum": "e44b82bf4912be69149685a997b6bde3" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/commands/index/summary.json b/.autodoc/docs/json/src/cli/commands/index/summary.json index 587234e..93bc582 100644 --- a/.autodoc/docs/json/src/cli/commands/index/summary.json +++ b/.autodoc/docs/json/src/cli/commands/index/summary.json @@ -1,45 +1,51 @@ { "folderName": "index", - "folderPath": ".autodoc/docs/json/src/cli/commands/index", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/index", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\index", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\index", "files": [ { "fileName": "convertJsonToMarkdown.ts", - "filePath": "src/cli/commands/index/convertJsonToMarkdown.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/convertJsonToMarkdown.ts", - "summary": "The `convertJsonToMarkdown` function in this code is responsible for converting JSON files containing documentation information into Markdown files. This is done in two main steps: counting the number of files in the project and creating Markdown files for each code file in the project.\n\nFirst, the function uses the `traverseFileSystem` utility to count the number of files in the project. It takes an `AutodocRepoConfig` object as input, which contains information about the project, such as its name, root directory, output directory, and other configuration options. The `traverseFileSystem` utility is called with a `processFile` function that increments the `files` counter for each file encountered.\n\n```javascript\nawait traverseFileSystem({\n inputPath: inputRoot,\n projectName,\n processFile: () => {\n files++;\n return Promise.resolve();\n },\n ignore: [],\n filePrompt,\n folderPrompt,\n contentType,\n targetAudience,\n linkHosted,\n});\n```\n\nNext, the function defines another `processFile` function that reads the content of each JSON file, converts it to a Markdown format, and writes the output to a new Markdown file in the specified output directory. It first checks if the content exists, and if not, it returns early. It then creates the output directory if it doesn't exist, and parses the JSON content into either a `FolderSummary` or a `FileSummary` object, depending on the file name.\n\nThe function then constructs the Markdown content by including a link to the code on GitHub, the summary, and any questions if they exist. Finally, it writes the Markdown content to the output file with the `.md` extension.\n\n```javascript\nconst outputPath = getFileName(markdownFilePath, '.', '.md');\nawait fs.writeFile(outputPath, markdown, 'utf-8');\n```\n\nThe `convertJsonToMarkdown` function is then called again with the new `processFile` function to create the Markdown files for each code file in the project.\n\n```javascript\nawait traverseFileSystem({\n inputPath: inputRoot,\n projectName,\n processFile,\n ignore: [],\n filePrompt,\n folderPrompt,\n contentType,\n targetAudience,\n linkHosted,\n});\n```\n\nIn summary, this code is responsible for converting JSON files containing documentation information into Markdown files, which can be used in the larger Autodoc project to generate documentation for code repositories.", - "questions": "1. **What is the purpose of the `convertJsonToMarkdown` function?**\n\n The `convertJsonToMarkdown` function is responsible for converting JSON files containing summaries and questions about code files in a project into Markdown files. It traverses the file system, reads the JSON files, and creates corresponding Markdown files with the provided information.\n\n2. **How does the `traverseFileSystem` function work and what are its parameters?**\n\n The `traverseFileSystem` function is a utility function that recursively traverses the file system starting from a given input path. It takes an object as a parameter with properties such as `inputPath`, `projectName`, `processFile`, `ignore`, `filePrompt`, `folderPrompt`, `contentType`, `targetAudience`, and `linkHosted`. The function processes each file using the provided `processFile` callback and can be configured to ignore certain files or folders.\n\n3. **What is the purpose of the `processFile` function inside `convertJsonToMarkdown`?**\n\n The `processFile` function is a callback function that is passed to the `traverseFileSystem` function. It is responsible for reading the content of a JSON file, parsing it, and creating a corresponding Markdown file with the summary and questions. It also handles creating the output directory if it doesn't exist and writing the Markdown content to the output file." + "filePath": "src\\cli\\commands\\index\\convertJsonToMarkdown.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\convertJsonToMarkdown.ts", + "summary": "The `convertJsonToMarkdown` function in this code is responsible for converting JSON files containing documentation information into Markdown files. This function is part of the larger Autodoc project, which aims to automate the process of generating documentation for code repositories.\n\nThe function takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the project name, input and output directories, and other settings related to the documentation generation process.\n\nThe code first counts the number of files in the project by traversing the file system using the `traverseFileSystem` utility function. This is done to provide a progress update to the user via the `updateSpinnerText` function.\n\nNext, the `processFile` function is defined, which is responsible for reading the content of each JSON file, parsing it, and converting it into a Markdown format. The function checks if the file has a summary, and if so, it generates the Markdown content with a link to the code on GitHub, the summary, and any questions if present. The output Markdown file is then saved in the specified output directory.\n\nFinally, the `traverseFileSystem` function is called again, this time with the `processFile` function as an argument. This allows the code to process each JSON file in the project and convert it into a Markdown file. Once the process is complete, a success message is displayed to the user using the `spinnerSuccess` function.\n\nExample usage:\n\n```javascript\nconvertJsonToMarkdown({\n name: \"myProject\",\n root: \"./input\",\n output: \"./output\",\n filePrompt: true,\n folderPrompt: true,\n contentType: \"code\",\n targetAudience: \"developers\",\n linkHosted: \"https://github.com/user/myProject\",\n});\n```\n\nThis will convert all JSON files in the `./input` directory into Markdown files and save them in the `./output` directory.", + "questions": "1. **Question:** What is the purpose of the `convertJsonToMarkdown` function and what are the expected inputs?\n **Answer:** The `convertJsonToMarkdown` function is used to convert JSON files to Markdown files for each code file in the project. It takes an `AutodocRepoConfig` object as input, which contains various properties like projectName, root, output, filePrompt, folderPrompt, contentType, targetAudience, and linkHosted.\n\n2. **Question:** How does the `traverseFileSystem` function work and what is its role in this code?\n **Answer:** The `traverseFileSystem` function is a utility function that recursively traverses the file system, starting from the inputPath, and processes each file using the provided `processFile` function. In this code, it is used twice: first to count the number of files in the project, and then to create Markdown files for each code file in the project.\n\n3. **Question:** How are the output directories and Markdown files created, and what is the structure of the generated Markdown content?\n **Answer:** The output directories are created using the `fs.mkdir` function with the `recursive: true` option. The Markdown files are created using the `fs.writeFile` function. The structure of the generated Markdown content includes a link to view the code on GitHub, the summary, and optionally, a list of questions if they exist.", + "checksum": "79c860becf47b9882441682f0213d534" }, { "fileName": "createVectorStore.ts", - "filePath": "src/cli/commands/index/createVectorStore.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/createVectorStore.ts", - "summary": "The code in this file is responsible for processing a directory of text files, splitting the text into chunks, and creating a vector store using the HNSWLib library and OpenAIEmbeddings.\n\nThe `processFile` function takes a file path as input and returns a Promise that resolves to a Document object. It reads the file contents and creates a Document object with the file contents as `pageContent` and the file path as metadata.\n\nThe `processDirectory` function takes a directory path as input and returns a Promise that resolves to an array of Document objects. It reads the files in the directory and calls `processFile` for each file. If a file is a directory, it calls `processDirectory` recursively. The function accumulates all the Document objects in an array and returns it.\n\nThe `RepoLoader` class extends the `BaseDocumentLoader` class and has a constructor that takes a file path as input. It has a `load` method that calls the `processDirectory` function with the file path and returns the resulting array of Document objects.\n\nThe `createVectorStore` function is an async function that takes an AutodocRepoConfig object as input, which contains the root directory and output file path. It creates a RepoLoader instance with the root directory, loads the raw documents, and splits them into chunks using the `RecursiveCharacterTextSplitter` class. It then creates a vector store using the HNSWLib library and OpenAIEmbeddings, and saves the vector store to the output file path.\n\nExample usage:\n\n```javascript\nconst config = {\n root: './data/documents',\n output: './data/vector_store',\n};\n\ncreateVectorStore(config).then(() => {\n console.log('Vector store created successfully');\n});\n```\n\nThis code snippet would process all the text files in the `./data/documents` directory, split the text into chunks, create a vector store using the HNSWLib library and OpenAIEmbeddings, and save the vector store to the `./data/vector_store` file.", - "questions": "1. **Question:** What is the purpose of the `processFile` function and how does it handle errors?\n **Answer:** The `processFile` function reads the content of a file and creates a `Document` object with the file contents and metadata. If there is an error while reading the file, it rejects the promise with the error.\n\n2. **Question:** How does the `processDirectory` function handle nested directories and files?\n **Answer:** The `processDirectory` function iterates through the files in a directory. If it encounters a subdirectory, it calls itself recursively to process the subdirectory. If it encounters a file, it processes the file using the `processFile` function and adds the resulting `Document` object to the `docs` array.\n\n3. **Question:** What is the purpose of the `createVectorStore` function and how does it use the `RepoLoader` class?\n **Answer:** The `createVectorStore` function is responsible for creating a vector store from a given repository. It uses the `RepoLoader` class to load all the documents from the repository, splits the text into chunks using the `RecursiveCharacterTextSplitter`, and then creates a vector store using the `HNSWLib.fromDocuments` method with the `OpenAIEmbeddings`. Finally, it saves the vector store to the specified output path." + "filePath": "src\\cli\\commands\\index\\createVectorStore.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\createVectorStore.ts", + "summary": "The code in this file is responsible for processing a directory of text files, splitting the text into chunks, and creating a vector store using the HNSWLib library and OpenAIEmbeddings. This vector store can be used for efficient similarity search and retrieval of documents in the larger project.\n\nThe `processFile` function reads a file's content and creates a `Document` object with the content and metadata (source file path). It returns a Promise that resolves to the created Document.\n\nThe `processDirectory` function is a recursive function that processes a directory and its subdirectories. It reads the files in the directory, and for each file, it checks if it's a directory or a regular file. If it's a directory, the function calls itself with the new directory path. If it's a file, it calls the `processFile` function to create a Document object. The function returns an array of Document objects.\n\nThe `RepoLoader` class extends the `BaseDocumentLoader` class and has a constructor that takes a file path as an argument. It has a `load` method that calls the `processDirectory` function with the given file path and returns the array of Document objects.\n\nThe `createVectorStore` function is an async function that takes an `AutodocRepoConfig` object as an argument, which contains the root directory and output file path. It creates a `RepoLoader` instance with the root directory and loads the documents using the `load` method. It then creates a `RecursiveCharacterTextSplitter` instance with a specified chunk size and chunk overlap and splits the documents into chunks. Finally, it creates a vector store using the HNSWLib library and OpenAIEmbeddings with the processed documents and saves the vector store to the output file path.\n\nExample usage:\n\n```javascript\nconst config = {\n root: './data/documents',\n output: './data/vector_store',\n};\n\ncreateVectorStore(config).then(() => {\n console.log('Vector store created successfully');\n});\n```", + "questions": "1. **Question:** What is the purpose of the `processFile` function and what does it return?\n **Answer:** The `processFile` function is an asynchronous function that reads the content of a file given its file path, creates a `Document` object with the file contents and metadata (source file path), and returns a Promise that resolves to the created `Document` object.\n\n2. **Question:** How does the `processDirectory` function work and what does it return?\n **Answer:** The `processDirectory` function is an asynchronous function that takes a directory path as input, reads all the files and subdirectories within it, and processes them recursively. It returns a Promise that resolves to an array of `Document` objects created from the files in the directory and its subdirectories.\n\n3. **Question:** What is the purpose of the `createVectorStore` function and how does it work?\n **Answer:** The `createVectorStore` function is an asynchronous function that takes an `AutodocRepoConfig` object as input, which contains the root directory path and output file path. The function loads all the documents from the root directory using the `RepoLoader`, splits the text into chunks using the `RecursiveCharacterTextSplitter`, creates a vector store from the documents using the `HNSWLib` and `OpenAIEmbeddings`, and saves the vector store to the specified output file.", + "checksum": "a3409c4340753a867c72eebef7626fb9" }, { "fileName": "index.ts", - "filePath": "src/cli/commands/index/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/index.ts", - "summary": "The code in this file is responsible for processing a given repository and generating documentation in JSON and Markdown formats, as well as creating vector files for the documentation. It exports a single function `index` that takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository.\n\nThe `index` function performs the following steps:\n\n1. Define the paths for JSON, Markdown, and data output directories within the `output` folder.\n\n2. Process the repository by traversing its files, calling the LLMS (Language Learning Management System) for each file, and creating JSON files with the results. This is done using the `processRepository` function, which takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step.\n\n3. Convert the generated JSON files into Markdown format using the `convertJsonToMarkdown` function. This function also takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step, and a success message is displayed upon completion.\n\n4. Create vector files for the generated Markdown documentation using the `createVectorStore` function. This function also takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step, and a success message is displayed upon completion.\n\nHere's an example of how this code might be used in the larger project:\n\n```javascript\nimport autodoc from './autodoc';\n\nconst config = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n root: './src',\n output: './output',\n llms: 'https://llms.example.com',\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'text',\n targetAudience: 'developers',\n linkHosted: 'https://myproject-docs.example.com',\n};\n\nautodoc.index(config);\n```\n\nThis example would process the `MyProject` repository, generate JSON and Markdown documentation, and create vector files for the documentation, all while providing progress updates through spinner text.", - "questions": "1. **What is the purpose of the `index` function in this code?**\n\n The `index` function is the main entry point for the autodoc project. It processes a given repository, converts the JSON files to markdown, and creates vector files based on the provided configuration options.\n\n2. **What are the different steps involved in processing the repository?**\n\n The processing of the repository involves three main steps: (1) traversing the repository and calling LLMS for each file to create JSON files with the results, (2) converting the JSON files to markdown files, and (3) creating vector files from the markdown files.\n\n3. **What is the role of the `AutodocRepoConfig` type?**\n\n The `AutodocRepoConfig` type is used to define the shape of the configuration object that is passed to the `index` function. It specifies the properties and their types that are required for the function to process the repository, convert JSON to markdown, and create vector files." + "filePath": "src\\cli\\commands\\index\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\index.ts", + "summary": "The code in this file is responsible for processing a given repository and generating documentation in JSON, Markdown, and vector formats. It exports a single function `index` that takes an `AutodocRepoConfig` object as its argument, which contains various configuration options for processing the repository.\n\nThe `index` function performs three main tasks:\n\n1. **Process the repository**: It traverses the repository, calls the LLMS (Language Learning Management System) for each file, and creates JSON files with the results. This is done using the `processRepository` function, which takes the same configuration options as the `index` function. The JSON files are stored in the `output/docs/json/` directory.\n\n ```javascript\n updateSpinnerText('Processing repository...');\n await processRepository({ /* configuration options */ });\n spinnerSuccess();\n ```\n\n2. **Create Markdown files**: It converts the generated JSON files into Markdown files using the `convertJsonToMarkdown` function. This function also takes the same configuration options as the `index` function. The Markdown files are stored in the `output/docs/markdown/` directory.\n\n ```javascript\n updateSpinnerText('Creating markdown files...');\n await convertJsonToMarkdown({ /* configuration options */ });\n spinnerSuccess();\n ```\n\n3. **Create vector files**: It creates vector files from the generated Markdown files using the `createVectorStore` function. This function also takes the same configuration options as the `index` function. The vector files are stored in the `output/docs/data/` directory.\n\n ```javascript\n updateSpinnerText('Create vector files...');\n await createVectorStore({ /* configuration options */ });\n spinnerSuccess();\n ```\n\nThroughout the execution of these tasks, the code uses `updateSpinnerText` and `spinnerSuccess` functions to provide visual feedback on the progress of the tasks.\n\nIn the larger project, this code would be used to automatically generate documentation for a given repository based on the provided configuration options. The generated documentation can then be used for various purposes, such as displaying it on a website or analyzing the content for specific insights.", + "questions": "1. **What does the `index` function do in this code?**\n\n The `index` function is the main entry point for the autodoc project. It takes an `AutodocRepoConfig` object as input and performs three main tasks: processing the repository and creating JSON files, converting JSON files to markdown files, and creating vector files.\n\n2. **What is the purpose of the `processRepository`, `convertJsonToMarkdown`, and `createVectorStore` functions?**\n\n The `processRepository` function traverses the repository, calls LLMS for each file, and creates JSON files with the results. The `convertJsonToMarkdown` function creates markdown files from the generated JSON files. The `createVectorStore` function creates vector files from the markdown files.\n\n3. **What are the different types of prompts (`filePrompt`, `folderPrompt`, `chatPrompt`) used for in this code?**\n\n These prompts are likely used to interact with the user during the processing of the repository. The `filePrompt` might be used to ask the user for input regarding specific files, the `folderPrompt` for input regarding folders, and the `chatPrompt` for general input or feedback during the processing.", + "checksum": "4060b1affae5a6c385cda308b3cd1750" }, { "fileName": "processRepository.ts", - "filePath": "src/cli/commands/index/processRepository.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/processRepository.ts", - "summary": "The `processRepository` function in this code is responsible for processing a given code repository and generating summaries and questions for each file and folder within the repository. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository URL, input and output paths, language models to use, and other settings.\n\nThe function starts by initializing an `APIRateLimit` instance to limit the number of API calls made to the language models. It then defines several helper functions, such as `callLLM` for making API calls, `isModel` for checking if a given model is valid, `processFile` for processing individual files, and `processFolder` for processing folders.\n\nThe `processFile` function reads the content of a file, generates prompts for summaries and questions using the `createCodeFileSummary` and `createCodeQuestions` functions, and selects the best language model to use based on the token length of the prompts. It then calls the language model API to generate the summaries and questions, and saves the results as JSON files in the output directory.\n\nThe `processFolder` function reads the contents of a folder, filters out ignored files, and processes each file and subfolder within the folder. It then generates a summary prompt using the `folderSummaryPrompt` function and calls the language model API to generate a summary for the folder. The folder summary, along with the summaries and questions of its files and subfolders, is saved as a JSON file in the output directory.\n\nThe main part of the `processRepository` function first counts the number of files and folders in the input directory using the `filesAndFolders` function. It then processes each file and folder using the `traverseFileSystem` function, which calls the `processFile` and `processFolder` functions for each file and folder encountered. Finally, the function returns the language models used during processing.\n\nExample usage of the `processRepository` function:\n\n```javascript\nconst autodocConfig = {\n name: 'myProject',\n repositoryUrl: 'https://github.com/user/myProject',\n root: 'src',\n output: 'output',\n llms: [LLMModels.GPT3, LLMModels.GPT4],\n ignore: ['.git', 'node_modules'],\n filePrompt: 'Explain this code file',\n folderPrompt: 'Summarize this folder',\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nprocessRepository(autodocConfig).then((models) => {\n console.log('Processing complete');\n});\n```\n\nThis code would process the `src` directory of the `myProject` repository, generating summaries and questions for each file and folder, and saving the results in the `output` directory.", - "questions": "1. **Question:** What is the purpose of the `processRepository` function and what are its input parameters?\n **Answer:** The `processRepository` function is responsible for processing a code repository by generating summaries and questions for each file and folder in the project. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the project name, repository URL, input and output paths, language models, and other settings. Additionally, it accepts an optional `dryRun` parameter, which, if set to true, will not save the generated summaries and questions to disk.\n\n2. **Question:** How does the code determine the best language model to use for generating summaries and questions?\n **Answer:** The code checks the maximum token length of each available language model (GPT3, GPT4, and GPT432k) and compares it with the token length of the prompts (summary and questions). It selects the first model that can handle the maximum token length and is included in the `llms` array provided in the configuration.\n\n3. **Question:** How does the code handle traversing the file system and processing files and folders?\n **Answer:** The code uses the `traverseFileSystem` utility function to traverse the file system. It takes an object with various configuration options, including the input path, project name, and callbacks for processing files and folders. The `processFile` and `processFolder` functions are passed as callbacks to handle the processing of files and folders, respectively." + "filePath": "src\\cli\\commands\\index\\processRepository.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\processRepository.ts", + "summary": "The `processRepository` function in this code is responsible for generating summaries and questions for code files and folders in a given repository. It takes an `AutodocRepoConfig` object as input, which contains information about the project, repository URL, input and output paths, language models, and other configurations. An optional `dryRun` parameter can be provided to skip actual API calls and file writing.\n\nThe function starts by initializing the encoding and rate limit for API calls. It then defines two main helper functions: `processFile` and `processFolder`. The `processFile` function is responsible for processing individual code files. It reads the file content, calculates a checksum, and checks if reindexing is needed. If reindexing is required, it creates prompts for summaries and questions, selects the appropriate language model based on the input length, and calls the language model API to generate the summaries and questions. The results are then saved to a JSON file in the output directory.\n\nThe `processFolder` function is responsible for processing folders. It reads the folder content, calculates a checksum, and checks if reindexing is needed. If reindexing is required, it reads the summaries and questions of all files and subfolders in the folder, calls the language model API to generate a summary for the folder, and saves the result to a `summary.json` file in the folder.\n\nThe main function then counts the number of files and folders in the project and processes them using the `traverseFileSystem` utility function. It processes all files first, followed by all folders. Finally, it returns the language model usage statistics.\n\nThe `calculateChecksum` function calculates the checksum of a list of file contents, while the `reindexCheck` function checks if reindexing is needed by comparing the new and old checksums of a file or folder.", + "questions": "1. **Question:** What is the purpose of the `processRepository` function and what are its inputs and outputs?\n **Answer:** The `processRepository` function processes a given code repository, generating summaries and questions for each file and folder within the repository. It takes an `AutodocRepoConfig` object and an optional `dryRun` boolean as inputs. The function returns a `Promise` that resolves to an object containing the models used during processing.\n\n2. **Question:** How does the `calculateChecksum` function work and what is its purpose?\n **Answer:** The `calculateChecksum` function takes an array of file contents as input and calculates a checksum for each file using the MD5 hashing algorithm. It then concatenates all the checksums and calculates a final checksum using MD5 again. The purpose of this function is to generate a unique identifier for the contents of the files, which can be used to determine if the files have changed and need to be reprocessed.\n\n3. **Question:** How does the `reindexCheck` function work and when is it used?\n **Answer:** The `reindexCheck` function checks if a summary.json file exists in the given file or folder path and compares the stored checksum with the new checksum to determine if the file or folder needs to be reindexed. It is used in the `processFile` and `processFolder` functions to decide whether to regenerate summaries and questions for a file or folder based on changes in their contents.", + "checksum": "5b3ae9ffad1d4b4a22c6f7fd66bbde6f" }, { "fileName": "prompts.ts", - "filePath": "src/cli/commands/index/prompts.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/prompts.ts", - "summary": "The code in this file provides three functions that generate prompts for documentation experts to create summaries and answer questions about code files and folders in a project. These functions are likely used in the larger autodoc project to automate the process of generating documentation for code files and folders.\n\n1. `createCodeFileSummary`: This function takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `filePrompt`. It returns a formatted string prompt for a documentation expert to write a summary of the code file. The prompt includes the file path, project name, content type, and a custom file prompt. For example:\n\n```javascript\ncreateCodeFileSummary('src/example.js', 'autodoc', 'console.log(\"Hello, World!\");', 'JavaScript', 'Write a detailed technical explanation of what this code does.');\n```\n\n2. `createCodeQuestions`: This function takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `targetAudience`. It returns a formatted string prompt for a documentation expert to generate three questions and answers that a target audience might have about the code file. The prompt includes the file path, project name, content type, and target audience. For example:\n\n```javascript\ncreateCodeQuestions('src/example.js', 'autodoc', 'console.log(\"Hello, World!\");', 'JavaScript', 'beginner');\n```\n\n3. `folderSummaryPrompt`: This function takes six parameters: `folderPath`, `projectName`, `files`, `folders`, `contentType`, and `folderPrompt`. It returns a formatted string prompt for a documentation expert to write a summary of the folder and its contents. The prompt includes the folder path, project name, content type, a list of files and their summaries, a list of subfolders and their summaries, and a custom folder prompt. For example:\n\n```javascript\nfolderSummaryPrompt('src/', 'autodoc', [{fileName: 'example.js', summary: 'A simple example file'}], [{folderName: 'utils', summary: 'Utility functions'}], 'JavaScript', 'Write a detailed technical explanation of the folder structure and contents.');\n```\n\nThese functions can be used in the autodoc project to generate prompts for documentation experts, helping to streamline the process of creating documentation for code files and folders.", - "questions": "1. **Question:** What is the purpose of the `createCodeFileSummary` function?\n **Answer:** The `createCodeFileSummary` function generates a string template for a code file summary prompt, which includes the file path, project name, file contents, content type, and a file prompt.\n\n2. **Question:** How does the `createCodeQuestions` function differ from the `createCodeFileSummary` function?\n **Answer:** The `createCodeQuestions` function generates a string template for a code documentation prompt that asks for 3 questions and their answers, while the `createCodeFileSummary` function generates a string template for a code file summary prompt.\n\n3. **Question:** What is the purpose of the `folderSummaryPrompt` function and what parameters does it take?\n **Answer:** The `folderSummaryPrompt` function generates a string template for a folder summary prompt, which includes the folder path, project name, files, folders, content type, and a folder prompt. It takes parameters such as folderPath, projectName, files, folders, contentType, and folderPrompt." + "filePath": "src\\cli\\commands\\index\\prompts.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\prompts.ts", + "summary": "This code defines three utility functions that generate prompts for documentation experts working on a project. These functions are used to create documentation for code files and folders within a project. The generated prompts are in markdown format and include specific instructions for the documentation expert.\n\n1. `createCodeFileSummary`: This function generates a prompt for creating a summary of a code file. It takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `filePrompt`. The function returns a markdown formatted string that includes the file's content and a custom prompt for the documentation expert.\n\nExample usage:\n```javascript\nconst prompt = createCodeFileSummary('path/to/file.js', 'MyProject', 'const x = 10;', 'JavaScript', 'Write a detailed technical explanation of this code.');\n```\n\n2. `createCodeQuestions`: This function generates a prompt for creating a list of questions and answers about a code file. It takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `targetAudience`. The function returns a markdown formatted string that includes the file's content and a custom prompt for the documentation expert to provide questions and answers.\n\nExample usage:\n```javascript\nconst prompt = createCodeQuestions('path/to/file.js', 'MyProject', 'const x = 10;', 'JavaScript', 'beginner');\n```\n\n3. `folderSummaryPrompt`: This function generates a prompt for creating a summary of a folder containing code files and subfolders. It takes six parameters: `folderPath`, `projectName`, `files`, `folders`, `contentType`, and `folderPrompt`. The `files` parameter is an array of `FileSummary` objects, and the `folders` parameter is an array of `FolderSummary` objects. The function returns a markdown formatted string that includes a list of files and folders with their summaries and a custom prompt for the documentation expert.\n\nExample usage:\n```javascript\nconst prompt = folderSummaryPrompt('path/to/folder', 'MyProject', fileSummaries, folderSummaries, 'JavaScript', 'Write a detailed technical explanation of this folder structure.');\n```\n\nThese functions can be used in the larger project to generate documentation tasks for experts, ensuring consistent formatting and instructions across different parts of the project.", + "questions": "1. **What is the purpose of the `createCodeFileSummary` function?**\n\n The `createCodeFileSummary` function generates a string template for a code file summary prompt, which includes the file path, project name, file contents, content type, and a file prompt.\n\n2. **How does the `createCodeQuestions` function differ from the `createCodeFileSummary` function?**\n\n The `createCodeQuestions` function generates a string template for a code documentation prompt that asks for 3 questions and their answers, while the `createCodeFileSummary` function generates a string template for a code file summary prompt.\n\n3. **What is the role of the `folderSummaryPrompt` function?**\n\n The `folderSummaryPrompt` function generates a string template for a folder summary prompt, which includes the folder path, project name, lists of files and folders with their summaries, content type, and a folder prompt.", + "checksum": "e44b82bf4912be69149685a997b6bde3" } ], "folders": [], - "summary": "The code in this folder is responsible for processing a given code repository, generating documentation in JSON and Markdown formats, and creating vector files for the documentation. It provides several functions and utilities to achieve these tasks, such as traversing the file system, calling language models, and converting JSON files to Markdown.\n\nFor example, the `processRepository` function processes a code repository and generates summaries and questions for each file and folder within the repository. It uses helper functions like `callLLM` to make API calls to language models and `processFile` and `processFolder` to process individual files and folders. The results are saved as JSON files in the output directory.\n\nThe `convertJsonToMarkdown` function converts JSON files containing documentation information into Markdown files. It counts the number of files in the project and creates Markdown files for each code file in the project using the `traverseFileSystem` utility.\n\nThe `createVectorStore` function processes a directory of text files, splits the text into chunks, and creates a vector store using the HNSWLib library and OpenAIEmbeddings. It processes the files in the directory and calls `processFile` for each file, creating a vector store and saving it to the output file path.\n\nHere's an example of how this code might be used in the larger project:\n\n```javascript\nimport autodoc from './autodoc';\n\nconst config = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n root: './src',\n output: './output',\n llms: 'https://llms.example.com',\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'text',\n targetAudience: 'developers',\n linkHosted: 'https://myproject-docs.example.com',\n};\n\nautodoc.index(config);\n```\n\nThis example would process the `MyProject` repository, generate JSON and Markdown documentation, and create vector files for the documentation, all while providing progress updates through spinner text.\n\nIn summary, the code in this folder plays a crucial role in the Autodoc project by processing code repositories, generating documentation in various formats, and creating vector files for the documentation. This helps developers to easily generate and maintain documentation for their projects, making it more accessible and understandable for other developers and users.", - "questions": "" + "summary": "The code in this folder is responsible for processing a given repository and generating documentation in JSON, Markdown, and vector formats. It consists of several functions and utilities that work together to automate the documentation generation process.\n\nThe main function, `index`, takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository. It performs three main tasks:\n\n1. **Process the repository**: It calls the `processRepository` function to traverse the repository, generate summaries and questions for code files and folders using the LLMS (Language Learning Management System), and create JSON files with the results. These JSON files are stored in the `output/docs/json/` directory.\n\n2. **Create Markdown files**: It uses the `convertJsonToMarkdown` function to convert the generated JSON files into Markdown files. These Markdown files are stored in the `output/docs/markdown/` directory.\n\n3. **Create vector files**: It calls the `createVectorStore` function to create vector files from the generated Markdown files. These vector files are stored in the `output/docs/data/` directory.\n\nThroughout the execution of these tasks, the code provides visual feedback on the progress of the tasks using `updateSpinnerText` and `spinnerSuccess` functions.\n\nHere's an example of how this code might be used:\n\n```javascript\nindex({\n name: \"myProject\",\n root: \"./input\",\n output: \"./output\",\n filePrompt: true,\n folderPrompt: true,\n contentType: \"code\",\n targetAudience: \"developers\",\n linkHosted: \"https://github.com/user/myProject\",\n});\n```\n\nThis will process the repository located at `./input`, generate documentation in JSON, Markdown, and vector formats, and save the results in the `./output` directory.\n\nThe `prompts.ts` file contains utility functions that generate prompts for documentation experts. These functions create markdown formatted strings with specific instructions for the documentation expert, ensuring consistent formatting and instructions across different parts of the project.\n\nIn summary, the code in this folder automates the process of generating documentation for a given repository based on the provided configuration options. The generated documentation can be used for various purposes, such as displaying it on a website or analyzing the content for specific insights.", + "questions": "", + "checksum": "376f96417f8cbea6a5ab2463268fe4af" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/commands/init/index.json b/.autodoc/docs/json/src/cli/commands/init/index.json index 884e632..9f3af03 100644 --- a/.autodoc/docs/json/src/cli/commands/init/index.json +++ b/.autodoc/docs/json/src/cli/commands/init/index.json @@ -1,7 +1,8 @@ { "fileName": "index.ts", - "filePath": "src/cli/commands/init/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/init/index.ts", - "summary": "This code is responsible for initializing and configuring the `autodoc` project. It provides a function `init` that creates a configuration file `autodoc.config.json` with user inputs and default values. The configuration file is essential for the project to function correctly and adapt to different user requirements.\n\nThe `makeConfigTemplate` function generates a default configuration object with pre-defined values. It takes an optional `config` parameter to override the default values. The returned object contains settings such as repository name, URL, output directory, LLM models, and various prompts for generating documentation.\n\nThe `init` function is an asynchronous function that takes an optional `config` parameter. It first checks if a configuration file already exists in the project directory. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf there is no existing configuration file or the user chooses to overwrite, the function prompts the user for the repository name, URL, and LLM models they have access to. These values are then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the `autodoc.config.json` file in the project directory. A success message is displayed, instructing the user to run `doc index` to get started.\n\nHere's an example of how the `init` function is used:\n\n```javascript\nimport { init } from './autodoc';\n\n(async () => {\n await init();\n})();\n```\n\nThis code imports the `init` function and calls it, initializing the `autodoc` project with the user's inputs and default values.", - "questions": "1. **Question:** What is the purpose of the `makeConfigTemplate` function and what does it return?\n **Answer:** The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc project. It takes an optional `config` parameter of type `AutodocRepoConfig` and returns a new `AutodocRepoConfig` object with default values for each property, using the provided `config` values if available.\n\n2. **Question:** How does the `init` function work and what does it do with the user's input?\n **Answer:** The `init` function is an asynchronous function that initializes the Autodoc configuration by prompting the user for input using the `inquirer` package. It takes an optional `config` parameter of type `AutodocRepoConfig` and uses it as the default values for the prompts. After collecting the user's input, it creates a new configuration object using the `makeConfigTemplate` function and writes it to a file named `autodoc.config.json`.\n\n3. **Question:** What are the different LLM models available in the `llms` prompt and how are they used in the configuration?\n **Answer:** The `llms` prompt provides three choices for the user to select the LLM models they have access to: GPT-3.5 Turbo, GPT-3.5 Turbo and GPT-4 8K (Early Access), and GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The selected LLM models are stored in the `llms` property of the `AutodocRepoConfig` object, which can be used later in the project to determine which models to use for generating documentation." + "filePath": "src\\cli\\commands\\init\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\init\\index.ts", + "summary": "This code is responsible for initializing the configuration of the Autodoc project. It provides a template for the configuration and prompts the user to input necessary information to set up the project. The main functionality is exposed through the `init` function, which is an asynchronous function that takes an optional `AutodocRepoConfig` object as an argument.\n\nThe `makeConfigTemplate` function creates a default configuration object with pre-defined values for various properties. It takes an optional `config` parameter and returns a new `AutodocRepoConfig` object with the provided values or default values if not provided.\n\nThe `init` function first checks if an `autodoc.config.json` file already exists in the project root. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nNext, the user is prompted to enter the name of their repository, the GitHub URL of their repository, and the LLMs they have access to. The LLMs are language models used for generating documentation. The user can choose between GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access).\n\nAfter the user provides the necessary information, a new configuration object is created using the `makeConfigTemplate` function with the user's input. The new configuration is then written to the `autodoc.config.json` file in the project root.\n\nFinally, a success message is displayed, instructing the user to run `doc index` to get started with the Autodoc project.\n\nExample usage:\n\n```javascript\nimport { init } from './path/to/this/file';\n\n// Initialize the configuration with default values\nawait init();\n\n// Initialize the configuration with custom values\nawait init({\n name: 'My Custom Repository',\n repositoryUrl: 'https://github.com/user/repo',\n});\n```", + "questions": "1. **What is the purpose of the `makeConfigTemplate` function?**\n\n The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc project. It takes an optional `config` parameter of type `AutodocRepoConfig` and returns a new configuration object with default values for various properties.\n\n2. **How does the `init` function work and when is it called?**\n\n The `init` function is an asynchronous function that initializes the Autodoc configuration by creating an `autodoc.config.json` file in the specified location. It takes an optional `config` parameter of type `AutodocRepoConfig` and prompts the user for input to set the configuration values. It is called when the user wants to set up the Autodoc configuration for their project.\n\n3. **What is the purpose of the `inquirer.prompt` calls in the `init` function?**\n\n The `inquirer.prompt` calls are used to interactively prompt the user for input to set the configuration values for the Autodoc project. The user is asked for the repository name, repository URL, and the LLMs they have access to. The input is then used to create a new configuration object and write it to the `autodoc.config.json` file.", + "checksum": "b93831ff1f4023ab61c3bea963a8a112" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/commands/init/summary.json b/.autodoc/docs/json/src/cli/commands/init/summary.json index 285900b..49e0a77 100644 --- a/.autodoc/docs/json/src/cli/commands/init/summary.json +++ b/.autodoc/docs/json/src/cli/commands/init/summary.json @@ -1,17 +1,19 @@ { "folderName": "init", - "folderPath": ".autodoc/docs/json/src/cli/commands/init", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/init", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\init", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\init", "files": [ { "fileName": "index.ts", - "filePath": "src/cli/commands/init/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/init/index.ts", - "summary": "This code is responsible for initializing and configuring the `autodoc` project. It provides a function `init` that creates a configuration file `autodoc.config.json` with user inputs and default values. The configuration file is essential for the project to function correctly and adapt to different user requirements.\n\nThe `makeConfigTemplate` function generates a default configuration object with pre-defined values. It takes an optional `config` parameter to override the default values. The returned object contains settings such as repository name, URL, output directory, LLM models, and various prompts for generating documentation.\n\nThe `init` function is an asynchronous function that takes an optional `config` parameter. It first checks if a configuration file already exists in the project directory. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf there is no existing configuration file or the user chooses to overwrite, the function prompts the user for the repository name, URL, and LLM models they have access to. These values are then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the `autodoc.config.json` file in the project directory. A success message is displayed, instructing the user to run `doc index` to get started.\n\nHere's an example of how the `init` function is used:\n\n```javascript\nimport { init } from './autodoc';\n\n(async () => {\n await init();\n})();\n```\n\nThis code imports the `init` function and calls it, initializing the `autodoc` project with the user's inputs and default values.", - "questions": "1. **Question:** What is the purpose of the `makeConfigTemplate` function and what does it return?\n **Answer:** The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc project. It takes an optional `config` parameter of type `AutodocRepoConfig` and returns a new `AutodocRepoConfig` object with default values for each property, using the provided `config` values if available.\n\n2. **Question:** How does the `init` function work and what does it do with the user's input?\n **Answer:** The `init` function is an asynchronous function that initializes the Autodoc configuration by prompting the user for input using the `inquirer` package. It takes an optional `config` parameter of type `AutodocRepoConfig` and uses it as the default values for the prompts. After collecting the user's input, it creates a new configuration object using the `makeConfigTemplate` function and writes it to a file named `autodoc.config.json`.\n\n3. **Question:** What are the different LLM models available in the `llms` prompt and how are they used in the configuration?\n **Answer:** The `llms` prompt provides three choices for the user to select the LLM models they have access to: GPT-3.5 Turbo, GPT-3.5 Turbo and GPT-4 8K (Early Access), and GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The selected LLM models are stored in the `llms` property of the `AutodocRepoConfig` object, which can be used later in the project to determine which models to use for generating documentation." + "filePath": "src\\cli\\commands\\init\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\init\\index.ts", + "summary": "This code is responsible for initializing the configuration of the Autodoc project. It provides a template for the configuration and prompts the user to input necessary information to set up the project. The main functionality is exposed through the `init` function, which is an asynchronous function that takes an optional `AutodocRepoConfig` object as an argument.\n\nThe `makeConfigTemplate` function creates a default configuration object with pre-defined values for various properties. It takes an optional `config` parameter and returns a new `AutodocRepoConfig` object with the provided values or default values if not provided.\n\nThe `init` function first checks if an `autodoc.config.json` file already exists in the project root. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nNext, the user is prompted to enter the name of their repository, the GitHub URL of their repository, and the LLMs they have access to. The LLMs are language models used for generating documentation. The user can choose between GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access).\n\nAfter the user provides the necessary information, a new configuration object is created using the `makeConfigTemplate` function with the user's input. The new configuration is then written to the `autodoc.config.json` file in the project root.\n\nFinally, a success message is displayed, instructing the user to run `doc index` to get started with the Autodoc project.\n\nExample usage:\n\n```javascript\nimport { init } from './path/to/this/file';\n\n// Initialize the configuration with default values\nawait init();\n\n// Initialize the configuration with custom values\nawait init({\n name: 'My Custom Repository',\n repositoryUrl: 'https://github.com/user/repo',\n});\n```", + "questions": "1. **What is the purpose of the `makeConfigTemplate` function?**\n\n The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc project. It takes an optional `config` parameter of type `AutodocRepoConfig` and returns a new configuration object with default values for various properties.\n\n2. **How does the `init` function work and when is it called?**\n\n The `init` function is an asynchronous function that initializes the Autodoc configuration by creating an `autodoc.config.json` file in the specified location. It takes an optional `config` parameter of type `AutodocRepoConfig` and prompts the user for input to set the configuration values. It is called when the user wants to set up the Autodoc configuration for their project.\n\n3. **What is the purpose of the `inquirer.prompt` calls in the `init` function?**\n\n The `inquirer.prompt` calls are used to interactively prompt the user for input to set the configuration values for the Autodoc project. The user is asked for the repository name, repository URL, and the LLMs they have access to. The input is then used to create a new configuration object and write it to the `autodoc.config.json` file.", + "checksum": "b93831ff1f4023ab61c3bea963a8a112" } ], "folders": [], - "summary": "The `index.ts` file in the `init` folder is responsible for initializing and configuring the `autodoc` project. It provides an essential function called `init` that creates a configuration file named `autodoc.config.json` with user inputs and default values. This configuration file is crucial for the project to function correctly and adapt to different user requirements.\n\nThe `makeConfigTemplate` function generates a default configuration object with pre-defined values. It takes an optional `config` parameter to override the default values. The returned object contains settings such as repository name, URL, output directory, LLM models, and various prompts for generating documentation.\n\nThe `init` function is an asynchronous function that takes an optional `config` parameter. It first checks if a configuration file already exists in the project directory. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf there is no existing configuration file or the user chooses to overwrite, the function prompts the user for the repository name, URL, and LLM models they have access to. These values are then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the `autodoc.config.json` file in the project directory. A success message is displayed, instructing the user to run `doc index` to get started.\n\nHere's an example of how the `init` function is used:\n\n```javascript\nimport { init } from './autodoc';\n\n(async () => {\n await init();\n})();\n```\n\nThis code imports the `init` function and calls it, initializing the `autodoc` project with the user's inputs and default values. The `init` function is a crucial part of the project, as it sets up the necessary configuration for the project to work correctly. It interacts with other parts of the project by providing the required settings and values, ensuring that the project can adapt to different user requirements and preferences.", - "questions": "" + "summary": "The `index.ts` file in the `.autodoc\\docs\\json\\src\\cli\\commands\\init` folder is responsible for initializing the configuration of the Autodoc project. It provides a template for the configuration and prompts the user to input necessary information to set up the project. The main functionality is exposed through the `init` function, which is an asynchronous function that takes an optional `AutodocRepoConfig` object as an argument.\n\nThe `makeConfigTemplate` function creates a default configuration object with pre-defined values for various properties. It takes an optional `config` parameter and returns a new `AutodocRepoConfig` object with the provided values or default values if not provided.\n\nThe `init` function first checks if an `autodoc.config.json` file already exists in the project root. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nNext, the user is prompted to enter the name of their repository, the GitHub URL of their repository, and the LLMs they have access to. The LLMs are language models used for generating documentation. The user can choose between GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access).\n\nAfter the user provides the necessary information, a new configuration object is created using the `makeConfigTemplate` function with the user's input. The new configuration is then written to the `autodoc.config.json` file in the project root.\n\nFinally, a success message is displayed, instructing the user to run `doc index` to get started with the Autodoc project.\n\nExample usage:\n\n```javascript\nimport { init } from './path/to/this/file';\n\n// Initialize the configuration with default values\nawait init();\n\n// Initialize the configuration with custom values\nawait init({\n name: 'My Custom Repository',\n repositoryUrl: 'https://github.com/user/repo',\n});\n```\n\nThis code is essential for setting up the Autodoc project, as it creates the necessary configuration file and gathers user input to customize the project. It works in conjunction with other parts of the project, such as the CLI and the documentation generation process, which rely on the configuration file to function correctly.", + "questions": "", + "checksum": "4b8fd2b2abaec4959873fc3396c414d8" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/commands/query/createChatChain.json b/.autodoc/docs/json/src/cli/commands/query/createChatChain.json index 82bf9d0..f9e4f02 100644 --- a/.autodoc/docs/json/src/cli/commands/query/createChatChain.json +++ b/.autodoc/docs/json/src/cli/commands/query/createChatChain.json @@ -1,7 +1,8 @@ { "fileName": "createChatChain.ts", - "filePath": "src/cli/commands/query/createChatChain.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/query/createChatChain.ts", - "summary": "This code defines a function `makeChain` that creates a chatbot for answering questions about a software project. The chatbot is built using the `ChatVectorDBQAChain` class, which combines two separate language models: a question generator and a document chain.\n\nThe question generator is an instance of the `LLMChain` class, which uses the OpenAIChat API to generate standalone questions based on a given conversation history. The `CONDENSE_PROMPT` template is used to format the input for the language model.\n\nThe document chain is created using the `loadQAChain` function, which takes an instance of the OpenAIChat API and a prompt template as input. The `makeQAPrompt` function generates this template, which instructs the language model to provide a conversational answer with hyperlinks to the project's GitHub repository. The answer should be tailored to the target audience and include code examples when appropriate.\n\nThe `makeChain` function takes the following parameters:\n\n- `projectName`: The name of the software project.\n- `repositoryUrl`: The URL of the project's GitHub repository.\n- `contentType`: The type of content the chatbot is trained on (e.g., code, documentation).\n- `chatPrompt`: Additional instructions for answering questions about the content.\n- `targetAudience`: The intended audience for the chatbot's answers (e.g., developers, users).\n- `vectorstore`: An instance of the `HNSWLib` class for storing and searching vectors.\n- `llms`: An array of language models (e.g., GPT-3, GPT-4).\n- `onTokenStream`: An optional callback function to handle streaming tokens.\n\nExample usage:\n\n```javascript\nconst chatbot = makeChain(\n \"autodoc\",\n \"https://github.com/autodoc/autodoc\",\n \"code\",\n \"\",\n \"developer\",\n vectorstore,\n [gpt3, gpt4],\n (token) => console.log(token)\n);\n```\n\nThis creates a chatbot that can answer questions about the \"autodoc\" project, using the provided language models and vector store.", - "questions": "1. **Question:** What is the purpose of the `makeChain` function and what are its input parameters?\n **Answer:** The `makeChain` function is used to create a new `ChatVectorDBQAChain` instance, which is responsible for generating questions and answers based on the given input parameters. The input parameters include `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and an optional `onTokenStream` callback function.\n\n2. **Question:** What are the roles of `CONDENSE_PROMPT` and `QA_PROMPT` in the code?\n **Answer:** `CONDENSE_PROMPT` is a template for generating a standalone question from a given chat history and follow-up input. `QA_PROMPT` is a template for generating a conversational answer with hyperlinks back to GitHub, based on the given context and question. Both templates are used in the `LLMChain` and `loadQAChain` instances, respectively.\n\n3. **Question:** How does the `onTokenStream` callback function work and when is it used?\n **Answer:** The `onTokenStream` callback function is an optional parameter in the `makeChain` function. It is used to handle the streaming of tokens generated by the OpenAIChat instance. If provided, it will be called with each new token generated during the chat process, allowing developers to handle or process the tokens in real-time." + "filePath": "src\\cli\\commands\\query\\createChatChain.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\query\\createChatChain.ts", + "summary": "This code defines a function `makeChain` that creates a chatbot for answering questions about a software project called `projectName`. The chatbot is trained on the content of the project, which is located at `repositoryUrl`. The content type of the project is specified by the `contentType` parameter. The chatbot is designed to provide conversational answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate. The target audience for the chatbot is specified by the `targetAudience` parameter.\n\nThe `makeChain` function takes several parameters:\n\n- `projectName`: The name of the software project.\n- `repositoryUrl`: The URL of the project's repository.\n- `contentType`: The type of content the chatbot is trained on.\n- `chatPrompt`: Additional instructions for answering questions about the content type.\n- `targetAudience`: The intended audience for the chatbot's answers.\n- `vectorstore`: An instance of HNSWLib for efficient nearest neighbor search.\n- `llms`: An array of LLMModels, which are language models used for generating answers.\n- `onTokenStream`: An optional callback function that is called when a new token is generated by the language model.\n\nThe `makeChain` function first creates a question generator using the `LLMChain` class. This generator is responsible for rephrasing follow-up questions to be standalone questions. It uses the `CONDENSE_PROMPT` template, which is defined at the beginning of the code.\n\nNext, the function creates a `QA_PROMPT` template using the `makeQAPrompt` function. This template is used to generate answers to the questions in a conversational manner, with hyperlinks back to GitHub and code examples where appropriate.\n\nFinally, the function creates and returns a new instance of the `ChatVectorDBQAChain` class, which combines the question generator and the document chain to create a chatbot that can answer questions about the software project. The chatbot uses the `vectorstore` for efficient nearest neighbor search and the `llms` language models for generating answers. If the `onTokenStream` callback is provided, it will be called when a new token is generated by the language model.", + "questions": "1. **Question:** What is the purpose of the `makeChain` function and what are its input parameters?\n\n **Answer:** The `makeChain` function is used to create a `ChatVectorDBQAChain` instance, which is responsible for generating questions and answers based on the given input parameters. The input parameters include `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and an optional `onTokenStream` function.\n\n2. **Question:** What are the roles of `CONDENSE_PROMPT` and `QA_PROMPT` in this code?\n\n **Answer:** `CONDENSE_PROMPT` is a template for generating standalone questions from a given chat history and follow-up question. `QA_PROMPT` is a template for generating conversational answers with hyperlinks to GitHub, based on the provided context and question. Both templates are used in the `LLMChain` and `loadQAChain` instances, respectively.\n\n3. **Question:** How does the `onTokenStream` function work and when is it used?\n\n **Answer:** The `onTokenStream` function is an optional callback that can be provided to the `makeChain` function. It is used to handle the streaming of tokens generated by the OpenAIChat instance. If provided, it will be called with each new token generated during the chat process.", + "checksum": "6869048a06de62499933b14c37cddc1d" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/commands/query/index.json b/.autodoc/docs/json/src/cli/commands/query/index.json index 2b7acb4..9497900 100644 --- a/.autodoc/docs/json/src/cli/commands/query/index.json +++ b/.autodoc/docs/json/src/cli/commands/query/index.json @@ -1,7 +1,8 @@ { "fileName": "index.ts", - "filePath": "src/cli/commands/query/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/query/index.ts", - "summary": "This code defines a chatbot interface for the Autodoc project, which allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation.\n\nThe code starts by importing necessary libraries and setting up the `marked` library with a custom terminal renderer for displaying Markdown content. It then defines a `chatHistory` array to store the conversation history between the user and the chatbot.\n\nThe `displayWelcomeMessage` function is used to display a welcome message to the user when they start the chatbot. The `clearScreenAndMoveCursorToTop` function clears the terminal screen and moves the cursor to the top.\n\nThe main function, `query`, takes two arguments: `AutodocRepoConfig` and `AutodocUserConfig`. It initializes the `vectorStore` by loading pre-trained embeddings and creates a `chain` object using the `makeChain` function. This chain object is responsible for generating responses based on the user's input.\n\nThe `getQuestion` function uses the `inquirer` library to prompt the user for a question. The main loop of the chatbot starts by getting the user's question and continues until the user types 'exit'. Inside the loop, the code updates the spinner text to 'Thinking...' and calls the `chain` object with the user's question and chat history. The response is then displayed in Markdown format using the `marked` library.\n\nIf an error occurs during the process, the chatbot displays an error message and prompts the user for another question.\n\nExample usage:\n\n```javascript\nquery(repoConfig, userConfig);\n```\n\nThis chatbot interface can be used in the larger Autodoc project to help users navigate and understand the codebase more efficiently by providing a conversational interface for asking questions and receiving answers.", - "questions": "1. **What is the purpose of the `query` function and what are its input parameters?**\n\n The `query` function is used to interact with the chatbot, taking user input and providing responses based on the given codebase. It takes two input parameters: an `AutodocRepoConfig` object containing information about the repository, and an `AutodocUserConfig` object containing user-specific configuration.\n\n2. **How does the `vectorStore` work and what is its role in the code?**\n\n The `vectorStore` is an instance of HNSWLib loaded with data from the specified output directory and using OpenAIEmbeddings. It is used to store and retrieve vector representations of the codebase, which are then used by the `makeChain` function to generate responses to user questions.\n\n3. **How does the chat history work and what is its purpose?**\n\n The `chatHistory` is an array of string pairs, where each pair represents a user question and the corresponding chatbot response. It is used to store the conversation history between the user and the chatbot, allowing the chatbot to provide context-aware responses based on previous interactions." + "filePath": "src\\cli\\commands\\query\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\query\\index.ts", + "summary": "This code defines a chatbot interface for the Autodoc project, which allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a combination of the `inquirer` library for user input, `marked` and `marked-terminal` for rendering Markdown output, and the `langchain` library for handling natural language processing tasks.\n\nThe `query` function is the main entry point for the chatbot. It takes two arguments: an `AutodocRepoConfig` object containing information about the code repository, and an `AutodocUserConfig` object containing user-specific settings. The function initializes a vector store using the `HNSWLib` and `OpenAIEmbeddings` classes, and creates a chat chain using the `makeChain` function.\n\nThe chatbot interface is displayed using the `displayWelcomeMessage` function, which prints a welcome message to the console. The `getQuestion` function is used to prompt the user for a question using the `inquirer` library. The chatbot then enters a loop, where it processes the user's question, generates a response using the chat chain, and displays the response as Markdown in the terminal.\n\nIf an error occurs during the processing of a question, the chatbot will display an error message and continue to prompt the user for a new question. The loop continues until the user types 'exit', at which point the chatbot terminates.\n\nHere's an example of how the `query` function might be used:\n\n```javascript\nimport { query } from './autodoc';\n\nconst repoConfig = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n output: 'path/to/output',\n contentType: 'code',\n chatPrompt: 'Ask me anything about MyProject',\n targetAudience: 'developers',\n};\n\nconst userConfig = {\n llms: 'path/to/llms',\n};\n\nquery(repoConfig, userConfig);\n```\n\nThis example would initialize the chatbot with the specified repository and user configurations, and start the chatbot interface for the user to ask questions about the \"MyProject\" codebase.", + "questions": "1. **What is the purpose of the `query` function in this code?**\n\n The `query` function is responsible for handling user interactions with the chatbot. It takes in an AutodocRepoConfig object and an AutodocUserConfig object, sets up the necessary data structures, and then enters a loop where it prompts the user for questions, processes them, and displays the results.\n\n2. **How does the code handle rendering Markdown text in the terminal?**\n\n The code uses the `marked` library along with a custom `TerminalRenderer` to render Markdown text in the terminal. The `marked` library is configured with the custom renderer using `marked.setOptions({ renderer: new TerminalRenderer() });`.\n\n3. **What is the purpose of the `chatHistory` variable and how is it used?**\n\n The `chatHistory` variable is an array that stores the history of questions and answers in the chat session. It is used to keep track of the conversation between the user and the chatbot. When a new question is asked, the chat history is passed to the `chain.call()` function, and the new question and its corresponding answer are added to the `chatHistory` array.", + "checksum": "19807a33957666422f31136970c37245" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/commands/query/summary.json b/.autodoc/docs/json/src/cli/commands/query/summary.json index ee82d66..273457f 100644 --- a/.autodoc/docs/json/src/cli/commands/query/summary.json +++ b/.autodoc/docs/json/src/cli/commands/query/summary.json @@ -1,24 +1,27 @@ { "folderName": "query", - "folderPath": ".autodoc/docs/json/src/cli/commands/query", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/query", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\query", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\query", "files": [ { "fileName": "createChatChain.ts", - "filePath": "src/cli/commands/query/createChatChain.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/query/createChatChain.ts", - "summary": "This code defines a function `makeChain` that creates a chatbot for answering questions about a software project. The chatbot is built using the `ChatVectorDBQAChain` class, which combines two separate language models: a question generator and a document chain.\n\nThe question generator is an instance of the `LLMChain` class, which uses the OpenAIChat API to generate standalone questions based on a given conversation history. The `CONDENSE_PROMPT` template is used to format the input for the language model.\n\nThe document chain is created using the `loadQAChain` function, which takes an instance of the OpenAIChat API and a prompt template as input. The `makeQAPrompt` function generates this template, which instructs the language model to provide a conversational answer with hyperlinks to the project's GitHub repository. The answer should be tailored to the target audience and include code examples when appropriate.\n\nThe `makeChain` function takes the following parameters:\n\n- `projectName`: The name of the software project.\n- `repositoryUrl`: The URL of the project's GitHub repository.\n- `contentType`: The type of content the chatbot is trained on (e.g., code, documentation).\n- `chatPrompt`: Additional instructions for answering questions about the content.\n- `targetAudience`: The intended audience for the chatbot's answers (e.g., developers, users).\n- `vectorstore`: An instance of the `HNSWLib` class for storing and searching vectors.\n- `llms`: An array of language models (e.g., GPT-3, GPT-4).\n- `onTokenStream`: An optional callback function to handle streaming tokens.\n\nExample usage:\n\n```javascript\nconst chatbot = makeChain(\n \"autodoc\",\n \"https://github.com/autodoc/autodoc\",\n \"code\",\n \"\",\n \"developer\",\n vectorstore,\n [gpt3, gpt4],\n (token) => console.log(token)\n);\n```\n\nThis creates a chatbot that can answer questions about the \"autodoc\" project, using the provided language models and vector store.", - "questions": "1. **Question:** What is the purpose of the `makeChain` function and what are its input parameters?\n **Answer:** The `makeChain` function is used to create a new `ChatVectorDBQAChain` instance, which is responsible for generating questions and answers based on the given input parameters. The input parameters include `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and an optional `onTokenStream` callback function.\n\n2. **Question:** What are the roles of `CONDENSE_PROMPT` and `QA_PROMPT` in the code?\n **Answer:** `CONDENSE_PROMPT` is a template for generating a standalone question from a given chat history and follow-up input. `QA_PROMPT` is a template for generating a conversational answer with hyperlinks back to GitHub, based on the given context and question. Both templates are used in the `LLMChain` and `loadQAChain` instances, respectively.\n\n3. **Question:** How does the `onTokenStream` callback function work and when is it used?\n **Answer:** The `onTokenStream` callback function is an optional parameter in the `makeChain` function. It is used to handle the streaming of tokens generated by the OpenAIChat instance. If provided, it will be called with each new token generated during the chat process, allowing developers to handle or process the tokens in real-time." + "filePath": "src\\cli\\commands\\query\\createChatChain.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\query\\createChatChain.ts", + "summary": "This code defines a function `makeChain` that creates a chatbot for answering questions about a software project called `projectName`. The chatbot is trained on the content of the project, which is located at `repositoryUrl`. The content type of the project is specified by the `contentType` parameter. The chatbot is designed to provide conversational answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate. The target audience for the chatbot is specified by the `targetAudience` parameter.\n\nThe `makeChain` function takes several parameters:\n\n- `projectName`: The name of the software project.\n- `repositoryUrl`: The URL of the project's repository.\n- `contentType`: The type of content the chatbot is trained on.\n- `chatPrompt`: Additional instructions for answering questions about the content type.\n- `targetAudience`: The intended audience for the chatbot's answers.\n- `vectorstore`: An instance of HNSWLib for efficient nearest neighbor search.\n- `llms`: An array of LLMModels, which are language models used for generating answers.\n- `onTokenStream`: An optional callback function that is called when a new token is generated by the language model.\n\nThe `makeChain` function first creates a question generator using the `LLMChain` class. This generator is responsible for rephrasing follow-up questions to be standalone questions. It uses the `CONDENSE_PROMPT` template, which is defined at the beginning of the code.\n\nNext, the function creates a `QA_PROMPT` template using the `makeQAPrompt` function. This template is used to generate answers to the questions in a conversational manner, with hyperlinks back to GitHub and code examples where appropriate.\n\nFinally, the function creates and returns a new instance of the `ChatVectorDBQAChain` class, which combines the question generator and the document chain to create a chatbot that can answer questions about the software project. The chatbot uses the `vectorstore` for efficient nearest neighbor search and the `llms` language models for generating answers. If the `onTokenStream` callback is provided, it will be called when a new token is generated by the language model.", + "questions": "1. **Question:** What is the purpose of the `makeChain` function and what are its input parameters?\n\n **Answer:** The `makeChain` function is used to create a `ChatVectorDBQAChain` instance, which is responsible for generating questions and answers based on the given input parameters. The input parameters include `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and an optional `onTokenStream` function.\n\n2. **Question:** What are the roles of `CONDENSE_PROMPT` and `QA_PROMPT` in this code?\n\n **Answer:** `CONDENSE_PROMPT` is a template for generating standalone questions from a given chat history and follow-up question. `QA_PROMPT` is a template for generating conversational answers with hyperlinks to GitHub, based on the provided context and question. Both templates are used in the `LLMChain` and `loadQAChain` instances, respectively.\n\n3. **Question:** How does the `onTokenStream` function work and when is it used?\n\n **Answer:** The `onTokenStream` function is an optional callback that can be provided to the `makeChain` function. It is used to handle the streaming of tokens generated by the OpenAIChat instance. If provided, it will be called with each new token generated during the chat process.", + "checksum": "6869048a06de62499933b14c37cddc1d" }, { "fileName": "index.ts", - "filePath": "src/cli/commands/query/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/query/index.ts", - "summary": "This code defines a chatbot interface for the Autodoc project, which allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation.\n\nThe code starts by importing necessary libraries and setting up the `marked` library with a custom terminal renderer for displaying Markdown content. It then defines a `chatHistory` array to store the conversation history between the user and the chatbot.\n\nThe `displayWelcomeMessage` function is used to display a welcome message to the user when they start the chatbot. The `clearScreenAndMoveCursorToTop` function clears the terminal screen and moves the cursor to the top.\n\nThe main function, `query`, takes two arguments: `AutodocRepoConfig` and `AutodocUserConfig`. It initializes the `vectorStore` by loading pre-trained embeddings and creates a `chain` object using the `makeChain` function. This chain object is responsible for generating responses based on the user's input.\n\nThe `getQuestion` function uses the `inquirer` library to prompt the user for a question. The main loop of the chatbot starts by getting the user's question and continues until the user types 'exit'. Inside the loop, the code updates the spinner text to 'Thinking...' and calls the `chain` object with the user's question and chat history. The response is then displayed in Markdown format using the `marked` library.\n\nIf an error occurs during the process, the chatbot displays an error message and prompts the user for another question.\n\nExample usage:\n\n```javascript\nquery(repoConfig, userConfig);\n```\n\nThis chatbot interface can be used in the larger Autodoc project to help users navigate and understand the codebase more efficiently by providing a conversational interface for asking questions and receiving answers.", - "questions": "1. **What is the purpose of the `query` function and what are its input parameters?**\n\n The `query` function is used to interact with the chatbot, taking user input and providing responses based on the given codebase. It takes two input parameters: an `AutodocRepoConfig` object containing information about the repository, and an `AutodocUserConfig` object containing user-specific configuration.\n\n2. **How does the `vectorStore` work and what is its role in the code?**\n\n The `vectorStore` is an instance of HNSWLib loaded with data from the specified output directory and using OpenAIEmbeddings. It is used to store and retrieve vector representations of the codebase, which are then used by the `makeChain` function to generate responses to user questions.\n\n3. **How does the chat history work and what is its purpose?**\n\n The `chatHistory` is an array of string pairs, where each pair represents a user question and the corresponding chatbot response. It is used to store the conversation history between the user and the chatbot, allowing the chatbot to provide context-aware responses based on previous interactions." + "filePath": "src\\cli\\commands\\query\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\query\\index.ts", + "summary": "This code defines a chatbot interface for the Autodoc project, which allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a combination of the `inquirer` library for user input, `marked` and `marked-terminal` for rendering Markdown output, and the `langchain` library for handling natural language processing tasks.\n\nThe `query` function is the main entry point for the chatbot. It takes two arguments: an `AutodocRepoConfig` object containing information about the code repository, and an `AutodocUserConfig` object containing user-specific settings. The function initializes a vector store using the `HNSWLib` and `OpenAIEmbeddings` classes, and creates a chat chain using the `makeChain` function.\n\nThe chatbot interface is displayed using the `displayWelcomeMessage` function, which prints a welcome message to the console. The `getQuestion` function is used to prompt the user for a question using the `inquirer` library. The chatbot then enters a loop, where it processes the user's question, generates a response using the chat chain, and displays the response as Markdown in the terminal.\n\nIf an error occurs during the processing of a question, the chatbot will display an error message and continue to prompt the user for a new question. The loop continues until the user types 'exit', at which point the chatbot terminates.\n\nHere's an example of how the `query` function might be used:\n\n```javascript\nimport { query } from './autodoc';\n\nconst repoConfig = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n output: 'path/to/output',\n contentType: 'code',\n chatPrompt: 'Ask me anything about MyProject',\n targetAudience: 'developers',\n};\n\nconst userConfig = {\n llms: 'path/to/llms',\n};\n\nquery(repoConfig, userConfig);\n```\n\nThis example would initialize the chatbot with the specified repository and user configurations, and start the chatbot interface for the user to ask questions about the \"MyProject\" codebase.", + "questions": "1. **What is the purpose of the `query` function in this code?**\n\n The `query` function is responsible for handling user interactions with the chatbot. It takes in an AutodocRepoConfig object and an AutodocUserConfig object, sets up the necessary data structures, and then enters a loop where it prompts the user for questions, processes them, and displays the results.\n\n2. **How does the code handle rendering Markdown text in the terminal?**\n\n The code uses the `marked` library along with a custom `TerminalRenderer` to render Markdown text in the terminal. The `marked` library is configured with the custom renderer using `marked.setOptions({ renderer: new TerminalRenderer() });`.\n\n3. **What is the purpose of the `chatHistory` variable and how is it used?**\n\n The `chatHistory` variable is an array that stores the history of questions and answers in the chat session. It is used to keep track of the conversation between the user and the chatbot. When a new question is asked, the chat history is passed to the `chain.call()` function, and the new question and its corresponding answer are added to the `chatHistory` array.", + "checksum": "19807a33957666422f31136970c37245" } ], "folders": [], - "summary": "The `query` folder in the Autodoc project contains code for creating a chatbot interface that allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation.\n\nIn `createChatChain.ts`, the `makeChain` function is defined, which creates a chatbot using the `ChatVectorDBQAChain` class. This class combines two separate language models: a question generator and a document chain. The question generator is an instance of the `LLMChain` class, which uses the OpenAIChat API to generate standalone questions based on a given conversation history. The document chain is created using the `loadQAChain` function, which takes an instance of the OpenAIChat API and a prompt template as input.\n\nExample usage of `makeChain`:\n\n```javascript\nconst chatbot = makeChain(\n \"autodoc\",\n \"https://github.com/autodoc/autodoc\",\n \"code\",\n \"\",\n \"developer\",\n vectorstore,\n [gpt3, gpt4],\n (token) => console.log(token)\n);\n```\n\nIn `index.ts`, the main chatbot interface is defined. It starts by importing necessary libraries and setting up the `marked` library with a custom terminal renderer for displaying Markdown content. The main function, `query`, takes two arguments: `AutodocRepoConfig` and `AutodocUserConfig`. It initializes the `vectorStore` by loading pre-trained embeddings and creates a `chain` object using the `makeChain` function. This chain object is responsible for generating responses based on the user's input.\n\nThe main loop of the chatbot starts by getting the user's question and continues until the user types 'exit'. Inside the loop, the code updates the spinner text to 'Thinking...' and calls the `chain` object with the user's question and chat history. The response is then displayed in Markdown format using the `marked` library.\n\nExample usage of the chatbot interface:\n\n```javascript\nquery(repoConfig, userConfig);\n```\n\nThis chatbot interface can be used in the larger Autodoc project to help users navigate and understand the codebase more efficiently by providing a conversational interface for asking questions and receiving answers.", - "questions": "" + "summary": "The `query` folder in the Autodoc project contains code for creating a chatbot that can answer questions about a specific software project in a conversational manner. The chatbot is trained on the content of the project and provides answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate.\n\nThe main entry point for the chatbot is the `query` function in `index.ts`. It takes two arguments: an `AutodocRepoConfig` object containing information about the code repository, and an `AutodocUserConfig` object containing user-specific settings. The function initializes a vector store and creates a chat chain using the `makeChain` function from `createChatChain.ts`.\n\nHere's an example of how the `query` function might be used:\n\n```javascript\nimport { query } from './autodoc';\n\nconst repoConfig = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n output: 'path/to/output',\n contentType: 'code',\n chatPrompt: 'Ask me anything about MyProject',\n targetAudience: 'developers',\n};\n\nconst userConfig = {\n llms: 'path/to/llms',\n};\n\nquery(repoConfig, userConfig);\n```\n\nThis example initializes the chatbot with the specified repository and user configurations and starts the chatbot interface for the user to ask questions about the \"MyProject\" codebase.\n\nThe `createChatChain.ts` file defines the `makeChain` function, which creates a chatbot for answering questions about a software project. The chatbot is designed to provide conversational answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate. The target audience for the chatbot is specified by the `targetAudience` parameter.\n\nThe `makeChain` function takes several parameters, such as `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and `onTokenStream`. It first creates a question generator using the `LLMChain` class, then creates a `QA_PROMPT` template using the `makeQAPrompt` function, and finally creates and returns a new instance of the `ChatVectorDBQAChain` class, which combines the question generator and the document chain to create a chatbot that can answer questions about the software project.\n\nIn summary, the code in the `query` folder is responsible for creating a chatbot that can answer questions about a specific software project in a conversational manner. The chatbot uses a combination of natural language processing techniques and efficient nearest neighbor search to generate accurate and relevant answers for the user.", + "questions": "", + "checksum": "9e0d0f111bf588e2df66862dce9db288" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/commands/summary.json b/.autodoc/docs/json/src/cli/commands/summary.json index 32cd3e8..fd42bfc 100644 --- a/.autodoc/docs/json/src/cli/commands/summary.json +++ b/.autodoc/docs/json/src/cli/commands/summary.json @@ -1,130 +1,146 @@ { "folderName": "commands", - "folderPath": ".autodoc/docs/json/src/cli/commands", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands", "files": [], "folders": [ { "folderName": "estimate", - "folderPath": ".autodoc/docs/json/src/cli/commands/estimate", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/estimate", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\estimate", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\estimate", "files": [ { "fileName": "index.ts", - "filePath": "src/cli/commands/estimate/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/estimate/index.ts", - "summary": "The `estimate` function in this code file is responsible for providing an estimated cost of indexing a given repository using the AutodocRepoConfig configuration. This function is particularly useful for users who want to get an idea of the cost involved in processing their repository before actually running the process.\n\nThe function takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe main steps involved in the function are:\n\n1. Set the output path for the JSON files generated during the process.\n2. Update the spinner text to display \"Estimating cost...\".\n3. Perform a dry run of the `processRepository` function with the given configuration options. The dry run does not actually process the repository but instead returns the details of the models that would be processed.\n4. Stop the spinner once the dry run is complete.\n5. Print the details of the models obtained from the dry run using the `printModelDetails` utility function.\n6. Calculate the total estimated cost using the `totalIndexCostEstimate` utility function.\n7. Display the estimated cost in a user-friendly format using the `chalk` library.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './autodoc/estimate';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output/',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example demonstrates how a user can call the `estimate` function with a specific configuration to get an estimated cost for processing their repository.", - "questions": "1. **What is the purpose of the `estimate` function and what parameters does it accept?**\n\n The `estimate` function is used to estimate the cost of processing a repository for indexing. It accepts an `AutodocRepoConfig` object as a parameter, which contains various configuration options such as repository URL, output path, and other settings.\n\n2. **How does the `estimate` function calculate the cost estimate?**\n\n The `estimate` function performs a dry run of the `processRepository` command to get the estimated price for indexing the repository. It then uses the `totalIndexCostEstimate` function to calculate the total cost based on the returned run details.\n\n3. **What is the purpose of the `printModelDetails` function and how is it used in the `estimate` function?**\n\n The `printModelDetails` function is used to display the details of the models used in the estimation process. In the `estimate` function, it is called with the values of the `runDetails` object to print the model details before displaying the total cost estimate." + "filePath": "src\\cli\\commands\\estimate\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\estimate\\index.ts", + "summary": "The `estimate` function in this code is responsible for providing an estimated cost of processing a given repository using the Autodoc project. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe function starts by constructing the path to the JSON output directory, which will be used to store the intermediate results of the processing. It then updates the spinner text to indicate that the cost estimation is in progress.\n\nNext, the `processRepository` function is called with the provided configuration options and a `true` flag to indicate that this is a dry run. This means that the repository will not actually be processed, but the function will return the details of what would happen if it were processed. This is used to calculate the estimated cost of processing the repository.\n\nOnce the dry run is complete, the spinner is updated to show success, and the results are printed using the `printModelDetails` function. The total estimated cost is then calculated using the `totalIndexCostEstimate` function, which takes the values of the `runDetails` object as input.\n\nFinally, the estimated cost is displayed in the console using the `chalk.redBright` function to format the text in a red color. The message also includes a disclaimer that the actual cost may vary and recommends setting a limit in the user's OpenAI account to prevent unexpected charges.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './path/to/this/file';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example would estimate the cost of processing the \"my-repo\" repository with the specified configuration options.", + "questions": "1. **What is the purpose of the `estimate` function?**\n\n The `estimate` function is used to perform a dry run of the `processRepository` command to get an estimated price for indexing the given repository. It then prints the model details and the total estimated cost.\n\n2. **What are the parameters passed to the `processRepository` function?**\n\n The `processRepository` function is called with an object containing the following properties: `name`, `repositoryUrl`, `root`, `output`, `llms`, `ignore`, `filePrompt`, `folderPrompt`, `chatPrompt`, `contentType`, `targetAudience`, and `linkHosted`. Additionally, a second argument `true` is passed to indicate that it's a dry run.\n\n3. **How is the total estimated cost calculated and displayed?**\n\n The total estimated cost is calculated using the `totalIndexCostEstimate` function, which takes an array of values from the `runDetails` object. The cost is then displayed using `console.log` with `chalk.redBright` for formatting, showing the cost with two decimal places and a note that the actual cost may vary.", + "checksum": "2b0b3903432ae423bbc597d04b052ecb" } ], "folders": [], - "summary": "The `estimate` function in `index.ts` is a crucial part of the Autodoc project, as it allows users to estimate the cost of indexing a given repository before actually processing it. This function takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository.\n\nThe main steps involved in the `estimate` function are:\n\n1. Setting the output path for the JSON files generated during the process.\n2. Updating the spinner text to display \"Estimating cost...\".\n3. Performing a dry run of the `processRepository` function with the given configuration options. The dry run does not actually process the repository but instead returns the details of the models that would be processed.\n4. Stopping the spinner once the dry run is complete.\n5. Printing the details of the models obtained from the dry run using the `printModelDetails` utility function.\n6. Calculating the total estimated cost using the `totalIndexCostEstimate` utility function.\n7. Displaying the estimated cost in a user-friendly format using the `chalk` library.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './autodoc/estimate';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output/',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example demonstrates how a user can call the `estimate` function with a specific configuration to get an estimated cost for processing their repository. The function is designed to work seamlessly with other parts of the Autodoc project, such as the `processRepository` function, which is responsible for the actual processing of the repository.\n\nBy providing an estimated cost upfront, the `estimate` function helps users make informed decisions about whether to proceed with the indexing process or not. This can be particularly useful for users with large repositories or those who are working within a budget. Overall, the `estimate` function is an essential tool for users looking to leverage the power of Autodoc while managing their costs effectively.", - "questions": "" + "summary": "The `estimate` function in `index.ts` is a crucial part of the Autodoc project, as it provides an estimated cost of processing a given repository. It takes an `AutodocRepoConfig` object as input, containing various configuration options such as repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe function begins by constructing the path to the JSON output directory, which stores intermediate results of the processing. It then updates the spinner text to indicate that cost estimation is in progress. The `processRepository` function is called with the provided configuration options and a `true` flag, signifying a dry run. This dry run returns the details of what would happen if the repository were processed, which is used to calculate the estimated cost.\n\nUpon completion of the dry run, the spinner is updated to show success, and the results are printed using the `printModelDetails` function. The total estimated cost is calculated using the `totalIndexCostEstimate` function, which takes the values of the `runDetails` object as input.\n\nFinally, the estimated cost is displayed in the console using the `chalk.redBright` function to format the text in red. The message also includes a disclaimer that the actual cost may vary and recommends setting a limit in the user's OpenAI account to prevent unexpected charges.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './path/to/this/file';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example would estimate the cost of processing the \"my-repo\" repository with the specified configuration options.", + "questions": "", + "checksum": "4b8fd2b2abaec4959873fc3396c414d8" }, { "folderName": "index", - "folderPath": ".autodoc/docs/json/src/cli/commands/index", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/index", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\index", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\index", "files": [ { "fileName": "convertJsonToMarkdown.ts", - "filePath": "src/cli/commands/index/convertJsonToMarkdown.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/convertJsonToMarkdown.ts", - "summary": "The `convertJsonToMarkdown` function in this code is responsible for converting JSON files containing documentation information into Markdown files. This is done in two main steps: counting the number of files in the project and creating Markdown files for each code file in the project.\n\nFirst, the function uses the `traverseFileSystem` utility to count the number of files in the project. It takes an `AutodocRepoConfig` object as input, which contains information about the project, such as its name, root directory, output directory, and other configuration options. The `traverseFileSystem` utility is called with a `processFile` function that increments the `files` counter for each file encountered.\n\n```javascript\nawait traverseFileSystem({\n inputPath: inputRoot,\n projectName,\n processFile: () => {\n files++;\n return Promise.resolve();\n },\n ignore: [],\n filePrompt,\n folderPrompt,\n contentType,\n targetAudience,\n linkHosted,\n});\n```\n\nNext, the function defines another `processFile` function that reads the content of each JSON file, converts it to a Markdown format, and writes the output to a new Markdown file in the specified output directory. It first checks if the content exists, and if not, it returns early. It then creates the output directory if it doesn't exist, and parses the JSON content into either a `FolderSummary` or a `FileSummary` object, depending on the file name.\n\nThe function then constructs the Markdown content by including a link to the code on GitHub, the summary, and any questions if they exist. Finally, it writes the Markdown content to the output file with the `.md` extension.\n\n```javascript\nconst outputPath = getFileName(markdownFilePath, '.', '.md');\nawait fs.writeFile(outputPath, markdown, 'utf-8');\n```\n\nThe `convertJsonToMarkdown` function is then called again with the new `processFile` function to create the Markdown files for each code file in the project.\n\n```javascript\nawait traverseFileSystem({\n inputPath: inputRoot,\n projectName,\n processFile,\n ignore: [],\n filePrompt,\n folderPrompt,\n contentType,\n targetAudience,\n linkHosted,\n});\n```\n\nIn summary, this code is responsible for converting JSON files containing documentation information into Markdown files, which can be used in the larger Autodoc project to generate documentation for code repositories.", - "questions": "1. **What is the purpose of the `convertJsonToMarkdown` function?**\n\n The `convertJsonToMarkdown` function is responsible for converting JSON files containing summaries and questions about code files in a project into Markdown files. It traverses the file system, reads the JSON files, and creates corresponding Markdown files with the provided information.\n\n2. **How does the `traverseFileSystem` function work and what are its parameters?**\n\n The `traverseFileSystem` function is a utility function that recursively traverses the file system starting from a given input path. It takes an object as a parameter with properties such as `inputPath`, `projectName`, `processFile`, `ignore`, `filePrompt`, `folderPrompt`, `contentType`, `targetAudience`, and `linkHosted`. The function processes each file using the provided `processFile` callback and can be configured to ignore certain files or folders.\n\n3. **What is the purpose of the `processFile` function inside `convertJsonToMarkdown`?**\n\n The `processFile` function is a callback function that is passed to the `traverseFileSystem` function. It is responsible for reading the content of a JSON file, parsing it, and creating a corresponding Markdown file with the summary and questions. It also handles creating the output directory if it doesn't exist and writing the Markdown content to the output file." + "filePath": "src\\cli\\commands\\index\\convertJsonToMarkdown.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\convertJsonToMarkdown.ts", + "summary": "The `convertJsonToMarkdown` function in this code is responsible for converting JSON files containing documentation information into Markdown files. This function is part of the larger Autodoc project, which aims to automate the process of generating documentation for code repositories.\n\nThe function takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the project name, input and output directories, and other settings related to the documentation generation process.\n\nThe code first counts the number of files in the project by traversing the file system using the `traverseFileSystem` utility function. This is done to provide a progress update to the user via the `updateSpinnerText` function.\n\nNext, the `processFile` function is defined, which is responsible for reading the content of each JSON file, parsing it, and converting it into a Markdown format. The function checks if the file has a summary, and if so, it generates the Markdown content with a link to the code on GitHub, the summary, and any questions if present. The output Markdown file is then saved in the specified output directory.\n\nFinally, the `traverseFileSystem` function is called again, this time with the `processFile` function as an argument. This allows the code to process each JSON file in the project and convert it into a Markdown file. Once the process is complete, a success message is displayed to the user using the `spinnerSuccess` function.\n\nExample usage:\n\n```javascript\nconvertJsonToMarkdown({\n name: \"myProject\",\n root: \"./input\",\n output: \"./output\",\n filePrompt: true,\n folderPrompt: true,\n contentType: \"code\",\n targetAudience: \"developers\",\n linkHosted: \"https://github.com/user/myProject\",\n});\n```\n\nThis will convert all JSON files in the `./input` directory into Markdown files and save them in the `./output` directory.", + "questions": "1. **Question:** What is the purpose of the `convertJsonToMarkdown` function and what are the expected inputs?\n **Answer:** The `convertJsonToMarkdown` function is used to convert JSON files to Markdown files for each code file in the project. It takes an `AutodocRepoConfig` object as input, which contains various properties like projectName, root, output, filePrompt, folderPrompt, contentType, targetAudience, and linkHosted.\n\n2. **Question:** How does the `traverseFileSystem` function work and what is its role in this code?\n **Answer:** The `traverseFileSystem` function is a utility function that recursively traverses the file system, starting from the inputPath, and processes each file using the provided `processFile` function. In this code, it is used twice: first to count the number of files in the project, and then to create Markdown files for each code file in the project.\n\n3. **Question:** How are the output directories and Markdown files created, and what is the structure of the generated Markdown content?\n **Answer:** The output directories are created using the `fs.mkdir` function with the `recursive: true` option. The Markdown files are created using the `fs.writeFile` function. The structure of the generated Markdown content includes a link to view the code on GitHub, the summary, and optionally, a list of questions if they exist.", + "checksum": "79c860becf47b9882441682f0213d534" }, { "fileName": "createVectorStore.ts", - "filePath": "src/cli/commands/index/createVectorStore.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/createVectorStore.ts", - "summary": "The code in this file is responsible for processing a directory of text files, splitting the text into chunks, and creating a vector store using the HNSWLib library and OpenAIEmbeddings.\n\nThe `processFile` function takes a file path as input and returns a Promise that resolves to a Document object. It reads the file contents and creates a Document object with the file contents as `pageContent` and the file path as metadata.\n\nThe `processDirectory` function takes a directory path as input and returns a Promise that resolves to an array of Document objects. It reads the files in the directory and calls `processFile` for each file. If a file is a directory, it calls `processDirectory` recursively. The function accumulates all the Document objects in an array and returns it.\n\nThe `RepoLoader` class extends the `BaseDocumentLoader` class and has a constructor that takes a file path as input. It has a `load` method that calls the `processDirectory` function with the file path and returns the resulting array of Document objects.\n\nThe `createVectorStore` function is an async function that takes an AutodocRepoConfig object as input, which contains the root directory and output file path. It creates a RepoLoader instance with the root directory, loads the raw documents, and splits them into chunks using the `RecursiveCharacterTextSplitter` class. It then creates a vector store using the HNSWLib library and OpenAIEmbeddings, and saves the vector store to the output file path.\n\nExample usage:\n\n```javascript\nconst config = {\n root: './data/documents',\n output: './data/vector_store',\n};\n\ncreateVectorStore(config).then(() => {\n console.log('Vector store created successfully');\n});\n```\n\nThis code snippet would process all the text files in the `./data/documents` directory, split the text into chunks, create a vector store using the HNSWLib library and OpenAIEmbeddings, and save the vector store to the `./data/vector_store` file.", - "questions": "1. **Question:** What is the purpose of the `processFile` function and how does it handle errors?\n **Answer:** The `processFile` function reads the content of a file and creates a `Document` object with the file contents and metadata. If there is an error while reading the file, it rejects the promise with the error.\n\n2. **Question:** How does the `processDirectory` function handle nested directories and files?\n **Answer:** The `processDirectory` function iterates through the files in a directory. If it encounters a subdirectory, it calls itself recursively to process the subdirectory. If it encounters a file, it processes the file using the `processFile` function and adds the resulting `Document` object to the `docs` array.\n\n3. **Question:** What is the purpose of the `createVectorStore` function and how does it use the `RepoLoader` class?\n **Answer:** The `createVectorStore` function is responsible for creating a vector store from a given repository. It uses the `RepoLoader` class to load all the documents from the repository, splits the text into chunks using the `RecursiveCharacterTextSplitter`, and then creates a vector store using the `HNSWLib.fromDocuments` method with the `OpenAIEmbeddings`. Finally, it saves the vector store to the specified output path." + "filePath": "src\\cli\\commands\\index\\createVectorStore.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\createVectorStore.ts", + "summary": "The code in this file is responsible for processing a directory of text files, splitting the text into chunks, and creating a vector store using the HNSWLib library and OpenAIEmbeddings. This vector store can be used for efficient similarity search and retrieval of documents in the larger project.\n\nThe `processFile` function reads a file's content and creates a `Document` object with the content and metadata (source file path). It returns a Promise that resolves to the created Document.\n\nThe `processDirectory` function is a recursive function that processes a directory and its subdirectories. It reads the files in the directory, and for each file, it checks if it's a directory or a regular file. If it's a directory, the function calls itself with the new directory path. If it's a file, it calls the `processFile` function to create a Document object. The function returns an array of Document objects.\n\nThe `RepoLoader` class extends the `BaseDocumentLoader` class and has a constructor that takes a file path as an argument. It has a `load` method that calls the `processDirectory` function with the given file path and returns the array of Document objects.\n\nThe `createVectorStore` function is an async function that takes an `AutodocRepoConfig` object as an argument, which contains the root directory and output file path. It creates a `RepoLoader` instance with the root directory and loads the documents using the `load` method. It then creates a `RecursiveCharacterTextSplitter` instance with a specified chunk size and chunk overlap and splits the documents into chunks. Finally, it creates a vector store using the HNSWLib library and OpenAIEmbeddings with the processed documents and saves the vector store to the output file path.\n\nExample usage:\n\n```javascript\nconst config = {\n root: './data/documents',\n output: './data/vector_store',\n};\n\ncreateVectorStore(config).then(() => {\n console.log('Vector store created successfully');\n});\n```", + "questions": "1. **Question:** What is the purpose of the `processFile` function and what does it return?\n **Answer:** The `processFile` function is an asynchronous function that reads the content of a file given its file path, creates a `Document` object with the file contents and metadata (source file path), and returns a Promise that resolves to the created `Document` object.\n\n2. **Question:** How does the `processDirectory` function work and what does it return?\n **Answer:** The `processDirectory` function is an asynchronous function that takes a directory path as input, reads all the files and subdirectories within it, and processes them recursively. It returns a Promise that resolves to an array of `Document` objects created from the files in the directory and its subdirectories.\n\n3. **Question:** What is the purpose of the `createVectorStore` function and how does it work?\n **Answer:** The `createVectorStore` function is an asynchronous function that takes an `AutodocRepoConfig` object as input, which contains the root directory path and output file path. The function loads all the documents from the root directory using the `RepoLoader`, splits the text into chunks using the `RecursiveCharacterTextSplitter`, creates a vector store from the documents using the `HNSWLib` and `OpenAIEmbeddings`, and saves the vector store to the specified output file.", + "checksum": "a3409c4340753a867c72eebef7626fb9" }, { "fileName": "index.ts", - "filePath": "src/cli/commands/index/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/index.ts", - "summary": "The code in this file is responsible for processing a given repository and generating documentation in JSON and Markdown formats, as well as creating vector files for the documentation. It exports a single function `index` that takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository.\n\nThe `index` function performs the following steps:\n\n1. Define the paths for JSON, Markdown, and data output directories within the `output` folder.\n\n2. Process the repository by traversing its files, calling the LLMS (Language Learning Management System) for each file, and creating JSON files with the results. This is done using the `processRepository` function, which takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step.\n\n3. Convert the generated JSON files into Markdown format using the `convertJsonToMarkdown` function. This function also takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step, and a success message is displayed upon completion.\n\n4. Create vector files for the generated Markdown documentation using the `createVectorStore` function. This function also takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step, and a success message is displayed upon completion.\n\nHere's an example of how this code might be used in the larger project:\n\n```javascript\nimport autodoc from './autodoc';\n\nconst config = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n root: './src',\n output: './output',\n llms: 'https://llms.example.com',\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'text',\n targetAudience: 'developers',\n linkHosted: 'https://myproject-docs.example.com',\n};\n\nautodoc.index(config);\n```\n\nThis example would process the `MyProject` repository, generate JSON and Markdown documentation, and create vector files for the documentation, all while providing progress updates through spinner text.", - "questions": "1. **What is the purpose of the `index` function in this code?**\n\n The `index` function is the main entry point for the autodoc project. It processes a given repository, converts the JSON files to markdown, and creates vector files based on the provided configuration options.\n\n2. **What are the different steps involved in processing the repository?**\n\n The processing of the repository involves three main steps: (1) traversing the repository and calling LLMS for each file to create JSON files with the results, (2) converting the JSON files to markdown files, and (3) creating vector files from the markdown files.\n\n3. **What is the role of the `AutodocRepoConfig` type?**\n\n The `AutodocRepoConfig` type is used to define the shape of the configuration object that is passed to the `index` function. It specifies the properties and their types that are required for the function to process the repository, convert JSON to markdown, and create vector files." + "filePath": "src\\cli\\commands\\index\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\index.ts", + "summary": "The code in this file is responsible for processing a given repository and generating documentation in JSON, Markdown, and vector formats. It exports a single function `index` that takes an `AutodocRepoConfig` object as its argument, which contains various configuration options for processing the repository.\n\nThe `index` function performs three main tasks:\n\n1. **Process the repository**: It traverses the repository, calls the LLMS (Language Learning Management System) for each file, and creates JSON files with the results. This is done using the `processRepository` function, which takes the same configuration options as the `index` function. The JSON files are stored in the `output/docs/json/` directory.\n\n ```javascript\n updateSpinnerText('Processing repository...');\n await processRepository({ /* configuration options */ });\n spinnerSuccess();\n ```\n\n2. **Create Markdown files**: It converts the generated JSON files into Markdown files using the `convertJsonToMarkdown` function. This function also takes the same configuration options as the `index` function. The Markdown files are stored in the `output/docs/markdown/` directory.\n\n ```javascript\n updateSpinnerText('Creating markdown files...');\n await convertJsonToMarkdown({ /* configuration options */ });\n spinnerSuccess();\n ```\n\n3. **Create vector files**: It creates vector files from the generated Markdown files using the `createVectorStore` function. This function also takes the same configuration options as the `index` function. The vector files are stored in the `output/docs/data/` directory.\n\n ```javascript\n updateSpinnerText('Create vector files...');\n await createVectorStore({ /* configuration options */ });\n spinnerSuccess();\n ```\n\nThroughout the execution of these tasks, the code uses `updateSpinnerText` and `spinnerSuccess` functions to provide visual feedback on the progress of the tasks.\n\nIn the larger project, this code would be used to automatically generate documentation for a given repository based on the provided configuration options. The generated documentation can then be used for various purposes, such as displaying it on a website or analyzing the content for specific insights.", + "questions": "1. **What does the `index` function do in this code?**\n\n The `index` function is the main entry point for the autodoc project. It takes an `AutodocRepoConfig` object as input and performs three main tasks: processing the repository and creating JSON files, converting JSON files to markdown files, and creating vector files.\n\n2. **What is the purpose of the `processRepository`, `convertJsonToMarkdown`, and `createVectorStore` functions?**\n\n The `processRepository` function traverses the repository, calls LLMS for each file, and creates JSON files with the results. The `convertJsonToMarkdown` function creates markdown files from the generated JSON files. The `createVectorStore` function creates vector files from the markdown files.\n\n3. **What are the different types of prompts (`filePrompt`, `folderPrompt`, `chatPrompt`) used for in this code?**\n\n These prompts are likely used to interact with the user during the processing of the repository. The `filePrompt` might be used to ask the user for input regarding specific files, the `folderPrompt` for input regarding folders, and the `chatPrompt` for general input or feedback during the processing.", + "checksum": "4060b1affae5a6c385cda308b3cd1750" }, { "fileName": "processRepository.ts", - "filePath": "src/cli/commands/index/processRepository.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/processRepository.ts", - "summary": "The `processRepository` function in this code is responsible for processing a given code repository and generating summaries and questions for each file and folder within the repository. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository URL, input and output paths, language models to use, and other settings.\n\nThe function starts by initializing an `APIRateLimit` instance to limit the number of API calls made to the language models. It then defines several helper functions, such as `callLLM` for making API calls, `isModel` for checking if a given model is valid, `processFile` for processing individual files, and `processFolder` for processing folders.\n\nThe `processFile` function reads the content of a file, generates prompts for summaries and questions using the `createCodeFileSummary` and `createCodeQuestions` functions, and selects the best language model to use based on the token length of the prompts. It then calls the language model API to generate the summaries and questions, and saves the results as JSON files in the output directory.\n\nThe `processFolder` function reads the contents of a folder, filters out ignored files, and processes each file and subfolder within the folder. It then generates a summary prompt using the `folderSummaryPrompt` function and calls the language model API to generate a summary for the folder. The folder summary, along with the summaries and questions of its files and subfolders, is saved as a JSON file in the output directory.\n\nThe main part of the `processRepository` function first counts the number of files and folders in the input directory using the `filesAndFolders` function. It then processes each file and folder using the `traverseFileSystem` function, which calls the `processFile` and `processFolder` functions for each file and folder encountered. Finally, the function returns the language models used during processing.\n\nExample usage of the `processRepository` function:\n\n```javascript\nconst autodocConfig = {\n name: 'myProject',\n repositoryUrl: 'https://github.com/user/myProject',\n root: 'src',\n output: 'output',\n llms: [LLMModels.GPT3, LLMModels.GPT4],\n ignore: ['.git', 'node_modules'],\n filePrompt: 'Explain this code file',\n folderPrompt: 'Summarize this folder',\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nprocessRepository(autodocConfig).then((models) => {\n console.log('Processing complete');\n});\n```\n\nThis code would process the `src` directory of the `myProject` repository, generating summaries and questions for each file and folder, and saving the results in the `output` directory.", - "questions": "1. **Question:** What is the purpose of the `processRepository` function and what are its input parameters?\n **Answer:** The `processRepository` function is responsible for processing a code repository by generating summaries and questions for each file and folder in the project. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the project name, repository URL, input and output paths, language models, and other settings. Additionally, it accepts an optional `dryRun` parameter, which, if set to true, will not save the generated summaries and questions to disk.\n\n2. **Question:** How does the code determine the best language model to use for generating summaries and questions?\n **Answer:** The code checks the maximum token length of each available language model (GPT3, GPT4, and GPT432k) and compares it with the token length of the prompts (summary and questions). It selects the first model that can handle the maximum token length and is included in the `llms` array provided in the configuration.\n\n3. **Question:** How does the code handle traversing the file system and processing files and folders?\n **Answer:** The code uses the `traverseFileSystem` utility function to traverse the file system. It takes an object with various configuration options, including the input path, project name, and callbacks for processing files and folders. The `processFile` and `processFolder` functions are passed as callbacks to handle the processing of files and folders, respectively." + "filePath": "src\\cli\\commands\\index\\processRepository.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\processRepository.ts", + "summary": "The `processRepository` function in this code is responsible for generating summaries and questions for code files and folders in a given repository. It takes an `AutodocRepoConfig` object as input, which contains information about the project, repository URL, input and output paths, language models, and other configurations. An optional `dryRun` parameter can be provided to skip actual API calls and file writing.\n\nThe function starts by initializing the encoding and rate limit for API calls. It then defines two main helper functions: `processFile` and `processFolder`. The `processFile` function is responsible for processing individual code files. It reads the file content, calculates a checksum, and checks if reindexing is needed. If reindexing is required, it creates prompts for summaries and questions, selects the appropriate language model based on the input length, and calls the language model API to generate the summaries and questions. The results are then saved to a JSON file in the output directory.\n\nThe `processFolder` function is responsible for processing folders. It reads the folder content, calculates a checksum, and checks if reindexing is needed. If reindexing is required, it reads the summaries and questions of all files and subfolders in the folder, calls the language model API to generate a summary for the folder, and saves the result to a `summary.json` file in the folder.\n\nThe main function then counts the number of files and folders in the project and processes them using the `traverseFileSystem` utility function. It processes all files first, followed by all folders. Finally, it returns the language model usage statistics.\n\nThe `calculateChecksum` function calculates the checksum of a list of file contents, while the `reindexCheck` function checks if reindexing is needed by comparing the new and old checksums of a file or folder.", + "questions": "1. **Question:** What is the purpose of the `processRepository` function and what are its inputs and outputs?\n **Answer:** The `processRepository` function processes a given code repository, generating summaries and questions for each file and folder within the repository. It takes an `AutodocRepoConfig` object and an optional `dryRun` boolean as inputs. The function returns a `Promise` that resolves to an object containing the models used during processing.\n\n2. **Question:** How does the `calculateChecksum` function work and what is its purpose?\n **Answer:** The `calculateChecksum` function takes an array of file contents as input and calculates a checksum for each file using the MD5 hashing algorithm. It then concatenates all the checksums and calculates a final checksum using MD5 again. The purpose of this function is to generate a unique identifier for the contents of the files, which can be used to determine if the files have changed and need to be reprocessed.\n\n3. **Question:** How does the `reindexCheck` function work and when is it used?\n **Answer:** The `reindexCheck` function checks if a summary.json file exists in the given file or folder path and compares the stored checksum with the new checksum to determine if the file or folder needs to be reindexed. It is used in the `processFile` and `processFolder` functions to decide whether to regenerate summaries and questions for a file or folder based on changes in their contents.", + "checksum": "5b3ae9ffad1d4b4a22c6f7fd66bbde6f" }, { "fileName": "prompts.ts", - "filePath": "src/cli/commands/index/prompts.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/prompts.ts", - "summary": "The code in this file provides three functions that generate prompts for documentation experts to create summaries and answer questions about code files and folders in a project. These functions are likely used in the larger autodoc project to automate the process of generating documentation for code files and folders.\n\n1. `createCodeFileSummary`: This function takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `filePrompt`. It returns a formatted string prompt for a documentation expert to write a summary of the code file. The prompt includes the file path, project name, content type, and a custom file prompt. For example:\n\n```javascript\ncreateCodeFileSummary('src/example.js', 'autodoc', 'console.log(\"Hello, World!\");', 'JavaScript', 'Write a detailed technical explanation of what this code does.');\n```\n\n2. `createCodeQuestions`: This function takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `targetAudience`. It returns a formatted string prompt for a documentation expert to generate three questions and answers that a target audience might have about the code file. The prompt includes the file path, project name, content type, and target audience. For example:\n\n```javascript\ncreateCodeQuestions('src/example.js', 'autodoc', 'console.log(\"Hello, World!\");', 'JavaScript', 'beginner');\n```\n\n3. `folderSummaryPrompt`: This function takes six parameters: `folderPath`, `projectName`, `files`, `folders`, `contentType`, and `folderPrompt`. It returns a formatted string prompt for a documentation expert to write a summary of the folder and its contents. The prompt includes the folder path, project name, content type, a list of files and their summaries, a list of subfolders and their summaries, and a custom folder prompt. For example:\n\n```javascript\nfolderSummaryPrompt('src/', 'autodoc', [{fileName: 'example.js', summary: 'A simple example file'}], [{folderName: 'utils', summary: 'Utility functions'}], 'JavaScript', 'Write a detailed technical explanation of the folder structure and contents.');\n```\n\nThese functions can be used in the autodoc project to generate prompts for documentation experts, helping to streamline the process of creating documentation for code files and folders.", - "questions": "1. **Question:** What is the purpose of the `createCodeFileSummary` function?\n **Answer:** The `createCodeFileSummary` function generates a string template for a code file summary prompt, which includes the file path, project name, file contents, content type, and a file prompt.\n\n2. **Question:** How does the `createCodeQuestions` function differ from the `createCodeFileSummary` function?\n **Answer:** The `createCodeQuestions` function generates a string template for a code documentation prompt that asks for 3 questions and their answers, while the `createCodeFileSummary` function generates a string template for a code file summary prompt.\n\n3. **Question:** What is the purpose of the `folderSummaryPrompt` function and what parameters does it take?\n **Answer:** The `folderSummaryPrompt` function generates a string template for a folder summary prompt, which includes the folder path, project name, files, folders, content type, and a folder prompt. It takes parameters such as folderPath, projectName, files, folders, contentType, and folderPrompt." + "filePath": "src\\cli\\commands\\index\\prompts.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\prompts.ts", + "summary": "This code defines three utility functions that generate prompts for documentation experts working on a project. These functions are used to create documentation for code files and folders within a project. The generated prompts are in markdown format and include specific instructions for the documentation expert.\n\n1. `createCodeFileSummary`: This function generates a prompt for creating a summary of a code file. It takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `filePrompt`. The function returns a markdown formatted string that includes the file's content and a custom prompt for the documentation expert.\n\nExample usage:\n```javascript\nconst prompt = createCodeFileSummary('path/to/file.js', 'MyProject', 'const x = 10;', 'JavaScript', 'Write a detailed technical explanation of this code.');\n```\n\n2. `createCodeQuestions`: This function generates a prompt for creating a list of questions and answers about a code file. It takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `targetAudience`. The function returns a markdown formatted string that includes the file's content and a custom prompt for the documentation expert to provide questions and answers.\n\nExample usage:\n```javascript\nconst prompt = createCodeQuestions('path/to/file.js', 'MyProject', 'const x = 10;', 'JavaScript', 'beginner');\n```\n\n3. `folderSummaryPrompt`: This function generates a prompt for creating a summary of a folder containing code files and subfolders. It takes six parameters: `folderPath`, `projectName`, `files`, `folders`, `contentType`, and `folderPrompt`. The `files` parameter is an array of `FileSummary` objects, and the `folders` parameter is an array of `FolderSummary` objects. The function returns a markdown formatted string that includes a list of files and folders with their summaries and a custom prompt for the documentation expert.\n\nExample usage:\n```javascript\nconst prompt = folderSummaryPrompt('path/to/folder', 'MyProject', fileSummaries, folderSummaries, 'JavaScript', 'Write a detailed technical explanation of this folder structure.');\n```\n\nThese functions can be used in the larger project to generate documentation tasks for experts, ensuring consistent formatting and instructions across different parts of the project.", + "questions": "1. **What is the purpose of the `createCodeFileSummary` function?**\n\n The `createCodeFileSummary` function generates a string template for a code file summary prompt, which includes the file path, project name, file contents, content type, and a file prompt.\n\n2. **How does the `createCodeQuestions` function differ from the `createCodeFileSummary` function?**\n\n The `createCodeQuestions` function generates a string template for a code documentation prompt that asks for 3 questions and their answers, while the `createCodeFileSummary` function generates a string template for a code file summary prompt.\n\n3. **What is the role of the `folderSummaryPrompt` function?**\n\n The `folderSummaryPrompt` function generates a string template for a folder summary prompt, which includes the folder path, project name, lists of files and folders with their summaries, content type, and a folder prompt.", + "checksum": "e44b82bf4912be69149685a997b6bde3" } ], "folders": [], - "summary": "The code in this folder is responsible for processing a given code repository, generating documentation in JSON and Markdown formats, and creating vector files for the documentation. It provides several functions and utilities to achieve these tasks, such as traversing the file system, calling language models, and converting JSON files to Markdown.\n\nFor example, the `processRepository` function processes a code repository and generates summaries and questions for each file and folder within the repository. It uses helper functions like `callLLM` to make API calls to language models and `processFile` and `processFolder` to process individual files and folders. The results are saved as JSON files in the output directory.\n\nThe `convertJsonToMarkdown` function converts JSON files containing documentation information into Markdown files. It counts the number of files in the project and creates Markdown files for each code file in the project using the `traverseFileSystem` utility.\n\nThe `createVectorStore` function processes a directory of text files, splits the text into chunks, and creates a vector store using the HNSWLib library and OpenAIEmbeddings. It processes the files in the directory and calls `processFile` for each file, creating a vector store and saving it to the output file path.\n\nHere's an example of how this code might be used in the larger project:\n\n```javascript\nimport autodoc from './autodoc';\n\nconst config = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n root: './src',\n output: './output',\n llms: 'https://llms.example.com',\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'text',\n targetAudience: 'developers',\n linkHosted: 'https://myproject-docs.example.com',\n};\n\nautodoc.index(config);\n```\n\nThis example would process the `MyProject` repository, generate JSON and Markdown documentation, and create vector files for the documentation, all while providing progress updates through spinner text.\n\nIn summary, the code in this folder plays a crucial role in the Autodoc project by processing code repositories, generating documentation in various formats, and creating vector files for the documentation. This helps developers to easily generate and maintain documentation for their projects, making it more accessible and understandable for other developers and users.", - "questions": "" + "summary": "The code in this folder is responsible for processing a given repository and generating documentation in JSON, Markdown, and vector formats. It consists of several functions and utilities that work together to automate the documentation generation process.\n\nThe main function, `index`, takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository. It performs three main tasks:\n\n1. **Process the repository**: It calls the `processRepository` function to traverse the repository, generate summaries and questions for code files and folders using the LLMS (Language Learning Management System), and create JSON files with the results. These JSON files are stored in the `output/docs/json/` directory.\n\n2. **Create Markdown files**: It uses the `convertJsonToMarkdown` function to convert the generated JSON files into Markdown files. These Markdown files are stored in the `output/docs/markdown/` directory.\n\n3. **Create vector files**: It calls the `createVectorStore` function to create vector files from the generated Markdown files. These vector files are stored in the `output/docs/data/` directory.\n\nThroughout the execution of these tasks, the code provides visual feedback on the progress of the tasks using `updateSpinnerText` and `spinnerSuccess` functions.\n\nHere's an example of how this code might be used:\n\n```javascript\nindex({\n name: \"myProject\",\n root: \"./input\",\n output: \"./output\",\n filePrompt: true,\n folderPrompt: true,\n contentType: \"code\",\n targetAudience: \"developers\",\n linkHosted: \"https://github.com/user/myProject\",\n});\n```\n\nThis will process the repository located at `./input`, generate documentation in JSON, Markdown, and vector formats, and save the results in the `./output` directory.\n\nThe `prompts.ts` file contains utility functions that generate prompts for documentation experts. These functions create markdown formatted strings with specific instructions for the documentation expert, ensuring consistent formatting and instructions across different parts of the project.\n\nIn summary, the code in this folder automates the process of generating documentation for a given repository based on the provided configuration options. The generated documentation can be used for various purposes, such as displaying it on a website or analyzing the content for specific insights.", + "questions": "", + "checksum": "376f96417f8cbea6a5ab2463268fe4af" }, { "folderName": "init", - "folderPath": ".autodoc/docs/json/src/cli/commands/init", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/init", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\init", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\init", "files": [ { "fileName": "index.ts", - "filePath": "src/cli/commands/init/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/init/index.ts", - "summary": "This code is responsible for initializing and configuring the `autodoc` project. It provides a function `init` that creates a configuration file `autodoc.config.json` with user inputs and default values. The configuration file is essential for the project to function correctly and adapt to different user requirements.\n\nThe `makeConfigTemplate` function generates a default configuration object with pre-defined values. It takes an optional `config` parameter to override the default values. The returned object contains settings such as repository name, URL, output directory, LLM models, and various prompts for generating documentation.\n\nThe `init` function is an asynchronous function that takes an optional `config` parameter. It first checks if a configuration file already exists in the project directory. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf there is no existing configuration file or the user chooses to overwrite, the function prompts the user for the repository name, URL, and LLM models they have access to. These values are then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the `autodoc.config.json` file in the project directory. A success message is displayed, instructing the user to run `doc index` to get started.\n\nHere's an example of how the `init` function is used:\n\n```javascript\nimport { init } from './autodoc';\n\n(async () => {\n await init();\n})();\n```\n\nThis code imports the `init` function and calls it, initializing the `autodoc` project with the user's inputs and default values.", - "questions": "1. **Question:** What is the purpose of the `makeConfigTemplate` function and what does it return?\n **Answer:** The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc project. It takes an optional `config` parameter of type `AutodocRepoConfig` and returns a new `AutodocRepoConfig` object with default values for each property, using the provided `config` values if available.\n\n2. **Question:** How does the `init` function work and what does it do with the user's input?\n **Answer:** The `init` function is an asynchronous function that initializes the Autodoc configuration by prompting the user for input using the `inquirer` package. It takes an optional `config` parameter of type `AutodocRepoConfig` and uses it as the default values for the prompts. After collecting the user's input, it creates a new configuration object using the `makeConfigTemplate` function and writes it to a file named `autodoc.config.json`.\n\n3. **Question:** What are the different LLM models available in the `llms` prompt and how are they used in the configuration?\n **Answer:** The `llms` prompt provides three choices for the user to select the LLM models they have access to: GPT-3.5 Turbo, GPT-3.5 Turbo and GPT-4 8K (Early Access), and GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The selected LLM models are stored in the `llms` property of the `AutodocRepoConfig` object, which can be used later in the project to determine which models to use for generating documentation." + "filePath": "src\\cli\\commands\\init\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\init\\index.ts", + "summary": "This code is responsible for initializing the configuration of the Autodoc project. It provides a template for the configuration and prompts the user to input necessary information to set up the project. The main functionality is exposed through the `init` function, which is an asynchronous function that takes an optional `AutodocRepoConfig` object as an argument.\n\nThe `makeConfigTemplate` function creates a default configuration object with pre-defined values for various properties. It takes an optional `config` parameter and returns a new `AutodocRepoConfig` object with the provided values or default values if not provided.\n\nThe `init` function first checks if an `autodoc.config.json` file already exists in the project root. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nNext, the user is prompted to enter the name of their repository, the GitHub URL of their repository, and the LLMs they have access to. The LLMs are language models used for generating documentation. The user can choose between GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access).\n\nAfter the user provides the necessary information, a new configuration object is created using the `makeConfigTemplate` function with the user's input. The new configuration is then written to the `autodoc.config.json` file in the project root.\n\nFinally, a success message is displayed, instructing the user to run `doc index` to get started with the Autodoc project.\n\nExample usage:\n\n```javascript\nimport { init } from './path/to/this/file';\n\n// Initialize the configuration with default values\nawait init();\n\n// Initialize the configuration with custom values\nawait init({\n name: 'My Custom Repository',\n repositoryUrl: 'https://github.com/user/repo',\n});\n```", + "questions": "1. **What is the purpose of the `makeConfigTemplate` function?**\n\n The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc project. It takes an optional `config` parameter of type `AutodocRepoConfig` and returns a new configuration object with default values for various properties.\n\n2. **How does the `init` function work and when is it called?**\n\n The `init` function is an asynchronous function that initializes the Autodoc configuration by creating an `autodoc.config.json` file in the specified location. It takes an optional `config` parameter of type `AutodocRepoConfig` and prompts the user for input to set the configuration values. It is called when the user wants to set up the Autodoc configuration for their project.\n\n3. **What is the purpose of the `inquirer.prompt` calls in the `init` function?**\n\n The `inquirer.prompt` calls are used to interactively prompt the user for input to set the configuration values for the Autodoc project. The user is asked for the repository name, repository URL, and the LLMs they have access to. The input is then used to create a new configuration object and write it to the `autodoc.config.json` file.", + "checksum": "b93831ff1f4023ab61c3bea963a8a112" } ], "folders": [], - "summary": "The `index.ts` file in the `init` folder is responsible for initializing and configuring the `autodoc` project. It provides an essential function called `init` that creates a configuration file named `autodoc.config.json` with user inputs and default values. This configuration file is crucial for the project to function correctly and adapt to different user requirements.\n\nThe `makeConfigTemplate` function generates a default configuration object with pre-defined values. It takes an optional `config` parameter to override the default values. The returned object contains settings such as repository name, URL, output directory, LLM models, and various prompts for generating documentation.\n\nThe `init` function is an asynchronous function that takes an optional `config` parameter. It first checks if a configuration file already exists in the project directory. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf there is no existing configuration file or the user chooses to overwrite, the function prompts the user for the repository name, URL, and LLM models they have access to. These values are then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the `autodoc.config.json` file in the project directory. A success message is displayed, instructing the user to run `doc index` to get started.\n\nHere's an example of how the `init` function is used:\n\n```javascript\nimport { init } from './autodoc';\n\n(async () => {\n await init();\n})();\n```\n\nThis code imports the `init` function and calls it, initializing the `autodoc` project with the user's inputs and default values. The `init` function is a crucial part of the project, as it sets up the necessary configuration for the project to work correctly. It interacts with other parts of the project by providing the required settings and values, ensuring that the project can adapt to different user requirements and preferences.", - "questions": "" + "summary": "The `index.ts` file in the `.autodoc\\docs\\json\\src\\cli\\commands\\init` folder is responsible for initializing the configuration of the Autodoc project. It provides a template for the configuration and prompts the user to input necessary information to set up the project. The main functionality is exposed through the `init` function, which is an asynchronous function that takes an optional `AutodocRepoConfig` object as an argument.\n\nThe `makeConfigTemplate` function creates a default configuration object with pre-defined values for various properties. It takes an optional `config` parameter and returns a new `AutodocRepoConfig` object with the provided values or default values if not provided.\n\nThe `init` function first checks if an `autodoc.config.json` file already exists in the project root. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nNext, the user is prompted to enter the name of their repository, the GitHub URL of their repository, and the LLMs they have access to. The LLMs are language models used for generating documentation. The user can choose between GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access).\n\nAfter the user provides the necessary information, a new configuration object is created using the `makeConfigTemplate` function with the user's input. The new configuration is then written to the `autodoc.config.json` file in the project root.\n\nFinally, a success message is displayed, instructing the user to run `doc index` to get started with the Autodoc project.\n\nExample usage:\n\n```javascript\nimport { init } from './path/to/this/file';\n\n// Initialize the configuration with default values\nawait init();\n\n// Initialize the configuration with custom values\nawait init({\n name: 'My Custom Repository',\n repositoryUrl: 'https://github.com/user/repo',\n});\n```\n\nThis code is essential for setting up the Autodoc project, as it creates the necessary configuration file and gathers user input to customize the project. It works in conjunction with other parts of the project, such as the CLI and the documentation generation process, which rely on the configuration file to function correctly.", + "questions": "", + "checksum": "4b8fd2b2abaec4959873fc3396c414d8" }, { "folderName": "query", - "folderPath": ".autodoc/docs/json/src/cli/commands/query", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/query", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\query", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\query", "files": [ { "fileName": "createChatChain.ts", - "filePath": "src/cli/commands/query/createChatChain.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/query/createChatChain.ts", - "summary": "This code defines a function `makeChain` that creates a chatbot for answering questions about a software project. The chatbot is built using the `ChatVectorDBQAChain` class, which combines two separate language models: a question generator and a document chain.\n\nThe question generator is an instance of the `LLMChain` class, which uses the OpenAIChat API to generate standalone questions based on a given conversation history. The `CONDENSE_PROMPT` template is used to format the input for the language model.\n\nThe document chain is created using the `loadQAChain` function, which takes an instance of the OpenAIChat API and a prompt template as input. The `makeQAPrompt` function generates this template, which instructs the language model to provide a conversational answer with hyperlinks to the project's GitHub repository. The answer should be tailored to the target audience and include code examples when appropriate.\n\nThe `makeChain` function takes the following parameters:\n\n- `projectName`: The name of the software project.\n- `repositoryUrl`: The URL of the project's GitHub repository.\n- `contentType`: The type of content the chatbot is trained on (e.g., code, documentation).\n- `chatPrompt`: Additional instructions for answering questions about the content.\n- `targetAudience`: The intended audience for the chatbot's answers (e.g., developers, users).\n- `vectorstore`: An instance of the `HNSWLib` class for storing and searching vectors.\n- `llms`: An array of language models (e.g., GPT-3, GPT-4).\n- `onTokenStream`: An optional callback function to handle streaming tokens.\n\nExample usage:\n\n```javascript\nconst chatbot = makeChain(\n \"autodoc\",\n \"https://github.com/autodoc/autodoc\",\n \"code\",\n \"\",\n \"developer\",\n vectorstore,\n [gpt3, gpt4],\n (token) => console.log(token)\n);\n```\n\nThis creates a chatbot that can answer questions about the \"autodoc\" project, using the provided language models and vector store.", - "questions": "1. **Question:** What is the purpose of the `makeChain` function and what are its input parameters?\n **Answer:** The `makeChain` function is used to create a new `ChatVectorDBQAChain` instance, which is responsible for generating questions and answers based on the given input parameters. The input parameters include `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and an optional `onTokenStream` callback function.\n\n2. **Question:** What are the roles of `CONDENSE_PROMPT` and `QA_PROMPT` in the code?\n **Answer:** `CONDENSE_PROMPT` is a template for generating a standalone question from a given chat history and follow-up input. `QA_PROMPT` is a template for generating a conversational answer with hyperlinks back to GitHub, based on the given context and question. Both templates are used in the `LLMChain` and `loadQAChain` instances, respectively.\n\n3. **Question:** How does the `onTokenStream` callback function work and when is it used?\n **Answer:** The `onTokenStream` callback function is an optional parameter in the `makeChain` function. It is used to handle the streaming of tokens generated by the OpenAIChat instance. If provided, it will be called with each new token generated during the chat process, allowing developers to handle or process the tokens in real-time." + "filePath": "src\\cli\\commands\\query\\createChatChain.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\query\\createChatChain.ts", + "summary": "This code defines a function `makeChain` that creates a chatbot for answering questions about a software project called `projectName`. The chatbot is trained on the content of the project, which is located at `repositoryUrl`. The content type of the project is specified by the `contentType` parameter. The chatbot is designed to provide conversational answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate. The target audience for the chatbot is specified by the `targetAudience` parameter.\n\nThe `makeChain` function takes several parameters:\n\n- `projectName`: The name of the software project.\n- `repositoryUrl`: The URL of the project's repository.\n- `contentType`: The type of content the chatbot is trained on.\n- `chatPrompt`: Additional instructions for answering questions about the content type.\n- `targetAudience`: The intended audience for the chatbot's answers.\n- `vectorstore`: An instance of HNSWLib for efficient nearest neighbor search.\n- `llms`: An array of LLMModels, which are language models used for generating answers.\n- `onTokenStream`: An optional callback function that is called when a new token is generated by the language model.\n\nThe `makeChain` function first creates a question generator using the `LLMChain` class. This generator is responsible for rephrasing follow-up questions to be standalone questions. It uses the `CONDENSE_PROMPT` template, which is defined at the beginning of the code.\n\nNext, the function creates a `QA_PROMPT` template using the `makeQAPrompt` function. This template is used to generate answers to the questions in a conversational manner, with hyperlinks back to GitHub and code examples where appropriate.\n\nFinally, the function creates and returns a new instance of the `ChatVectorDBQAChain` class, which combines the question generator and the document chain to create a chatbot that can answer questions about the software project. The chatbot uses the `vectorstore` for efficient nearest neighbor search and the `llms` language models for generating answers. If the `onTokenStream` callback is provided, it will be called when a new token is generated by the language model.", + "questions": "1. **Question:** What is the purpose of the `makeChain` function and what are its input parameters?\n\n **Answer:** The `makeChain` function is used to create a `ChatVectorDBQAChain` instance, which is responsible for generating questions and answers based on the given input parameters. The input parameters include `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and an optional `onTokenStream` function.\n\n2. **Question:** What are the roles of `CONDENSE_PROMPT` and `QA_PROMPT` in this code?\n\n **Answer:** `CONDENSE_PROMPT` is a template for generating standalone questions from a given chat history and follow-up question. `QA_PROMPT` is a template for generating conversational answers with hyperlinks to GitHub, based on the provided context and question. Both templates are used in the `LLMChain` and `loadQAChain` instances, respectively.\n\n3. **Question:** How does the `onTokenStream` function work and when is it used?\n\n **Answer:** The `onTokenStream` function is an optional callback that can be provided to the `makeChain` function. It is used to handle the streaming of tokens generated by the OpenAIChat instance. If provided, it will be called with each new token generated during the chat process.", + "checksum": "6869048a06de62499933b14c37cddc1d" }, { "fileName": "index.ts", - "filePath": "src/cli/commands/query/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/query/index.ts", - "summary": "This code defines a chatbot interface for the Autodoc project, which allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation.\n\nThe code starts by importing necessary libraries and setting up the `marked` library with a custom terminal renderer for displaying Markdown content. It then defines a `chatHistory` array to store the conversation history between the user and the chatbot.\n\nThe `displayWelcomeMessage` function is used to display a welcome message to the user when they start the chatbot. The `clearScreenAndMoveCursorToTop` function clears the terminal screen and moves the cursor to the top.\n\nThe main function, `query`, takes two arguments: `AutodocRepoConfig` and `AutodocUserConfig`. It initializes the `vectorStore` by loading pre-trained embeddings and creates a `chain` object using the `makeChain` function. This chain object is responsible for generating responses based on the user's input.\n\nThe `getQuestion` function uses the `inquirer` library to prompt the user for a question. The main loop of the chatbot starts by getting the user's question and continues until the user types 'exit'. Inside the loop, the code updates the spinner text to 'Thinking...' and calls the `chain` object with the user's question and chat history. The response is then displayed in Markdown format using the `marked` library.\n\nIf an error occurs during the process, the chatbot displays an error message and prompts the user for another question.\n\nExample usage:\n\n```javascript\nquery(repoConfig, userConfig);\n```\n\nThis chatbot interface can be used in the larger Autodoc project to help users navigate and understand the codebase more efficiently by providing a conversational interface for asking questions and receiving answers.", - "questions": "1. **What is the purpose of the `query` function and what are its input parameters?**\n\n The `query` function is used to interact with the chatbot, taking user input and providing responses based on the given codebase. It takes two input parameters: an `AutodocRepoConfig` object containing information about the repository, and an `AutodocUserConfig` object containing user-specific configuration.\n\n2. **How does the `vectorStore` work and what is its role in the code?**\n\n The `vectorStore` is an instance of HNSWLib loaded with data from the specified output directory and using OpenAIEmbeddings. It is used to store and retrieve vector representations of the codebase, which are then used by the `makeChain` function to generate responses to user questions.\n\n3. **How does the chat history work and what is its purpose?**\n\n The `chatHistory` is an array of string pairs, where each pair represents a user question and the corresponding chatbot response. It is used to store the conversation history between the user and the chatbot, allowing the chatbot to provide context-aware responses based on previous interactions." + "filePath": "src\\cli\\commands\\query\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\query\\index.ts", + "summary": "This code defines a chatbot interface for the Autodoc project, which allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a combination of the `inquirer` library for user input, `marked` and `marked-terminal` for rendering Markdown output, and the `langchain` library for handling natural language processing tasks.\n\nThe `query` function is the main entry point for the chatbot. It takes two arguments: an `AutodocRepoConfig` object containing information about the code repository, and an `AutodocUserConfig` object containing user-specific settings. The function initializes a vector store using the `HNSWLib` and `OpenAIEmbeddings` classes, and creates a chat chain using the `makeChain` function.\n\nThe chatbot interface is displayed using the `displayWelcomeMessage` function, which prints a welcome message to the console. The `getQuestion` function is used to prompt the user for a question using the `inquirer` library. The chatbot then enters a loop, where it processes the user's question, generates a response using the chat chain, and displays the response as Markdown in the terminal.\n\nIf an error occurs during the processing of a question, the chatbot will display an error message and continue to prompt the user for a new question. The loop continues until the user types 'exit', at which point the chatbot terminates.\n\nHere's an example of how the `query` function might be used:\n\n```javascript\nimport { query } from './autodoc';\n\nconst repoConfig = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n output: 'path/to/output',\n contentType: 'code',\n chatPrompt: 'Ask me anything about MyProject',\n targetAudience: 'developers',\n};\n\nconst userConfig = {\n llms: 'path/to/llms',\n};\n\nquery(repoConfig, userConfig);\n```\n\nThis example would initialize the chatbot with the specified repository and user configurations, and start the chatbot interface for the user to ask questions about the \"MyProject\" codebase.", + "questions": "1. **What is the purpose of the `query` function in this code?**\n\n The `query` function is responsible for handling user interactions with the chatbot. It takes in an AutodocRepoConfig object and an AutodocUserConfig object, sets up the necessary data structures, and then enters a loop where it prompts the user for questions, processes them, and displays the results.\n\n2. **How does the code handle rendering Markdown text in the terminal?**\n\n The code uses the `marked` library along with a custom `TerminalRenderer` to render Markdown text in the terminal. The `marked` library is configured with the custom renderer using `marked.setOptions({ renderer: new TerminalRenderer() });`.\n\n3. **What is the purpose of the `chatHistory` variable and how is it used?**\n\n The `chatHistory` variable is an array that stores the history of questions and answers in the chat session. It is used to keep track of the conversation between the user and the chatbot. When a new question is asked, the chat history is passed to the `chain.call()` function, and the new question and its corresponding answer are added to the `chatHistory` array.", + "checksum": "19807a33957666422f31136970c37245" } ], "folders": [], - "summary": "The `query` folder in the Autodoc project contains code for creating a chatbot interface that allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation.\n\nIn `createChatChain.ts`, the `makeChain` function is defined, which creates a chatbot using the `ChatVectorDBQAChain` class. This class combines two separate language models: a question generator and a document chain. The question generator is an instance of the `LLMChain` class, which uses the OpenAIChat API to generate standalone questions based on a given conversation history. The document chain is created using the `loadQAChain` function, which takes an instance of the OpenAIChat API and a prompt template as input.\n\nExample usage of `makeChain`:\n\n```javascript\nconst chatbot = makeChain(\n \"autodoc\",\n \"https://github.com/autodoc/autodoc\",\n \"code\",\n \"\",\n \"developer\",\n vectorstore,\n [gpt3, gpt4],\n (token) => console.log(token)\n);\n```\n\nIn `index.ts`, the main chatbot interface is defined. It starts by importing necessary libraries and setting up the `marked` library with a custom terminal renderer for displaying Markdown content. The main function, `query`, takes two arguments: `AutodocRepoConfig` and `AutodocUserConfig`. It initializes the `vectorStore` by loading pre-trained embeddings and creates a `chain` object using the `makeChain` function. This chain object is responsible for generating responses based on the user's input.\n\nThe main loop of the chatbot starts by getting the user's question and continues until the user types 'exit'. Inside the loop, the code updates the spinner text to 'Thinking...' and calls the `chain` object with the user's question and chat history. The response is then displayed in Markdown format using the `marked` library.\n\nExample usage of the chatbot interface:\n\n```javascript\nquery(repoConfig, userConfig);\n```\n\nThis chatbot interface can be used in the larger Autodoc project to help users navigate and understand the codebase more efficiently by providing a conversational interface for asking questions and receiving answers.", - "questions": "" + "summary": "The `query` folder in the Autodoc project contains code for creating a chatbot that can answer questions about a specific software project in a conversational manner. The chatbot is trained on the content of the project and provides answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate.\n\nThe main entry point for the chatbot is the `query` function in `index.ts`. It takes two arguments: an `AutodocRepoConfig` object containing information about the code repository, and an `AutodocUserConfig` object containing user-specific settings. The function initializes a vector store and creates a chat chain using the `makeChain` function from `createChatChain.ts`.\n\nHere's an example of how the `query` function might be used:\n\n```javascript\nimport { query } from './autodoc';\n\nconst repoConfig = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n output: 'path/to/output',\n contentType: 'code',\n chatPrompt: 'Ask me anything about MyProject',\n targetAudience: 'developers',\n};\n\nconst userConfig = {\n llms: 'path/to/llms',\n};\n\nquery(repoConfig, userConfig);\n```\n\nThis example initializes the chatbot with the specified repository and user configurations and starts the chatbot interface for the user to ask questions about the \"MyProject\" codebase.\n\nThe `createChatChain.ts` file defines the `makeChain` function, which creates a chatbot for answering questions about a software project. The chatbot is designed to provide conversational answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate. The target audience for the chatbot is specified by the `targetAudience` parameter.\n\nThe `makeChain` function takes several parameters, such as `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and `onTokenStream`. It first creates a question generator using the `LLMChain` class, then creates a `QA_PROMPT` template using the `makeQAPrompt` function, and finally creates and returns a new instance of the `ChatVectorDBQAChain` class, which combines the question generator and the document chain to create a chatbot that can answer questions about the software project.\n\nIn summary, the code in the `query` folder is responsible for creating a chatbot that can answer questions about a specific software project in a conversational manner. The chatbot uses a combination of natural language processing techniques and efficient nearest neighbor search to generate accurate and relevant answers for the user.", + "questions": "", + "checksum": "9e0d0f111bf588e2df66862dce9db288" }, { "folderName": "user", - "folderPath": ".autodoc/docs/json/src/cli/commands/user", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/user", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\user", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\user", "files": [ { "fileName": "index.ts", - "filePath": "src/cli/commands/user/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/user/index.ts", - "summary": "This code is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3.5 Turbo, GPT-4 8K, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the specified LLMs or default to GPT-3.5 Turbo if none are provided. This function is used to generate the initial configuration object for the user.\n\nThe `user` function is an asynchronous function that handles the user configuration process. It first checks if a user configuration file already exists. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user decides to continue or if no configuration file exists, the function proceeds to create the necessary directories for the configuration file. It then prompts the user to select the LLMs they have access to using the `inquirer` library. The user can choose from three options:\n\n1. GPT-3.5 Turbo\n2. GPT-3.5 Turbo, GPT-4 8K (Early Access)\n3. GPT-3.5 Turbo, GPT-4 8K (Early Access), GPT-4 32K (Early Access)\n\nAfter the user makes their selection, the new configuration object is created using the `makeConfigTemplate` function with the selected LLMs. The configuration object is then saved to the user configuration file in JSON format.\n\nFinally, the user is informed that the configuration has been saved and they can start querying by running the `doc q` command.", - "questions": "1. **Question:** What is the purpose of the `makeConfigTemplate` function and what does it return?\n **Answer:** The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc user. It takes an optional `config` parameter of type `AutodocUserConfig` and returns a new configuration object with the `llms` property set to the provided value or a default value of `[LLMModels.GPT3]`.\n\n2. **Question:** How does the `user` function handle existing user configuration files?\n **Answer:** The `user` function checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the function prompts the user with a confirmation message to overwrite the existing configuration. If the user chooses not to overwrite, the process exits; otherwise, the function proceeds to create a new configuration.\n\n3. **Question:** What are the available choices for the LLMs in the `user` function, and how are they used to create the new configuration?\n **Answer:** The available choices for LLMs are GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The user can select one of these options, and the corresponding LLM models will be set as the value of the `llms` property in the new configuration object." + "filePath": "src\\cli\\commands\\user\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\user\\index.ts", + "summary": "This code is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3, GPT-4, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the provided `config` parameter or with GPT-3 as the default LLM. This function is used to generate a new configuration object when needed.\n\nThe main function, `user`, is an asynchronous function that takes an optional `config` parameter. It first checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user configuration file does not exist, the code attempts to create the necessary directories for the file. If there's an error during this process, it logs the error and exits with a non-zero status code.\n\nNext, the user is prompted to select which LLMs they have access to. The available options are GPT-3.5 Turbo, GPT-3.5 Turbo with GPT-4 8K (Early Access), and GPT-3.5 Turbo with GPT-4 8K and GPT-4 32K (Early Access). The user's selection is then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the user configuration file in JSON format. A success message is displayed to the user, indicating that the configuration has been saved and they can start querying using the `doc q` command.\n\nExample usage:\n\n```javascript\nimport { user } from './path/to/this/file';\n\n// Create a new user configuration with default settings\nawait user();\n\n// Update the user configuration with a custom config object\nawait user({ llms: [LLMModels.GPT3, LLMModels.GPT4] });\n```", + "questions": "1. **What is the purpose of the `makeConfigTemplate` function?**\n\n The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc user. It takes an optional `config` parameter and returns an object with a `llms` property, which is an array of LLM models.\n\n2. **How does the `user` function handle existing user configuration files?**\n\n The `user` function checks if a user configuration file already exists using `fsSync.existsSync`. If it does, the user is prompted with a confirmation message to overwrite the existing configuration. If the user chooses not to overwrite, the process exits with a status code of 0.\n\n3. **What are the available choices for LLM models in the `user` function?**\n\n The available choices for LLM models are GPT-3.5 Turbo, GPT-3.5 Turbo and GPT-4 8K (Early Access), and GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The user can select one of these options, and the selected value is stored in the `llms` property of the new configuration object.", + "checksum": "76bc1e6d5d61e24907832c4cac443225" } ], "folders": [], - "summary": "The `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3.5 Turbo, GPT-4 8K, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the specified LLMs or default to GPT-3.5 Turbo if none are provided. This function is used to generate the initial configuration object for the user.\n\n```typescript\nfunction makeConfigTemplate(llms: string[]): ConfigTemplate {\n // ...\n}\n```\n\nThe `user` function is an asynchronous function that handles the user configuration process. It first checks if a user configuration file already exists. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\n```typescript\nasync function user(): Promise {\n // ...\n}\n```\n\nIf the user decides to continue or if no configuration file exists, the function proceeds to create the necessary directories for the configuration file. It then prompts the user to select the LLMs they have access to using the `inquirer` library. The user can choose from three options:\n\n1. GPT-3.5 Turbo\n2. GPT-3.5 Turbo, GPT-4 8K (Early Access)\n3. GPT-3.5 Turbo, GPT-4 8K (Early Access), GPT-4 32K (Early Access)\n\nAfter the user makes their selection, the new configuration object is created using the `makeConfigTemplate` function with the selected LLMs. The configuration object is then saved to the user configuration file in JSON format.\n\n```typescript\nconst configTemplate = makeConfigTemplate(selectedLLMs);\nawait fs.promises.writeFile(configPath, JSON.stringify(configTemplate, null, 2));\n```\n\nFinally, the user is informed that the configuration has been saved and they can start querying by running the `doc q` command.\n\nThis code is essential for setting up the user's environment and preferences for the Autodoc project. It ensures that the user has the correct configuration file in place, which is necessary for the proper functioning of the project. The user configuration file is used by other parts of the project to determine which LLMs the user has access to and can query.\n\nFor example, when a user runs the `doc q` command, the project will read the user configuration file to determine which LLMs are available for querying. This ensures that the user only queries the LLMs they have access to, preventing any unauthorized access or usage.\n\nIn summary, the `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project, ensuring that the user has the correct configuration file in place, and allowing the user to select the LLMs they have access to. This is essential for the proper functioning of the project and for maintaining the user's preferences and access to different LLMs.", - "questions": "" + "summary": "The `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project. It allows users to create, update, and save their configuration file, which stores information about their access to different Language Learning Models (LLMs) such as GPT-3, GPT-4, and GPT-4 32K.\n\nThe `makeConfigTemplate` function creates a default configuration object with either the provided `config` parameter or GPT-3 as the default LLM. This function is useful for generating a new configuration object when needed.\n\nThe main function, `user`, is an asynchronous function that takes an optional `config` parameter. It first checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user configuration file does not exist, the code attempts to create the necessary directories for the file. If there's an error during this process, it logs the error and exits with a non-zero status code.\n\nNext, the user is prompted to select which LLMs they have access to. The available options are GPT-3.5 Turbo, GPT-3.5 Turbo with GPT-4 8K (Early Access), and GPT-3.5 Turbo with GPT-4 8K and GPT-4 32K (Early Access). The user's selection is then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the user configuration file in JSON format. A success message is displayed to the user, indicating that the configuration has been saved and they can start querying using the `doc q` command.\n\nThis code is essential for the Autodoc project as it allows users to manage their access to different LLMs and store their preferences in a configuration file. This configuration file can then be used by other parts of the project to determine which LLMs the user has access to and tailor the querying process accordingly.\n\nExample usage:\n\n```javascript\nimport { user } from './path/to/this/file';\n\n// Create a new user configuration with default settings\nawait user();\n\n// Update the user configuration with a custom config object\nawait user({ llms: [LLMModels.GPT3, LLMModels.GPT4] });\n```\n\nIn summary, the `index.ts` file in the `user` folder is a crucial part of the Autodoc project, allowing users to manage their LLM access and preferences. This configuration is then used by other parts of the project to provide a tailored experience based on the user's access to different LLMs.", + "questions": "", + "checksum": "4b8fd2b2abaec4959873fc3396c414d8" } ], - "summary": "The code in the `src/cli/commands` folder is responsible for handling various command-line tasks in the Autodoc project. It contains several subfolders, each dedicated to a specific command or functionality, such as estimating costs, processing repositories, initializing the project, querying the chatbot, and managing user configurations.\n\nFor instance, the `estimate` subfolder contains a function that allows users to estimate the cost of indexing a given repository before actually processing it. This function takes an `AutodocRepoConfig` object as input and performs a dry run of the `processRepository` function. It then calculates the total estimated cost and displays it to the user. This helps users make informed decisions about whether to proceed with the indexing process or not.\n\n```javascript\nimport { estimate } from './autodoc/estimate';\n\nconst config = {\n // ...configuration options...\n};\n\nestimate(config);\n```\n\nThe `index` subfolder contains code for processing a given code repository, generating documentation in JSON and Markdown formats, and creating vector files for the documentation. It provides several functions and utilities to achieve these tasks, such as traversing the file system, calling language models, and converting JSON files to Markdown.\n\n```javascript\nimport autodoc from './autodoc';\n\nconst config = {\n // ...configuration options...\n};\n\nautodoc.index(config);\n```\n\nThe `init` subfolder is responsible for initializing and configuring the `autodoc` project. It provides an essential function called `init` that creates a configuration file named `autodoc.config.json` with user inputs and default values.\n\n```javascript\nimport { init } from './autodoc';\n\n(async () => {\n await init();\n})();\n```\n\nThe `query` subfolder contains code for creating a chatbot interface that allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation.\n\n```javascript\nquery(repoConfig, userConfig);\n```\n\nThe `user` subfolder is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs).\n\n```typescript\nasync function user(): Promise {\n // ...\n}\n```\n\nIn summary, the code in the `src/cli/commands` folder plays a crucial role in the Autodoc project by providing various command-line functionalities, such as estimating costs, processing repositories, initializing the project, querying the chatbot, and managing user configurations. These functionalities help developers to easily generate and maintain documentation for their projects, making it more accessible and understandable for other developers and users.", - "questions": "" + "summary": "The code in the `.autodoc\\docs\\json\\src\\cli\\commands` folder is responsible for various tasks related to the Autodoc project, such as initializing the configuration, processing repositories, generating documentation, and creating a chatbot for answering questions about a specific software project. The folder contains several subfolders, each with a specific purpose.\n\n### estimate\n\nThe `estimate` function provides an estimated cost of processing a given repository. It takes an `AutodocRepoConfig` object as input and performs a dry run of the repository processing to calculate the estimated cost. Example usage:\n\n```javascript\nimport { estimate } from './path/to/this/file';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\n### index\n\nThe code in this folder processes a given repository and generates documentation in JSON, Markdown, and vector formats. It takes an `AutodocRepoConfig` object as input and performs three main tasks: processing the repository, creating Markdown files, and creating vector files. Example usage:\n\n```javascript\nindex({\n name: \"myProject\",\n root: \"./input\",\n output: \"./output\",\n filePrompt: true,\n folderPrompt: true,\n contentType: \"code\",\n targetAudience: \"developers\",\n linkHosted: \"https://github.com/user/myProject\",\n});\n```\n\n### init\n\nThe `init` function initializes the configuration of the Autodoc project. It prompts the user to input necessary information to set up the project and creates the `autodoc.config.json` file in the project root. Example usage:\n\n```javascript\nimport { init } from './path/to/this/file';\n\n// Initialize the configuration with default values\nawait init();\n\n// Initialize the configuration with custom values\nawait init({\n name: 'My Custom Repository',\n repositoryUrl: 'https://github.com/user/repo',\n});\n```\n\n### query\n\nThe `query` folder contains code for creating a chatbot that can answer questions about a specific software project. The main entry point is the `query` function, which takes an `AutodocRepoConfig` object and an `AutodocUserConfig` object as input. Example usage:\n\n```javascript\nimport { query } from './autodoc';\n\nconst repoConfig = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n output: 'path/to/output',\n contentType: 'code',\n chatPrompt: 'Ask me anything about MyProject',\n targetAudience: 'developers',\n};\n\nconst userConfig = {\n llms: 'path/to/llms',\n};\n\nquery(repoConfig, userConfig);\n```\n\n### user\n\nThe `user` folder manages the user configuration for the Autodoc project. It allows users to create, update, and save their configuration file, which stores information about their access to different Language Learning Models (LLMs). Example usage:\n\n```javascript\nimport { user } from './path/to/this/file';\n\n// Create a new user configuration with default settings\nawait user();\n\n// Update the user configuration with a custom config object\nawait user({ llms: [LLMModels.GPT3, LLMModels.GPT4] });\n```\n\nIn summary, the code in this folder is essential for various tasks related to the Autodoc project, such as initializing the configuration, processing repositories, generating documentation, and creating a chatbot for answering questions about a specific software project.", + "questions": "", + "checksum": "d11f941351fb51140313ada9b52bbf1a" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/commands/user/index.json b/.autodoc/docs/json/src/cli/commands/user/index.json index bd017c2..cfc49a8 100644 --- a/.autodoc/docs/json/src/cli/commands/user/index.json +++ b/.autodoc/docs/json/src/cli/commands/user/index.json @@ -1,7 +1,8 @@ { "fileName": "index.ts", - "filePath": "src/cli/commands/user/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/user/index.ts", - "summary": "This code is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3.5 Turbo, GPT-4 8K, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the specified LLMs or default to GPT-3.5 Turbo if none are provided. This function is used to generate the initial configuration object for the user.\n\nThe `user` function is an asynchronous function that handles the user configuration process. It first checks if a user configuration file already exists. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user decides to continue or if no configuration file exists, the function proceeds to create the necessary directories for the configuration file. It then prompts the user to select the LLMs they have access to using the `inquirer` library. The user can choose from three options:\n\n1. GPT-3.5 Turbo\n2. GPT-3.5 Turbo, GPT-4 8K (Early Access)\n3. GPT-3.5 Turbo, GPT-4 8K (Early Access), GPT-4 32K (Early Access)\n\nAfter the user makes their selection, the new configuration object is created using the `makeConfigTemplate` function with the selected LLMs. The configuration object is then saved to the user configuration file in JSON format.\n\nFinally, the user is informed that the configuration has been saved and they can start querying by running the `doc q` command.", - "questions": "1. **Question:** What is the purpose of the `makeConfigTemplate` function and what does it return?\n **Answer:** The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc user. It takes an optional `config` parameter of type `AutodocUserConfig` and returns a new configuration object with the `llms` property set to the provided value or a default value of `[LLMModels.GPT3]`.\n\n2. **Question:** How does the `user` function handle existing user configuration files?\n **Answer:** The `user` function checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the function prompts the user with a confirmation message to overwrite the existing configuration. If the user chooses not to overwrite, the process exits; otherwise, the function proceeds to create a new configuration.\n\n3. **Question:** What are the available choices for the LLMs in the `user` function, and how are they used to create the new configuration?\n **Answer:** The available choices for LLMs are GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The user can select one of these options, and the corresponding LLM models will be set as the value of the `llms` property in the new configuration object." + "filePath": "src\\cli\\commands\\user\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\user\\index.ts", + "summary": "This code is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3, GPT-4, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the provided `config` parameter or with GPT-3 as the default LLM. This function is used to generate a new configuration object when needed.\n\nThe main function, `user`, is an asynchronous function that takes an optional `config` parameter. It first checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user configuration file does not exist, the code attempts to create the necessary directories for the file. If there's an error during this process, it logs the error and exits with a non-zero status code.\n\nNext, the user is prompted to select which LLMs they have access to. The available options are GPT-3.5 Turbo, GPT-3.5 Turbo with GPT-4 8K (Early Access), and GPT-3.5 Turbo with GPT-4 8K and GPT-4 32K (Early Access). The user's selection is then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the user configuration file in JSON format. A success message is displayed to the user, indicating that the configuration has been saved and they can start querying using the `doc q` command.\n\nExample usage:\n\n```javascript\nimport { user } from './path/to/this/file';\n\n// Create a new user configuration with default settings\nawait user();\n\n// Update the user configuration with a custom config object\nawait user({ llms: [LLMModels.GPT3, LLMModels.GPT4] });\n```", + "questions": "1. **What is the purpose of the `makeConfigTemplate` function?**\n\n The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc user. It takes an optional `config` parameter and returns an object with a `llms` property, which is an array of LLM models.\n\n2. **How does the `user` function handle existing user configuration files?**\n\n The `user` function checks if a user configuration file already exists using `fsSync.existsSync`. If it does, the user is prompted with a confirmation message to overwrite the existing configuration. If the user chooses not to overwrite, the process exits with a status code of 0.\n\n3. **What are the available choices for LLM models in the `user` function?**\n\n The available choices for LLM models are GPT-3.5 Turbo, GPT-3.5 Turbo and GPT-4 8K (Early Access), and GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The user can select one of these options, and the selected value is stored in the `llms` property of the new configuration object.", + "checksum": "76bc1e6d5d61e24907832c4cac443225" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/commands/user/summary.json b/.autodoc/docs/json/src/cli/commands/user/summary.json index 79bbbbe..c33f9ce 100644 --- a/.autodoc/docs/json/src/cli/commands/user/summary.json +++ b/.autodoc/docs/json/src/cli/commands/user/summary.json @@ -1,17 +1,19 @@ { "folderName": "user", - "folderPath": ".autodoc/docs/json/src/cli/commands/user", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/user", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\user", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\user", "files": [ { "fileName": "index.ts", - "filePath": "src/cli/commands/user/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/user/index.ts", - "summary": "This code is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3.5 Turbo, GPT-4 8K, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the specified LLMs or default to GPT-3.5 Turbo if none are provided. This function is used to generate the initial configuration object for the user.\n\nThe `user` function is an asynchronous function that handles the user configuration process. It first checks if a user configuration file already exists. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user decides to continue or if no configuration file exists, the function proceeds to create the necessary directories for the configuration file. It then prompts the user to select the LLMs they have access to using the `inquirer` library. The user can choose from three options:\n\n1. GPT-3.5 Turbo\n2. GPT-3.5 Turbo, GPT-4 8K (Early Access)\n3. GPT-3.5 Turbo, GPT-4 8K (Early Access), GPT-4 32K (Early Access)\n\nAfter the user makes their selection, the new configuration object is created using the `makeConfigTemplate` function with the selected LLMs. The configuration object is then saved to the user configuration file in JSON format.\n\nFinally, the user is informed that the configuration has been saved and they can start querying by running the `doc q` command.", - "questions": "1. **Question:** What is the purpose of the `makeConfigTemplate` function and what does it return?\n **Answer:** The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc user. It takes an optional `config` parameter of type `AutodocUserConfig` and returns a new configuration object with the `llms` property set to the provided value or a default value of `[LLMModels.GPT3]`.\n\n2. **Question:** How does the `user` function handle existing user configuration files?\n **Answer:** The `user` function checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the function prompts the user with a confirmation message to overwrite the existing configuration. If the user chooses not to overwrite, the process exits; otherwise, the function proceeds to create a new configuration.\n\n3. **Question:** What are the available choices for the LLMs in the `user` function, and how are they used to create the new configuration?\n **Answer:** The available choices for LLMs are GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The user can select one of these options, and the corresponding LLM models will be set as the value of the `llms` property in the new configuration object." + "filePath": "src\\cli\\commands\\user\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\user\\index.ts", + "summary": "This code is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3, GPT-4, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the provided `config` parameter or with GPT-3 as the default LLM. This function is used to generate a new configuration object when needed.\n\nThe main function, `user`, is an asynchronous function that takes an optional `config` parameter. It first checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user configuration file does not exist, the code attempts to create the necessary directories for the file. If there's an error during this process, it logs the error and exits with a non-zero status code.\n\nNext, the user is prompted to select which LLMs they have access to. The available options are GPT-3.5 Turbo, GPT-3.5 Turbo with GPT-4 8K (Early Access), and GPT-3.5 Turbo with GPT-4 8K and GPT-4 32K (Early Access). The user's selection is then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the user configuration file in JSON format. A success message is displayed to the user, indicating that the configuration has been saved and they can start querying using the `doc q` command.\n\nExample usage:\n\n```javascript\nimport { user } from './path/to/this/file';\n\n// Create a new user configuration with default settings\nawait user();\n\n// Update the user configuration with a custom config object\nawait user({ llms: [LLMModels.GPT3, LLMModels.GPT4] });\n```", + "questions": "1. **What is the purpose of the `makeConfigTemplate` function?**\n\n The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc user. It takes an optional `config` parameter and returns an object with a `llms` property, which is an array of LLM models.\n\n2. **How does the `user` function handle existing user configuration files?**\n\n The `user` function checks if a user configuration file already exists using `fsSync.existsSync`. If it does, the user is prompted with a confirmation message to overwrite the existing configuration. If the user chooses not to overwrite, the process exits with a status code of 0.\n\n3. **What are the available choices for LLM models in the `user` function?**\n\n The available choices for LLM models are GPT-3.5 Turbo, GPT-3.5 Turbo and GPT-4 8K (Early Access), and GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The user can select one of these options, and the selected value is stored in the `llms` property of the new configuration object.", + "checksum": "76bc1e6d5d61e24907832c4cac443225" } ], "folders": [], - "summary": "The `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3.5 Turbo, GPT-4 8K, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the specified LLMs or default to GPT-3.5 Turbo if none are provided. This function is used to generate the initial configuration object for the user.\n\n```typescript\nfunction makeConfigTemplate(llms: string[]): ConfigTemplate {\n // ...\n}\n```\n\nThe `user` function is an asynchronous function that handles the user configuration process. It first checks if a user configuration file already exists. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\n```typescript\nasync function user(): Promise {\n // ...\n}\n```\n\nIf the user decides to continue or if no configuration file exists, the function proceeds to create the necessary directories for the configuration file. It then prompts the user to select the LLMs they have access to using the `inquirer` library. The user can choose from three options:\n\n1. GPT-3.5 Turbo\n2. GPT-3.5 Turbo, GPT-4 8K (Early Access)\n3. GPT-3.5 Turbo, GPT-4 8K (Early Access), GPT-4 32K (Early Access)\n\nAfter the user makes their selection, the new configuration object is created using the `makeConfigTemplate` function with the selected LLMs. The configuration object is then saved to the user configuration file in JSON format.\n\n```typescript\nconst configTemplate = makeConfigTemplate(selectedLLMs);\nawait fs.promises.writeFile(configPath, JSON.stringify(configTemplate, null, 2));\n```\n\nFinally, the user is informed that the configuration has been saved and they can start querying by running the `doc q` command.\n\nThis code is essential for setting up the user's environment and preferences for the Autodoc project. It ensures that the user has the correct configuration file in place, which is necessary for the proper functioning of the project. The user configuration file is used by other parts of the project to determine which LLMs the user has access to and can query.\n\nFor example, when a user runs the `doc q` command, the project will read the user configuration file to determine which LLMs are available for querying. This ensures that the user only queries the LLMs they have access to, preventing any unauthorized access or usage.\n\nIn summary, the `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project, ensuring that the user has the correct configuration file in place, and allowing the user to select the LLMs they have access to. This is essential for the proper functioning of the project and for maintaining the user's preferences and access to different LLMs.", - "questions": "" + "summary": "The `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project. It allows users to create, update, and save their configuration file, which stores information about their access to different Language Learning Models (LLMs) such as GPT-3, GPT-4, and GPT-4 32K.\n\nThe `makeConfigTemplate` function creates a default configuration object with either the provided `config` parameter or GPT-3 as the default LLM. This function is useful for generating a new configuration object when needed.\n\nThe main function, `user`, is an asynchronous function that takes an optional `config` parameter. It first checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user configuration file does not exist, the code attempts to create the necessary directories for the file. If there's an error during this process, it logs the error and exits with a non-zero status code.\n\nNext, the user is prompted to select which LLMs they have access to. The available options are GPT-3.5 Turbo, GPT-3.5 Turbo with GPT-4 8K (Early Access), and GPT-3.5 Turbo with GPT-4 8K and GPT-4 32K (Early Access). The user's selection is then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the user configuration file in JSON format. A success message is displayed to the user, indicating that the configuration has been saved and they can start querying using the `doc q` command.\n\nThis code is essential for the Autodoc project as it allows users to manage their access to different LLMs and store their preferences in a configuration file. This configuration file can then be used by other parts of the project to determine which LLMs the user has access to and tailor the querying process accordingly.\n\nExample usage:\n\n```javascript\nimport { user } from './path/to/this/file';\n\n// Create a new user configuration with default settings\nawait user();\n\n// Update the user configuration with a custom config object\nawait user({ llms: [LLMModels.GPT3, LLMModels.GPT4] });\n```\n\nIn summary, the `index.ts` file in the `user` folder is a crucial part of the Autodoc project, allowing users to manage their LLM access and preferences. This configuration is then used by other parts of the project to provide a tailored experience based on the user's access to different LLMs.", + "questions": "", + "checksum": "4b8fd2b2abaec4959873fc3396c414d8" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/spinner.json b/.autodoc/docs/json/src/cli/spinner.json index 144eef6..7508112 100644 --- a/.autodoc/docs/json/src/cli/spinner.json +++ b/.autodoc/docs/json/src/cli/spinner.json @@ -1,7 +1,8 @@ { "fileName": "spinner.ts", - "filePath": "src/cli/spinner.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/spinner.ts", - "summary": "This code provides a utility for managing a command-line spinner using the `ora` library. The spinner is a visual indicator that displays a series of characters in a loop, giving the user feedback that a process is running in the background. The code exports several functions to control the spinner's behavior, such as updating the text, stopping the spinner, and displaying success, error, or informational messages.\n\nThe `spinner` object is created as a singleton to ensure that there is only one instance of the spinner at any given time. This prevents multiple spinners from being displayed simultaneously, which could cause confusion for the user. The spinner is configured to use the 'dots' style.\n\nThe `updateSpinnerText` function is used to update the spinner's text. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the given message. For example:\n\n```javascript\nupdateSpinnerText('Loading data...');\n```\n\nThe `stopSpinner` function stops the spinner if it is currently spinning:\n\n```javascript\nstopSpinner();\n```\n\nThe `spinnerError`, `spinnerSuccess`, and `spinnerInfo` functions are used to display error, success, and informational messages, respectively. These functions first check if the spinner is spinning and then call the appropriate `ora` method to display the message with the corresponding status symbol (e.g., a red cross for errors, a green checkmark for success, etc.):\n\n```javascript\nspinnerError('An error occurred');\nspinnerSuccess('Operation completed successfully');\nspinnerInfo('Please wait...');\n```\n\nIn the larger project, this utility can be used to provide a consistent and user-friendly interface for displaying progress and status messages during long-running tasks or processes.", - "questions": "1. **What is the purpose of the `ora` package in this code?**\n\n The `ora` package is used to create a spinner in the terminal, providing a visual indication of a running process. In this code, it is used to create a singleton spinner with the 'dots' style.\n\n2. **What are the different states of the spinner and how are they updated?**\n\n The spinner can have different states such as spinning, stopped, failed, succeeded, and displaying information. The functions `updateSpinnerText`, `stopSpinner`, `spinnerError`, `spinnerSuccess`, and `spinnerInfo` are used to update the spinner's state and text accordingly.\n\n3. **How does the `updateSpinnerText` function work and when should it be used?**\n\n The `updateSpinnerText` function updates the spinner's text with the provided message. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the new message. This function should be used when you want to change the spinner's text while it is spinning or start it with a new message." + "filePath": "src\\cli\\spinner.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\spinner.ts", + "summary": "This code is responsible for managing a spinner, which is a visual element that indicates a process is running in the background. The spinner is created using the `ora` library, which provides a simple and customizable way to create spinners for command-line interfaces.\n\nThe code starts by importing the `ora` library and creating a singleton spinner instance with the 'dots' style. This ensures that there will only be one spinner active at any given time.\n\nThere are several functions exported by this module to interact with the spinner:\n\n1. `updateSpinnerText(message: string)`: This function updates the spinner's text with the provided message. If the spinner is already spinning, it simply updates the text; otherwise, it starts the spinner with the new message.\n\n Example usage:\n ```javascript\n updateSpinnerText('Loading data...');\n ```\n\n2. `stopSpinner()`: This function stops the spinner if it is currently spinning.\n\n Example usage:\n ```javascript\n stopSpinner();\n ```\n\n3. `spinnerError(message?: string)`: This function stops the spinner and marks it as failed with an optional error message. It only takes effect if the spinner is currently spinning.\n\n Example usage:\n ```javascript\n spinnerError('Failed to load data');\n ```\n\n4. `spinnerSuccess(message?: string)`: This function stops the spinner and marks it as successful with an optional success message. It only takes effect if the spinner is currently spinning.\n\n Example usage:\n ```javascript\n spinnerSuccess('Data loaded successfully');\n ```\n\n5. `spinnerInfo(message: string)`: This function displays an informational message without affecting the spinner's state.\n\n Example usage:\n ```javascript\n spinnerInfo('Connecting to server...');\n ```\n\nIn the larger project, this module can be used to provide visual feedback to users when a background process is running, such as loading data, connecting to a server, or performing a complex calculation. By using the exported functions, developers can easily update the spinner's text, stop it, or change its state to indicate success, failure, or display informational messages.", + "questions": "1. **What is the purpose of the `ora` package in this code?**\n\n The `ora` package is used to create a spinner in the command line interface, providing a visual indication of a running process. In this code, it is used to create a singleton spinner with the 'dots' style.\n\n2. **How does the `updateSpinnerText` function work?**\n\n The `updateSpinnerText` function takes a message as an input and updates the spinner's text with the given message. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the new message.\n\n3. **What are the differences between `spinnerError`, `spinnerSuccess`, and `spinnerInfo` functions?**\n\n These functions are used to update the spinner's state and message based on the outcome of a process. `spinnerError` is called when there is an error, and it stops the spinner with a failure message. `spinnerSuccess` is called when the process is successful, and it stops the spinner with a success message. `spinnerInfo` is used to display an informational message without stopping the spinner.", + "checksum": "d93ad7e714ce5446916bb1d63cbb6031" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/summary.json b/.autodoc/docs/json/src/cli/summary.json index 01f91a8..84cfdca 100644 --- a/.autodoc/docs/json/src/cli/summary.json +++ b/.autodoc/docs/json/src/cli/summary.json @@ -1,193 +1,217 @@ { "folderName": "cli", - "folderPath": ".autodoc/docs/json/src/cli", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli", + "folderPath": ".autodoc\\docs\\json\\src\\cli", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli", "files": [ { "fileName": "spinner.ts", - "filePath": "src/cli/spinner.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/spinner.ts", - "summary": "This code provides a utility for managing a command-line spinner using the `ora` library. The spinner is a visual indicator that displays a series of characters in a loop, giving the user feedback that a process is running in the background. The code exports several functions to control the spinner's behavior, such as updating the text, stopping the spinner, and displaying success, error, or informational messages.\n\nThe `spinner` object is created as a singleton to ensure that there is only one instance of the spinner at any given time. This prevents multiple spinners from being displayed simultaneously, which could cause confusion for the user. The spinner is configured to use the 'dots' style.\n\nThe `updateSpinnerText` function is used to update the spinner's text. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the given message. For example:\n\n```javascript\nupdateSpinnerText('Loading data...');\n```\n\nThe `stopSpinner` function stops the spinner if it is currently spinning:\n\n```javascript\nstopSpinner();\n```\n\nThe `spinnerError`, `spinnerSuccess`, and `spinnerInfo` functions are used to display error, success, and informational messages, respectively. These functions first check if the spinner is spinning and then call the appropriate `ora` method to display the message with the corresponding status symbol (e.g., a red cross for errors, a green checkmark for success, etc.):\n\n```javascript\nspinnerError('An error occurred');\nspinnerSuccess('Operation completed successfully');\nspinnerInfo('Please wait...');\n```\n\nIn the larger project, this utility can be used to provide a consistent and user-friendly interface for displaying progress and status messages during long-running tasks or processes.", - "questions": "1. **What is the purpose of the `ora` package in this code?**\n\n The `ora` package is used to create a spinner in the terminal, providing a visual indication of a running process. In this code, it is used to create a singleton spinner with the 'dots' style.\n\n2. **What are the different states of the spinner and how are they updated?**\n\n The spinner can have different states such as spinning, stopped, failed, succeeded, and displaying information. The functions `updateSpinnerText`, `stopSpinner`, `spinnerError`, `spinnerSuccess`, and `spinnerInfo` are used to update the spinner's state and text accordingly.\n\n3. **How does the `updateSpinnerText` function work and when should it be used?**\n\n The `updateSpinnerText` function updates the spinner's text with the provided message. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the new message. This function should be used when you want to change the spinner's text while it is spinning or start it with a new message." + "filePath": "src\\cli\\spinner.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\spinner.ts", + "summary": "This code is responsible for managing a spinner, which is a visual element that indicates a process is running in the background. The spinner is created using the `ora` library, which provides a simple and customizable way to create spinners for command-line interfaces.\n\nThe code starts by importing the `ora` library and creating a singleton spinner instance with the 'dots' style. This ensures that there will only be one spinner active at any given time.\n\nThere are several functions exported by this module to interact with the spinner:\n\n1. `updateSpinnerText(message: string)`: This function updates the spinner's text with the provided message. If the spinner is already spinning, it simply updates the text; otherwise, it starts the spinner with the new message.\n\n Example usage:\n ```javascript\n updateSpinnerText('Loading data...');\n ```\n\n2. `stopSpinner()`: This function stops the spinner if it is currently spinning.\n\n Example usage:\n ```javascript\n stopSpinner();\n ```\n\n3. `spinnerError(message?: string)`: This function stops the spinner and marks it as failed with an optional error message. It only takes effect if the spinner is currently spinning.\n\n Example usage:\n ```javascript\n spinnerError('Failed to load data');\n ```\n\n4. `spinnerSuccess(message?: string)`: This function stops the spinner and marks it as successful with an optional success message. It only takes effect if the spinner is currently spinning.\n\n Example usage:\n ```javascript\n spinnerSuccess('Data loaded successfully');\n ```\n\n5. `spinnerInfo(message: string)`: This function displays an informational message without affecting the spinner's state.\n\n Example usage:\n ```javascript\n spinnerInfo('Connecting to server...');\n ```\n\nIn the larger project, this module can be used to provide visual feedback to users when a background process is running, such as loading data, connecting to a server, or performing a complex calculation. By using the exported functions, developers can easily update the spinner's text, stop it, or change its state to indicate success, failure, or display informational messages.", + "questions": "1. **What is the purpose of the `ora` package in this code?**\n\n The `ora` package is used to create a spinner in the command line interface, providing a visual indication of a running process. In this code, it is used to create a singleton spinner with the 'dots' style.\n\n2. **How does the `updateSpinnerText` function work?**\n\n The `updateSpinnerText` function takes a message as an input and updates the spinner's text with the given message. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the new message.\n\n3. **What are the differences between `spinnerError`, `spinnerSuccess`, and `spinnerInfo` functions?**\n\n These functions are used to update the spinner's state and message based on the outcome of a process. `spinnerError` is called when there is an error, and it stops the spinner with a failure message. `spinnerSuccess` is called when the process is successful, and it stops the spinner with a success message. `spinnerInfo` is used to display an informational message without stopping the spinner.", + "checksum": "d93ad7e714ce5446916bb1d63cbb6031" } ], "folders": [ { "folderName": "commands", - "folderPath": ".autodoc/docs/json/src/cli/commands", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands", "files": [], "folders": [ { "folderName": "estimate", - "folderPath": ".autodoc/docs/json/src/cli/commands/estimate", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/estimate", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\estimate", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\estimate", "files": [ { "fileName": "index.ts", - "filePath": "src/cli/commands/estimate/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/estimate/index.ts", - "summary": "The `estimate` function in this code file is responsible for providing an estimated cost of indexing a given repository using the AutodocRepoConfig configuration. This function is particularly useful for users who want to get an idea of the cost involved in processing their repository before actually running the process.\n\nThe function takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe main steps involved in the function are:\n\n1. Set the output path for the JSON files generated during the process.\n2. Update the spinner text to display \"Estimating cost...\".\n3. Perform a dry run of the `processRepository` function with the given configuration options. The dry run does not actually process the repository but instead returns the details of the models that would be processed.\n4. Stop the spinner once the dry run is complete.\n5. Print the details of the models obtained from the dry run using the `printModelDetails` utility function.\n6. Calculate the total estimated cost using the `totalIndexCostEstimate` utility function.\n7. Display the estimated cost in a user-friendly format using the `chalk` library.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './autodoc/estimate';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output/',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example demonstrates how a user can call the `estimate` function with a specific configuration to get an estimated cost for processing their repository.", - "questions": "1. **What is the purpose of the `estimate` function and what parameters does it accept?**\n\n The `estimate` function is used to estimate the cost of processing a repository for indexing. It accepts an `AutodocRepoConfig` object as a parameter, which contains various configuration options such as repository URL, output path, and other settings.\n\n2. **How does the `estimate` function calculate the cost estimate?**\n\n The `estimate` function performs a dry run of the `processRepository` command to get the estimated price for indexing the repository. It then uses the `totalIndexCostEstimate` function to calculate the total cost based on the returned run details.\n\n3. **What is the purpose of the `printModelDetails` function and how is it used in the `estimate` function?**\n\n The `printModelDetails` function is used to display the details of the models used in the estimation process. In the `estimate` function, it is called with the values of the `runDetails` object to print the model details before displaying the total cost estimate." + "filePath": "src\\cli\\commands\\estimate\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\estimate\\index.ts", + "summary": "The `estimate` function in this code is responsible for providing an estimated cost of processing a given repository using the Autodoc project. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe function starts by constructing the path to the JSON output directory, which will be used to store the intermediate results of the processing. It then updates the spinner text to indicate that the cost estimation is in progress.\n\nNext, the `processRepository` function is called with the provided configuration options and a `true` flag to indicate that this is a dry run. This means that the repository will not actually be processed, but the function will return the details of what would happen if it were processed. This is used to calculate the estimated cost of processing the repository.\n\nOnce the dry run is complete, the spinner is updated to show success, and the results are printed using the `printModelDetails` function. The total estimated cost is then calculated using the `totalIndexCostEstimate` function, which takes the values of the `runDetails` object as input.\n\nFinally, the estimated cost is displayed in the console using the `chalk.redBright` function to format the text in a red color. The message also includes a disclaimer that the actual cost may vary and recommends setting a limit in the user's OpenAI account to prevent unexpected charges.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './path/to/this/file';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example would estimate the cost of processing the \"my-repo\" repository with the specified configuration options.", + "questions": "1. **What is the purpose of the `estimate` function?**\n\n The `estimate` function is used to perform a dry run of the `processRepository` command to get an estimated price for indexing the given repository. It then prints the model details and the total estimated cost.\n\n2. **What are the parameters passed to the `processRepository` function?**\n\n The `processRepository` function is called with an object containing the following properties: `name`, `repositoryUrl`, `root`, `output`, `llms`, `ignore`, `filePrompt`, `folderPrompt`, `chatPrompt`, `contentType`, `targetAudience`, and `linkHosted`. Additionally, a second argument `true` is passed to indicate that it's a dry run.\n\n3. **How is the total estimated cost calculated and displayed?**\n\n The total estimated cost is calculated using the `totalIndexCostEstimate` function, which takes an array of values from the `runDetails` object. The cost is then displayed using `console.log` with `chalk.redBright` for formatting, showing the cost with two decimal places and a note that the actual cost may vary.", + "checksum": "2b0b3903432ae423bbc597d04b052ecb" } ], "folders": [], - "summary": "The `estimate` function in `index.ts` is a crucial part of the Autodoc project, as it allows users to estimate the cost of indexing a given repository before actually processing it. This function takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository.\n\nThe main steps involved in the `estimate` function are:\n\n1. Setting the output path for the JSON files generated during the process.\n2. Updating the spinner text to display \"Estimating cost...\".\n3. Performing a dry run of the `processRepository` function with the given configuration options. The dry run does not actually process the repository but instead returns the details of the models that would be processed.\n4. Stopping the spinner once the dry run is complete.\n5. Printing the details of the models obtained from the dry run using the `printModelDetails` utility function.\n6. Calculating the total estimated cost using the `totalIndexCostEstimate` utility function.\n7. Displaying the estimated cost in a user-friendly format using the `chalk` library.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './autodoc/estimate';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output/',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example demonstrates how a user can call the `estimate` function with a specific configuration to get an estimated cost for processing their repository. The function is designed to work seamlessly with other parts of the Autodoc project, such as the `processRepository` function, which is responsible for the actual processing of the repository.\n\nBy providing an estimated cost upfront, the `estimate` function helps users make informed decisions about whether to proceed with the indexing process or not. This can be particularly useful for users with large repositories or those who are working within a budget. Overall, the `estimate` function is an essential tool for users looking to leverage the power of Autodoc while managing their costs effectively.", - "questions": "" + "summary": "The `estimate` function in `index.ts` is a crucial part of the Autodoc project, as it provides an estimated cost of processing a given repository. It takes an `AutodocRepoConfig` object as input, containing various configuration options such as repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe function begins by constructing the path to the JSON output directory, which stores intermediate results of the processing. It then updates the spinner text to indicate that cost estimation is in progress. The `processRepository` function is called with the provided configuration options and a `true` flag, signifying a dry run. This dry run returns the details of what would happen if the repository were processed, which is used to calculate the estimated cost.\n\nUpon completion of the dry run, the spinner is updated to show success, and the results are printed using the `printModelDetails` function. The total estimated cost is calculated using the `totalIndexCostEstimate` function, which takes the values of the `runDetails` object as input.\n\nFinally, the estimated cost is displayed in the console using the `chalk.redBright` function to format the text in red. The message also includes a disclaimer that the actual cost may vary and recommends setting a limit in the user's OpenAI account to prevent unexpected charges.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './path/to/this/file';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example would estimate the cost of processing the \"my-repo\" repository with the specified configuration options.", + "questions": "", + "checksum": "4b8fd2b2abaec4959873fc3396c414d8" }, { "folderName": "index", - "folderPath": ".autodoc/docs/json/src/cli/commands/index", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/index", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\index", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\index", "files": [ { "fileName": "convertJsonToMarkdown.ts", - "filePath": "src/cli/commands/index/convertJsonToMarkdown.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/convertJsonToMarkdown.ts", - "summary": "The `convertJsonToMarkdown` function in this code is responsible for converting JSON files containing documentation information into Markdown files. This is done in two main steps: counting the number of files in the project and creating Markdown files for each code file in the project.\n\nFirst, the function uses the `traverseFileSystem` utility to count the number of files in the project. It takes an `AutodocRepoConfig` object as input, which contains information about the project, such as its name, root directory, output directory, and other configuration options. The `traverseFileSystem` utility is called with a `processFile` function that increments the `files` counter for each file encountered.\n\n```javascript\nawait traverseFileSystem({\n inputPath: inputRoot,\n projectName,\n processFile: () => {\n files++;\n return Promise.resolve();\n },\n ignore: [],\n filePrompt,\n folderPrompt,\n contentType,\n targetAudience,\n linkHosted,\n});\n```\n\nNext, the function defines another `processFile` function that reads the content of each JSON file, converts it to a Markdown format, and writes the output to a new Markdown file in the specified output directory. It first checks if the content exists, and if not, it returns early. It then creates the output directory if it doesn't exist, and parses the JSON content into either a `FolderSummary` or a `FileSummary` object, depending on the file name.\n\nThe function then constructs the Markdown content by including a link to the code on GitHub, the summary, and any questions if they exist. Finally, it writes the Markdown content to the output file with the `.md` extension.\n\n```javascript\nconst outputPath = getFileName(markdownFilePath, '.', '.md');\nawait fs.writeFile(outputPath, markdown, 'utf-8');\n```\n\nThe `convertJsonToMarkdown` function is then called again with the new `processFile` function to create the Markdown files for each code file in the project.\n\n```javascript\nawait traverseFileSystem({\n inputPath: inputRoot,\n projectName,\n processFile,\n ignore: [],\n filePrompt,\n folderPrompt,\n contentType,\n targetAudience,\n linkHosted,\n});\n```\n\nIn summary, this code is responsible for converting JSON files containing documentation information into Markdown files, which can be used in the larger Autodoc project to generate documentation for code repositories.", - "questions": "1. **What is the purpose of the `convertJsonToMarkdown` function?**\n\n The `convertJsonToMarkdown` function is responsible for converting JSON files containing summaries and questions about code files in a project into Markdown files. It traverses the file system, reads the JSON files, and creates corresponding Markdown files with the provided information.\n\n2. **How does the `traverseFileSystem` function work and what are its parameters?**\n\n The `traverseFileSystem` function is a utility function that recursively traverses the file system starting from a given input path. It takes an object as a parameter with properties such as `inputPath`, `projectName`, `processFile`, `ignore`, `filePrompt`, `folderPrompt`, `contentType`, `targetAudience`, and `linkHosted`. The function processes each file using the provided `processFile` callback and can be configured to ignore certain files or folders.\n\n3. **What is the purpose of the `processFile` function inside `convertJsonToMarkdown`?**\n\n The `processFile` function is a callback function that is passed to the `traverseFileSystem` function. It is responsible for reading the content of a JSON file, parsing it, and creating a corresponding Markdown file with the summary and questions. It also handles creating the output directory if it doesn't exist and writing the Markdown content to the output file." + "filePath": "src\\cli\\commands\\index\\convertJsonToMarkdown.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\convertJsonToMarkdown.ts", + "summary": "The `convertJsonToMarkdown` function in this code is responsible for converting JSON files containing documentation information into Markdown files. This function is part of the larger Autodoc project, which aims to automate the process of generating documentation for code repositories.\n\nThe function takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the project name, input and output directories, and other settings related to the documentation generation process.\n\nThe code first counts the number of files in the project by traversing the file system using the `traverseFileSystem` utility function. This is done to provide a progress update to the user via the `updateSpinnerText` function.\n\nNext, the `processFile` function is defined, which is responsible for reading the content of each JSON file, parsing it, and converting it into a Markdown format. The function checks if the file has a summary, and if so, it generates the Markdown content with a link to the code on GitHub, the summary, and any questions if present. The output Markdown file is then saved in the specified output directory.\n\nFinally, the `traverseFileSystem` function is called again, this time with the `processFile` function as an argument. This allows the code to process each JSON file in the project and convert it into a Markdown file. Once the process is complete, a success message is displayed to the user using the `spinnerSuccess` function.\n\nExample usage:\n\n```javascript\nconvertJsonToMarkdown({\n name: \"myProject\",\n root: \"./input\",\n output: \"./output\",\n filePrompt: true,\n folderPrompt: true,\n contentType: \"code\",\n targetAudience: \"developers\",\n linkHosted: \"https://github.com/user/myProject\",\n});\n```\n\nThis will convert all JSON files in the `./input` directory into Markdown files and save them in the `./output` directory.", + "questions": "1. **Question:** What is the purpose of the `convertJsonToMarkdown` function and what are the expected inputs?\n **Answer:** The `convertJsonToMarkdown` function is used to convert JSON files to Markdown files for each code file in the project. It takes an `AutodocRepoConfig` object as input, which contains various properties like projectName, root, output, filePrompt, folderPrompt, contentType, targetAudience, and linkHosted.\n\n2. **Question:** How does the `traverseFileSystem` function work and what is its role in this code?\n **Answer:** The `traverseFileSystem` function is a utility function that recursively traverses the file system, starting from the inputPath, and processes each file using the provided `processFile` function. In this code, it is used twice: first to count the number of files in the project, and then to create Markdown files for each code file in the project.\n\n3. **Question:** How are the output directories and Markdown files created, and what is the structure of the generated Markdown content?\n **Answer:** The output directories are created using the `fs.mkdir` function with the `recursive: true` option. The Markdown files are created using the `fs.writeFile` function. The structure of the generated Markdown content includes a link to view the code on GitHub, the summary, and optionally, a list of questions if they exist.", + "checksum": "79c860becf47b9882441682f0213d534" }, { "fileName": "createVectorStore.ts", - "filePath": "src/cli/commands/index/createVectorStore.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/createVectorStore.ts", - "summary": "The code in this file is responsible for processing a directory of text files, splitting the text into chunks, and creating a vector store using the HNSWLib library and OpenAIEmbeddings.\n\nThe `processFile` function takes a file path as input and returns a Promise that resolves to a Document object. It reads the file contents and creates a Document object with the file contents as `pageContent` and the file path as metadata.\n\nThe `processDirectory` function takes a directory path as input and returns a Promise that resolves to an array of Document objects. It reads the files in the directory and calls `processFile` for each file. If a file is a directory, it calls `processDirectory` recursively. The function accumulates all the Document objects in an array and returns it.\n\nThe `RepoLoader` class extends the `BaseDocumentLoader` class and has a constructor that takes a file path as input. It has a `load` method that calls the `processDirectory` function with the file path and returns the resulting array of Document objects.\n\nThe `createVectorStore` function is an async function that takes an AutodocRepoConfig object as input, which contains the root directory and output file path. It creates a RepoLoader instance with the root directory, loads the raw documents, and splits them into chunks using the `RecursiveCharacterTextSplitter` class. It then creates a vector store using the HNSWLib library and OpenAIEmbeddings, and saves the vector store to the output file path.\n\nExample usage:\n\n```javascript\nconst config = {\n root: './data/documents',\n output: './data/vector_store',\n};\n\ncreateVectorStore(config).then(() => {\n console.log('Vector store created successfully');\n});\n```\n\nThis code snippet would process all the text files in the `./data/documents` directory, split the text into chunks, create a vector store using the HNSWLib library and OpenAIEmbeddings, and save the vector store to the `./data/vector_store` file.", - "questions": "1. **Question:** What is the purpose of the `processFile` function and how does it handle errors?\n **Answer:** The `processFile` function reads the content of a file and creates a `Document` object with the file contents and metadata. If there is an error while reading the file, it rejects the promise with the error.\n\n2. **Question:** How does the `processDirectory` function handle nested directories and files?\n **Answer:** The `processDirectory` function iterates through the files in a directory. If it encounters a subdirectory, it calls itself recursively to process the subdirectory. If it encounters a file, it processes the file using the `processFile` function and adds the resulting `Document` object to the `docs` array.\n\n3. **Question:** What is the purpose of the `createVectorStore` function and how does it use the `RepoLoader` class?\n **Answer:** The `createVectorStore` function is responsible for creating a vector store from a given repository. It uses the `RepoLoader` class to load all the documents from the repository, splits the text into chunks using the `RecursiveCharacterTextSplitter`, and then creates a vector store using the `HNSWLib.fromDocuments` method with the `OpenAIEmbeddings`. Finally, it saves the vector store to the specified output path." + "filePath": "src\\cli\\commands\\index\\createVectorStore.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\createVectorStore.ts", + "summary": "The code in this file is responsible for processing a directory of text files, splitting the text into chunks, and creating a vector store using the HNSWLib library and OpenAIEmbeddings. This vector store can be used for efficient similarity search and retrieval of documents in the larger project.\n\nThe `processFile` function reads a file's content and creates a `Document` object with the content and metadata (source file path). It returns a Promise that resolves to the created Document.\n\nThe `processDirectory` function is a recursive function that processes a directory and its subdirectories. It reads the files in the directory, and for each file, it checks if it's a directory or a regular file. If it's a directory, the function calls itself with the new directory path. If it's a file, it calls the `processFile` function to create a Document object. The function returns an array of Document objects.\n\nThe `RepoLoader` class extends the `BaseDocumentLoader` class and has a constructor that takes a file path as an argument. It has a `load` method that calls the `processDirectory` function with the given file path and returns the array of Document objects.\n\nThe `createVectorStore` function is an async function that takes an `AutodocRepoConfig` object as an argument, which contains the root directory and output file path. It creates a `RepoLoader` instance with the root directory and loads the documents using the `load` method. It then creates a `RecursiveCharacterTextSplitter` instance with a specified chunk size and chunk overlap and splits the documents into chunks. Finally, it creates a vector store using the HNSWLib library and OpenAIEmbeddings with the processed documents and saves the vector store to the output file path.\n\nExample usage:\n\n```javascript\nconst config = {\n root: './data/documents',\n output: './data/vector_store',\n};\n\ncreateVectorStore(config).then(() => {\n console.log('Vector store created successfully');\n});\n```", + "questions": "1. **Question:** What is the purpose of the `processFile` function and what does it return?\n **Answer:** The `processFile` function is an asynchronous function that reads the content of a file given its file path, creates a `Document` object with the file contents and metadata (source file path), and returns a Promise that resolves to the created `Document` object.\n\n2. **Question:** How does the `processDirectory` function work and what does it return?\n **Answer:** The `processDirectory` function is an asynchronous function that takes a directory path as input, reads all the files and subdirectories within it, and processes them recursively. It returns a Promise that resolves to an array of `Document` objects created from the files in the directory and its subdirectories.\n\n3. **Question:** What is the purpose of the `createVectorStore` function and how does it work?\n **Answer:** The `createVectorStore` function is an asynchronous function that takes an `AutodocRepoConfig` object as input, which contains the root directory path and output file path. The function loads all the documents from the root directory using the `RepoLoader`, splits the text into chunks using the `RecursiveCharacterTextSplitter`, creates a vector store from the documents using the `HNSWLib` and `OpenAIEmbeddings`, and saves the vector store to the specified output file.", + "checksum": "a3409c4340753a867c72eebef7626fb9" }, { "fileName": "index.ts", - "filePath": "src/cli/commands/index/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/index.ts", - "summary": "The code in this file is responsible for processing a given repository and generating documentation in JSON and Markdown formats, as well as creating vector files for the documentation. It exports a single function `index` that takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository.\n\nThe `index` function performs the following steps:\n\n1. Define the paths for JSON, Markdown, and data output directories within the `output` folder.\n\n2. Process the repository by traversing its files, calling the LLMS (Language Learning Management System) for each file, and creating JSON files with the results. This is done using the `processRepository` function, which takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step.\n\n3. Convert the generated JSON files into Markdown format using the `convertJsonToMarkdown` function. This function also takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step, and a success message is displayed upon completion.\n\n4. Create vector files for the generated Markdown documentation using the `createVectorStore` function. This function also takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step, and a success message is displayed upon completion.\n\nHere's an example of how this code might be used in the larger project:\n\n```javascript\nimport autodoc from './autodoc';\n\nconst config = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n root: './src',\n output: './output',\n llms: 'https://llms.example.com',\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'text',\n targetAudience: 'developers',\n linkHosted: 'https://myproject-docs.example.com',\n};\n\nautodoc.index(config);\n```\n\nThis example would process the `MyProject` repository, generate JSON and Markdown documentation, and create vector files for the documentation, all while providing progress updates through spinner text.", - "questions": "1. **What is the purpose of the `index` function in this code?**\n\n The `index` function is the main entry point for the autodoc project. It processes a given repository, converts the JSON files to markdown, and creates vector files based on the provided configuration options.\n\n2. **What are the different steps involved in processing the repository?**\n\n The processing of the repository involves three main steps: (1) traversing the repository and calling LLMS for each file to create JSON files with the results, (2) converting the JSON files to markdown files, and (3) creating vector files from the markdown files.\n\n3. **What is the role of the `AutodocRepoConfig` type?**\n\n The `AutodocRepoConfig` type is used to define the shape of the configuration object that is passed to the `index` function. It specifies the properties and their types that are required for the function to process the repository, convert JSON to markdown, and create vector files." + "filePath": "src\\cli\\commands\\index\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\index.ts", + "summary": "The code in this file is responsible for processing a given repository and generating documentation in JSON, Markdown, and vector formats. It exports a single function `index` that takes an `AutodocRepoConfig` object as its argument, which contains various configuration options for processing the repository.\n\nThe `index` function performs three main tasks:\n\n1. **Process the repository**: It traverses the repository, calls the LLMS (Language Learning Management System) for each file, and creates JSON files with the results. This is done using the `processRepository` function, which takes the same configuration options as the `index` function. The JSON files are stored in the `output/docs/json/` directory.\n\n ```javascript\n updateSpinnerText('Processing repository...');\n await processRepository({ /* configuration options */ });\n spinnerSuccess();\n ```\n\n2. **Create Markdown files**: It converts the generated JSON files into Markdown files using the `convertJsonToMarkdown` function. This function also takes the same configuration options as the `index` function. The Markdown files are stored in the `output/docs/markdown/` directory.\n\n ```javascript\n updateSpinnerText('Creating markdown files...');\n await convertJsonToMarkdown({ /* configuration options */ });\n spinnerSuccess();\n ```\n\n3. **Create vector files**: It creates vector files from the generated Markdown files using the `createVectorStore` function. This function also takes the same configuration options as the `index` function. The vector files are stored in the `output/docs/data/` directory.\n\n ```javascript\n updateSpinnerText('Create vector files...');\n await createVectorStore({ /* configuration options */ });\n spinnerSuccess();\n ```\n\nThroughout the execution of these tasks, the code uses `updateSpinnerText` and `spinnerSuccess` functions to provide visual feedback on the progress of the tasks.\n\nIn the larger project, this code would be used to automatically generate documentation for a given repository based on the provided configuration options. The generated documentation can then be used for various purposes, such as displaying it on a website or analyzing the content for specific insights.", + "questions": "1. **What does the `index` function do in this code?**\n\n The `index` function is the main entry point for the autodoc project. It takes an `AutodocRepoConfig` object as input and performs three main tasks: processing the repository and creating JSON files, converting JSON files to markdown files, and creating vector files.\n\n2. **What is the purpose of the `processRepository`, `convertJsonToMarkdown`, and `createVectorStore` functions?**\n\n The `processRepository` function traverses the repository, calls LLMS for each file, and creates JSON files with the results. The `convertJsonToMarkdown` function creates markdown files from the generated JSON files. The `createVectorStore` function creates vector files from the markdown files.\n\n3. **What are the different types of prompts (`filePrompt`, `folderPrompt`, `chatPrompt`) used for in this code?**\n\n These prompts are likely used to interact with the user during the processing of the repository. The `filePrompt` might be used to ask the user for input regarding specific files, the `folderPrompt` for input regarding folders, and the `chatPrompt` for general input or feedback during the processing.", + "checksum": "4060b1affae5a6c385cda308b3cd1750" }, { "fileName": "processRepository.ts", - "filePath": "src/cli/commands/index/processRepository.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/processRepository.ts", - "summary": "The `processRepository` function in this code is responsible for processing a given code repository and generating summaries and questions for each file and folder within the repository. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository URL, input and output paths, language models to use, and other settings.\n\nThe function starts by initializing an `APIRateLimit` instance to limit the number of API calls made to the language models. It then defines several helper functions, such as `callLLM` for making API calls, `isModel` for checking if a given model is valid, `processFile` for processing individual files, and `processFolder` for processing folders.\n\nThe `processFile` function reads the content of a file, generates prompts for summaries and questions using the `createCodeFileSummary` and `createCodeQuestions` functions, and selects the best language model to use based on the token length of the prompts. It then calls the language model API to generate the summaries and questions, and saves the results as JSON files in the output directory.\n\nThe `processFolder` function reads the contents of a folder, filters out ignored files, and processes each file and subfolder within the folder. It then generates a summary prompt using the `folderSummaryPrompt` function and calls the language model API to generate a summary for the folder. The folder summary, along with the summaries and questions of its files and subfolders, is saved as a JSON file in the output directory.\n\nThe main part of the `processRepository` function first counts the number of files and folders in the input directory using the `filesAndFolders` function. It then processes each file and folder using the `traverseFileSystem` function, which calls the `processFile` and `processFolder` functions for each file and folder encountered. Finally, the function returns the language models used during processing.\n\nExample usage of the `processRepository` function:\n\n```javascript\nconst autodocConfig = {\n name: 'myProject',\n repositoryUrl: 'https://github.com/user/myProject',\n root: 'src',\n output: 'output',\n llms: [LLMModels.GPT3, LLMModels.GPT4],\n ignore: ['.git', 'node_modules'],\n filePrompt: 'Explain this code file',\n folderPrompt: 'Summarize this folder',\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nprocessRepository(autodocConfig).then((models) => {\n console.log('Processing complete');\n});\n```\n\nThis code would process the `src` directory of the `myProject` repository, generating summaries and questions for each file and folder, and saving the results in the `output` directory.", - "questions": "1. **Question:** What is the purpose of the `processRepository` function and what are its input parameters?\n **Answer:** The `processRepository` function is responsible for processing a code repository by generating summaries and questions for each file and folder in the project. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the project name, repository URL, input and output paths, language models, and other settings. Additionally, it accepts an optional `dryRun` parameter, which, if set to true, will not save the generated summaries and questions to disk.\n\n2. **Question:** How does the code determine the best language model to use for generating summaries and questions?\n **Answer:** The code checks the maximum token length of each available language model (GPT3, GPT4, and GPT432k) and compares it with the token length of the prompts (summary and questions). It selects the first model that can handle the maximum token length and is included in the `llms` array provided in the configuration.\n\n3. **Question:** How does the code handle traversing the file system and processing files and folders?\n **Answer:** The code uses the `traverseFileSystem` utility function to traverse the file system. It takes an object with various configuration options, including the input path, project name, and callbacks for processing files and folders. The `processFile` and `processFolder` functions are passed as callbacks to handle the processing of files and folders, respectively." + "filePath": "src\\cli\\commands\\index\\processRepository.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\processRepository.ts", + "summary": "The `processRepository` function in this code is responsible for generating summaries and questions for code files and folders in a given repository. It takes an `AutodocRepoConfig` object as input, which contains information about the project, repository URL, input and output paths, language models, and other configurations. An optional `dryRun` parameter can be provided to skip actual API calls and file writing.\n\nThe function starts by initializing the encoding and rate limit for API calls. It then defines two main helper functions: `processFile` and `processFolder`. The `processFile` function is responsible for processing individual code files. It reads the file content, calculates a checksum, and checks if reindexing is needed. If reindexing is required, it creates prompts for summaries and questions, selects the appropriate language model based on the input length, and calls the language model API to generate the summaries and questions. The results are then saved to a JSON file in the output directory.\n\nThe `processFolder` function is responsible for processing folders. It reads the folder content, calculates a checksum, and checks if reindexing is needed. If reindexing is required, it reads the summaries and questions of all files and subfolders in the folder, calls the language model API to generate a summary for the folder, and saves the result to a `summary.json` file in the folder.\n\nThe main function then counts the number of files and folders in the project and processes them using the `traverseFileSystem` utility function. It processes all files first, followed by all folders. Finally, it returns the language model usage statistics.\n\nThe `calculateChecksum` function calculates the checksum of a list of file contents, while the `reindexCheck` function checks if reindexing is needed by comparing the new and old checksums of a file or folder.", + "questions": "1. **Question:** What is the purpose of the `processRepository` function and what are its inputs and outputs?\n **Answer:** The `processRepository` function processes a given code repository, generating summaries and questions for each file and folder within the repository. It takes an `AutodocRepoConfig` object and an optional `dryRun` boolean as inputs. The function returns a `Promise` that resolves to an object containing the models used during processing.\n\n2. **Question:** How does the `calculateChecksum` function work and what is its purpose?\n **Answer:** The `calculateChecksum` function takes an array of file contents as input and calculates a checksum for each file using the MD5 hashing algorithm. It then concatenates all the checksums and calculates a final checksum using MD5 again. The purpose of this function is to generate a unique identifier for the contents of the files, which can be used to determine if the files have changed and need to be reprocessed.\n\n3. **Question:** How does the `reindexCheck` function work and when is it used?\n **Answer:** The `reindexCheck` function checks if a summary.json file exists in the given file or folder path and compares the stored checksum with the new checksum to determine if the file or folder needs to be reindexed. It is used in the `processFile` and `processFolder` functions to decide whether to regenerate summaries and questions for a file or folder based on changes in their contents.", + "checksum": "5b3ae9ffad1d4b4a22c6f7fd66bbde6f" }, { "fileName": "prompts.ts", - "filePath": "src/cli/commands/index/prompts.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/prompts.ts", - "summary": "The code in this file provides three functions that generate prompts for documentation experts to create summaries and answer questions about code files and folders in a project. These functions are likely used in the larger autodoc project to automate the process of generating documentation for code files and folders.\n\n1. `createCodeFileSummary`: This function takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `filePrompt`. It returns a formatted string prompt for a documentation expert to write a summary of the code file. The prompt includes the file path, project name, content type, and a custom file prompt. For example:\n\n```javascript\ncreateCodeFileSummary('src/example.js', 'autodoc', 'console.log(\"Hello, World!\");', 'JavaScript', 'Write a detailed technical explanation of what this code does.');\n```\n\n2. `createCodeQuestions`: This function takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `targetAudience`. It returns a formatted string prompt for a documentation expert to generate three questions and answers that a target audience might have about the code file. The prompt includes the file path, project name, content type, and target audience. For example:\n\n```javascript\ncreateCodeQuestions('src/example.js', 'autodoc', 'console.log(\"Hello, World!\");', 'JavaScript', 'beginner');\n```\n\n3. `folderSummaryPrompt`: This function takes six parameters: `folderPath`, `projectName`, `files`, `folders`, `contentType`, and `folderPrompt`. It returns a formatted string prompt for a documentation expert to write a summary of the folder and its contents. The prompt includes the folder path, project name, content type, a list of files and their summaries, a list of subfolders and their summaries, and a custom folder prompt. For example:\n\n```javascript\nfolderSummaryPrompt('src/', 'autodoc', [{fileName: 'example.js', summary: 'A simple example file'}], [{folderName: 'utils', summary: 'Utility functions'}], 'JavaScript', 'Write a detailed technical explanation of the folder structure and contents.');\n```\n\nThese functions can be used in the autodoc project to generate prompts for documentation experts, helping to streamline the process of creating documentation for code files and folders.", - "questions": "1. **Question:** What is the purpose of the `createCodeFileSummary` function?\n **Answer:** The `createCodeFileSummary` function generates a string template for a code file summary prompt, which includes the file path, project name, file contents, content type, and a file prompt.\n\n2. **Question:** How does the `createCodeQuestions` function differ from the `createCodeFileSummary` function?\n **Answer:** The `createCodeQuestions` function generates a string template for a code documentation prompt that asks for 3 questions and their answers, while the `createCodeFileSummary` function generates a string template for a code file summary prompt.\n\n3. **Question:** What is the purpose of the `folderSummaryPrompt` function and what parameters does it take?\n **Answer:** The `folderSummaryPrompt` function generates a string template for a folder summary prompt, which includes the folder path, project name, files, folders, content type, and a folder prompt. It takes parameters such as folderPath, projectName, files, folders, contentType, and folderPrompt." + "filePath": "src\\cli\\commands\\index\\prompts.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\prompts.ts", + "summary": "This code defines three utility functions that generate prompts for documentation experts working on a project. These functions are used to create documentation for code files and folders within a project. The generated prompts are in markdown format and include specific instructions for the documentation expert.\n\n1. `createCodeFileSummary`: This function generates a prompt for creating a summary of a code file. It takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `filePrompt`. The function returns a markdown formatted string that includes the file's content and a custom prompt for the documentation expert.\n\nExample usage:\n```javascript\nconst prompt = createCodeFileSummary('path/to/file.js', 'MyProject', 'const x = 10;', 'JavaScript', 'Write a detailed technical explanation of this code.');\n```\n\n2. `createCodeQuestions`: This function generates a prompt for creating a list of questions and answers about a code file. It takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `targetAudience`. The function returns a markdown formatted string that includes the file's content and a custom prompt for the documentation expert to provide questions and answers.\n\nExample usage:\n```javascript\nconst prompt = createCodeQuestions('path/to/file.js', 'MyProject', 'const x = 10;', 'JavaScript', 'beginner');\n```\n\n3. `folderSummaryPrompt`: This function generates a prompt for creating a summary of a folder containing code files and subfolders. It takes six parameters: `folderPath`, `projectName`, `files`, `folders`, `contentType`, and `folderPrompt`. The `files` parameter is an array of `FileSummary` objects, and the `folders` parameter is an array of `FolderSummary` objects. The function returns a markdown formatted string that includes a list of files and folders with their summaries and a custom prompt for the documentation expert.\n\nExample usage:\n```javascript\nconst prompt = folderSummaryPrompt('path/to/folder', 'MyProject', fileSummaries, folderSummaries, 'JavaScript', 'Write a detailed technical explanation of this folder structure.');\n```\n\nThese functions can be used in the larger project to generate documentation tasks for experts, ensuring consistent formatting and instructions across different parts of the project.", + "questions": "1. **What is the purpose of the `createCodeFileSummary` function?**\n\n The `createCodeFileSummary` function generates a string template for a code file summary prompt, which includes the file path, project name, file contents, content type, and a file prompt.\n\n2. **How does the `createCodeQuestions` function differ from the `createCodeFileSummary` function?**\n\n The `createCodeQuestions` function generates a string template for a code documentation prompt that asks for 3 questions and their answers, while the `createCodeFileSummary` function generates a string template for a code file summary prompt.\n\n3. **What is the role of the `folderSummaryPrompt` function?**\n\n The `folderSummaryPrompt` function generates a string template for a folder summary prompt, which includes the folder path, project name, lists of files and folders with their summaries, content type, and a folder prompt.", + "checksum": "e44b82bf4912be69149685a997b6bde3" } ], "folders": [], - "summary": "The code in this folder is responsible for processing a given code repository, generating documentation in JSON and Markdown formats, and creating vector files for the documentation. It provides several functions and utilities to achieve these tasks, such as traversing the file system, calling language models, and converting JSON files to Markdown.\n\nFor example, the `processRepository` function processes a code repository and generates summaries and questions for each file and folder within the repository. It uses helper functions like `callLLM` to make API calls to language models and `processFile` and `processFolder` to process individual files and folders. The results are saved as JSON files in the output directory.\n\nThe `convertJsonToMarkdown` function converts JSON files containing documentation information into Markdown files. It counts the number of files in the project and creates Markdown files for each code file in the project using the `traverseFileSystem` utility.\n\nThe `createVectorStore` function processes a directory of text files, splits the text into chunks, and creates a vector store using the HNSWLib library and OpenAIEmbeddings. It processes the files in the directory and calls `processFile` for each file, creating a vector store and saving it to the output file path.\n\nHere's an example of how this code might be used in the larger project:\n\n```javascript\nimport autodoc from './autodoc';\n\nconst config = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n root: './src',\n output: './output',\n llms: 'https://llms.example.com',\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'text',\n targetAudience: 'developers',\n linkHosted: 'https://myproject-docs.example.com',\n};\n\nautodoc.index(config);\n```\n\nThis example would process the `MyProject` repository, generate JSON and Markdown documentation, and create vector files for the documentation, all while providing progress updates through spinner text.\n\nIn summary, the code in this folder plays a crucial role in the Autodoc project by processing code repositories, generating documentation in various formats, and creating vector files for the documentation. This helps developers to easily generate and maintain documentation for their projects, making it more accessible and understandable for other developers and users.", - "questions": "" + "summary": "The code in this folder is responsible for processing a given repository and generating documentation in JSON, Markdown, and vector formats. It consists of several functions and utilities that work together to automate the documentation generation process.\n\nThe main function, `index`, takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository. It performs three main tasks:\n\n1. **Process the repository**: It calls the `processRepository` function to traverse the repository, generate summaries and questions for code files and folders using the LLMS (Language Learning Management System), and create JSON files with the results. These JSON files are stored in the `output/docs/json/` directory.\n\n2. **Create Markdown files**: It uses the `convertJsonToMarkdown` function to convert the generated JSON files into Markdown files. These Markdown files are stored in the `output/docs/markdown/` directory.\n\n3. **Create vector files**: It calls the `createVectorStore` function to create vector files from the generated Markdown files. These vector files are stored in the `output/docs/data/` directory.\n\nThroughout the execution of these tasks, the code provides visual feedback on the progress of the tasks using `updateSpinnerText` and `spinnerSuccess` functions.\n\nHere's an example of how this code might be used:\n\n```javascript\nindex({\n name: \"myProject\",\n root: \"./input\",\n output: \"./output\",\n filePrompt: true,\n folderPrompt: true,\n contentType: \"code\",\n targetAudience: \"developers\",\n linkHosted: \"https://github.com/user/myProject\",\n});\n```\n\nThis will process the repository located at `./input`, generate documentation in JSON, Markdown, and vector formats, and save the results in the `./output` directory.\n\nThe `prompts.ts` file contains utility functions that generate prompts for documentation experts. These functions create markdown formatted strings with specific instructions for the documentation expert, ensuring consistent formatting and instructions across different parts of the project.\n\nIn summary, the code in this folder automates the process of generating documentation for a given repository based on the provided configuration options. The generated documentation can be used for various purposes, such as displaying it on a website or analyzing the content for specific insights.", + "questions": "", + "checksum": "376f96417f8cbea6a5ab2463268fe4af" }, { "folderName": "init", - "folderPath": ".autodoc/docs/json/src/cli/commands/init", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/init", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\init", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\init", "files": [ { "fileName": "index.ts", - "filePath": "src/cli/commands/init/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/init/index.ts", - "summary": "This code is responsible for initializing and configuring the `autodoc` project. It provides a function `init` that creates a configuration file `autodoc.config.json` with user inputs and default values. The configuration file is essential for the project to function correctly and adapt to different user requirements.\n\nThe `makeConfigTemplate` function generates a default configuration object with pre-defined values. It takes an optional `config` parameter to override the default values. The returned object contains settings such as repository name, URL, output directory, LLM models, and various prompts for generating documentation.\n\nThe `init` function is an asynchronous function that takes an optional `config` parameter. It first checks if a configuration file already exists in the project directory. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf there is no existing configuration file or the user chooses to overwrite, the function prompts the user for the repository name, URL, and LLM models they have access to. These values are then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the `autodoc.config.json` file in the project directory. A success message is displayed, instructing the user to run `doc index` to get started.\n\nHere's an example of how the `init` function is used:\n\n```javascript\nimport { init } from './autodoc';\n\n(async () => {\n await init();\n})();\n```\n\nThis code imports the `init` function and calls it, initializing the `autodoc` project with the user's inputs and default values.", - "questions": "1. **Question:** What is the purpose of the `makeConfigTemplate` function and what does it return?\n **Answer:** The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc project. It takes an optional `config` parameter of type `AutodocRepoConfig` and returns a new `AutodocRepoConfig` object with default values for each property, using the provided `config` values if available.\n\n2. **Question:** How does the `init` function work and what does it do with the user's input?\n **Answer:** The `init` function is an asynchronous function that initializes the Autodoc configuration by prompting the user for input using the `inquirer` package. It takes an optional `config` parameter of type `AutodocRepoConfig` and uses it as the default values for the prompts. After collecting the user's input, it creates a new configuration object using the `makeConfigTemplate` function and writes it to a file named `autodoc.config.json`.\n\n3. **Question:** What are the different LLM models available in the `llms` prompt and how are they used in the configuration?\n **Answer:** The `llms` prompt provides three choices for the user to select the LLM models they have access to: GPT-3.5 Turbo, GPT-3.5 Turbo and GPT-4 8K (Early Access), and GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The selected LLM models are stored in the `llms` property of the `AutodocRepoConfig` object, which can be used later in the project to determine which models to use for generating documentation." + "filePath": "src\\cli\\commands\\init\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\init\\index.ts", + "summary": "This code is responsible for initializing the configuration of the Autodoc project. It provides a template for the configuration and prompts the user to input necessary information to set up the project. The main functionality is exposed through the `init` function, which is an asynchronous function that takes an optional `AutodocRepoConfig` object as an argument.\n\nThe `makeConfigTemplate` function creates a default configuration object with pre-defined values for various properties. It takes an optional `config` parameter and returns a new `AutodocRepoConfig` object with the provided values or default values if not provided.\n\nThe `init` function first checks if an `autodoc.config.json` file already exists in the project root. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nNext, the user is prompted to enter the name of their repository, the GitHub URL of their repository, and the LLMs they have access to. The LLMs are language models used for generating documentation. The user can choose between GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access).\n\nAfter the user provides the necessary information, a new configuration object is created using the `makeConfigTemplate` function with the user's input. The new configuration is then written to the `autodoc.config.json` file in the project root.\n\nFinally, a success message is displayed, instructing the user to run `doc index` to get started with the Autodoc project.\n\nExample usage:\n\n```javascript\nimport { init } from './path/to/this/file';\n\n// Initialize the configuration with default values\nawait init();\n\n// Initialize the configuration with custom values\nawait init({\n name: 'My Custom Repository',\n repositoryUrl: 'https://github.com/user/repo',\n});\n```", + "questions": "1. **What is the purpose of the `makeConfigTemplate` function?**\n\n The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc project. It takes an optional `config` parameter of type `AutodocRepoConfig` and returns a new configuration object with default values for various properties.\n\n2. **How does the `init` function work and when is it called?**\n\n The `init` function is an asynchronous function that initializes the Autodoc configuration by creating an `autodoc.config.json` file in the specified location. It takes an optional `config` parameter of type `AutodocRepoConfig` and prompts the user for input to set the configuration values. It is called when the user wants to set up the Autodoc configuration for their project.\n\n3. **What is the purpose of the `inquirer.prompt` calls in the `init` function?**\n\n The `inquirer.prompt` calls are used to interactively prompt the user for input to set the configuration values for the Autodoc project. The user is asked for the repository name, repository URL, and the LLMs they have access to. The input is then used to create a new configuration object and write it to the `autodoc.config.json` file.", + "checksum": "b93831ff1f4023ab61c3bea963a8a112" } ], "folders": [], - "summary": "The `index.ts` file in the `init` folder is responsible for initializing and configuring the `autodoc` project. It provides an essential function called `init` that creates a configuration file named `autodoc.config.json` with user inputs and default values. This configuration file is crucial for the project to function correctly and adapt to different user requirements.\n\nThe `makeConfigTemplate` function generates a default configuration object with pre-defined values. It takes an optional `config` parameter to override the default values. The returned object contains settings such as repository name, URL, output directory, LLM models, and various prompts for generating documentation.\n\nThe `init` function is an asynchronous function that takes an optional `config` parameter. It first checks if a configuration file already exists in the project directory. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf there is no existing configuration file or the user chooses to overwrite, the function prompts the user for the repository name, URL, and LLM models they have access to. These values are then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the `autodoc.config.json` file in the project directory. A success message is displayed, instructing the user to run `doc index` to get started.\n\nHere's an example of how the `init` function is used:\n\n```javascript\nimport { init } from './autodoc';\n\n(async () => {\n await init();\n})();\n```\n\nThis code imports the `init` function and calls it, initializing the `autodoc` project with the user's inputs and default values. The `init` function is a crucial part of the project, as it sets up the necessary configuration for the project to work correctly. It interacts with other parts of the project by providing the required settings and values, ensuring that the project can adapt to different user requirements and preferences.", - "questions": "" + "summary": "The `index.ts` file in the `.autodoc\\docs\\json\\src\\cli\\commands\\init` folder is responsible for initializing the configuration of the Autodoc project. It provides a template for the configuration and prompts the user to input necessary information to set up the project. The main functionality is exposed through the `init` function, which is an asynchronous function that takes an optional `AutodocRepoConfig` object as an argument.\n\nThe `makeConfigTemplate` function creates a default configuration object with pre-defined values for various properties. It takes an optional `config` parameter and returns a new `AutodocRepoConfig` object with the provided values or default values if not provided.\n\nThe `init` function first checks if an `autodoc.config.json` file already exists in the project root. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nNext, the user is prompted to enter the name of their repository, the GitHub URL of their repository, and the LLMs they have access to. The LLMs are language models used for generating documentation. The user can choose between GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access).\n\nAfter the user provides the necessary information, a new configuration object is created using the `makeConfigTemplate` function with the user's input. The new configuration is then written to the `autodoc.config.json` file in the project root.\n\nFinally, a success message is displayed, instructing the user to run `doc index` to get started with the Autodoc project.\n\nExample usage:\n\n```javascript\nimport { init } from './path/to/this/file';\n\n// Initialize the configuration with default values\nawait init();\n\n// Initialize the configuration with custom values\nawait init({\n name: 'My Custom Repository',\n repositoryUrl: 'https://github.com/user/repo',\n});\n```\n\nThis code is essential for setting up the Autodoc project, as it creates the necessary configuration file and gathers user input to customize the project. It works in conjunction with other parts of the project, such as the CLI and the documentation generation process, which rely on the configuration file to function correctly.", + "questions": "", + "checksum": "4b8fd2b2abaec4959873fc3396c414d8" }, { "folderName": "query", - "folderPath": ".autodoc/docs/json/src/cli/commands/query", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/query", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\query", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\query", "files": [ { "fileName": "createChatChain.ts", - "filePath": "src/cli/commands/query/createChatChain.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/query/createChatChain.ts", - "summary": "This code defines a function `makeChain` that creates a chatbot for answering questions about a software project. The chatbot is built using the `ChatVectorDBQAChain` class, which combines two separate language models: a question generator and a document chain.\n\nThe question generator is an instance of the `LLMChain` class, which uses the OpenAIChat API to generate standalone questions based on a given conversation history. The `CONDENSE_PROMPT` template is used to format the input for the language model.\n\nThe document chain is created using the `loadQAChain` function, which takes an instance of the OpenAIChat API and a prompt template as input. The `makeQAPrompt` function generates this template, which instructs the language model to provide a conversational answer with hyperlinks to the project's GitHub repository. The answer should be tailored to the target audience and include code examples when appropriate.\n\nThe `makeChain` function takes the following parameters:\n\n- `projectName`: The name of the software project.\n- `repositoryUrl`: The URL of the project's GitHub repository.\n- `contentType`: The type of content the chatbot is trained on (e.g., code, documentation).\n- `chatPrompt`: Additional instructions for answering questions about the content.\n- `targetAudience`: The intended audience for the chatbot's answers (e.g., developers, users).\n- `vectorstore`: An instance of the `HNSWLib` class for storing and searching vectors.\n- `llms`: An array of language models (e.g., GPT-3, GPT-4).\n- `onTokenStream`: An optional callback function to handle streaming tokens.\n\nExample usage:\n\n```javascript\nconst chatbot = makeChain(\n \"autodoc\",\n \"https://github.com/autodoc/autodoc\",\n \"code\",\n \"\",\n \"developer\",\n vectorstore,\n [gpt3, gpt4],\n (token) => console.log(token)\n);\n```\n\nThis creates a chatbot that can answer questions about the \"autodoc\" project, using the provided language models and vector store.", - "questions": "1. **Question:** What is the purpose of the `makeChain` function and what are its input parameters?\n **Answer:** The `makeChain` function is used to create a new `ChatVectorDBQAChain` instance, which is responsible for generating questions and answers based on the given input parameters. The input parameters include `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and an optional `onTokenStream` callback function.\n\n2. **Question:** What are the roles of `CONDENSE_PROMPT` and `QA_PROMPT` in the code?\n **Answer:** `CONDENSE_PROMPT` is a template for generating a standalone question from a given chat history and follow-up input. `QA_PROMPT` is a template for generating a conversational answer with hyperlinks back to GitHub, based on the given context and question. Both templates are used in the `LLMChain` and `loadQAChain` instances, respectively.\n\n3. **Question:** How does the `onTokenStream` callback function work and when is it used?\n **Answer:** The `onTokenStream` callback function is an optional parameter in the `makeChain` function. It is used to handle the streaming of tokens generated by the OpenAIChat instance. If provided, it will be called with each new token generated during the chat process, allowing developers to handle or process the tokens in real-time." + "filePath": "src\\cli\\commands\\query\\createChatChain.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\query\\createChatChain.ts", + "summary": "This code defines a function `makeChain` that creates a chatbot for answering questions about a software project called `projectName`. The chatbot is trained on the content of the project, which is located at `repositoryUrl`. The content type of the project is specified by the `contentType` parameter. The chatbot is designed to provide conversational answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate. The target audience for the chatbot is specified by the `targetAudience` parameter.\n\nThe `makeChain` function takes several parameters:\n\n- `projectName`: The name of the software project.\n- `repositoryUrl`: The URL of the project's repository.\n- `contentType`: The type of content the chatbot is trained on.\n- `chatPrompt`: Additional instructions for answering questions about the content type.\n- `targetAudience`: The intended audience for the chatbot's answers.\n- `vectorstore`: An instance of HNSWLib for efficient nearest neighbor search.\n- `llms`: An array of LLMModels, which are language models used for generating answers.\n- `onTokenStream`: An optional callback function that is called when a new token is generated by the language model.\n\nThe `makeChain` function first creates a question generator using the `LLMChain` class. This generator is responsible for rephrasing follow-up questions to be standalone questions. It uses the `CONDENSE_PROMPT` template, which is defined at the beginning of the code.\n\nNext, the function creates a `QA_PROMPT` template using the `makeQAPrompt` function. This template is used to generate answers to the questions in a conversational manner, with hyperlinks back to GitHub and code examples where appropriate.\n\nFinally, the function creates and returns a new instance of the `ChatVectorDBQAChain` class, which combines the question generator and the document chain to create a chatbot that can answer questions about the software project. The chatbot uses the `vectorstore` for efficient nearest neighbor search and the `llms` language models for generating answers. If the `onTokenStream` callback is provided, it will be called when a new token is generated by the language model.", + "questions": "1. **Question:** What is the purpose of the `makeChain` function and what are its input parameters?\n\n **Answer:** The `makeChain` function is used to create a `ChatVectorDBQAChain` instance, which is responsible for generating questions and answers based on the given input parameters. The input parameters include `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and an optional `onTokenStream` function.\n\n2. **Question:** What are the roles of `CONDENSE_PROMPT` and `QA_PROMPT` in this code?\n\n **Answer:** `CONDENSE_PROMPT` is a template for generating standalone questions from a given chat history and follow-up question. `QA_PROMPT` is a template for generating conversational answers with hyperlinks to GitHub, based on the provided context and question. Both templates are used in the `LLMChain` and `loadQAChain` instances, respectively.\n\n3. **Question:** How does the `onTokenStream` function work and when is it used?\n\n **Answer:** The `onTokenStream` function is an optional callback that can be provided to the `makeChain` function. It is used to handle the streaming of tokens generated by the OpenAIChat instance. If provided, it will be called with each new token generated during the chat process.", + "checksum": "6869048a06de62499933b14c37cddc1d" }, { "fileName": "index.ts", - "filePath": "src/cli/commands/query/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/query/index.ts", - "summary": "This code defines a chatbot interface for the Autodoc project, which allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation.\n\nThe code starts by importing necessary libraries and setting up the `marked` library with a custom terminal renderer for displaying Markdown content. It then defines a `chatHistory` array to store the conversation history between the user and the chatbot.\n\nThe `displayWelcomeMessage` function is used to display a welcome message to the user when they start the chatbot. The `clearScreenAndMoveCursorToTop` function clears the terminal screen and moves the cursor to the top.\n\nThe main function, `query`, takes two arguments: `AutodocRepoConfig` and `AutodocUserConfig`. It initializes the `vectorStore` by loading pre-trained embeddings and creates a `chain` object using the `makeChain` function. This chain object is responsible for generating responses based on the user's input.\n\nThe `getQuestion` function uses the `inquirer` library to prompt the user for a question. The main loop of the chatbot starts by getting the user's question and continues until the user types 'exit'. Inside the loop, the code updates the spinner text to 'Thinking...' and calls the `chain` object with the user's question and chat history. The response is then displayed in Markdown format using the `marked` library.\n\nIf an error occurs during the process, the chatbot displays an error message and prompts the user for another question.\n\nExample usage:\n\n```javascript\nquery(repoConfig, userConfig);\n```\n\nThis chatbot interface can be used in the larger Autodoc project to help users navigate and understand the codebase more efficiently by providing a conversational interface for asking questions and receiving answers.", - "questions": "1. **What is the purpose of the `query` function and what are its input parameters?**\n\n The `query` function is used to interact with the chatbot, taking user input and providing responses based on the given codebase. It takes two input parameters: an `AutodocRepoConfig` object containing information about the repository, and an `AutodocUserConfig` object containing user-specific configuration.\n\n2. **How does the `vectorStore` work and what is its role in the code?**\n\n The `vectorStore` is an instance of HNSWLib loaded with data from the specified output directory and using OpenAIEmbeddings. It is used to store and retrieve vector representations of the codebase, which are then used by the `makeChain` function to generate responses to user questions.\n\n3. **How does the chat history work and what is its purpose?**\n\n The `chatHistory` is an array of string pairs, where each pair represents a user question and the corresponding chatbot response. It is used to store the conversation history between the user and the chatbot, allowing the chatbot to provide context-aware responses based on previous interactions." + "filePath": "src\\cli\\commands\\query\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\query\\index.ts", + "summary": "This code defines a chatbot interface for the Autodoc project, which allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a combination of the `inquirer` library for user input, `marked` and `marked-terminal` for rendering Markdown output, and the `langchain` library for handling natural language processing tasks.\n\nThe `query` function is the main entry point for the chatbot. It takes two arguments: an `AutodocRepoConfig` object containing information about the code repository, and an `AutodocUserConfig` object containing user-specific settings. The function initializes a vector store using the `HNSWLib` and `OpenAIEmbeddings` classes, and creates a chat chain using the `makeChain` function.\n\nThe chatbot interface is displayed using the `displayWelcomeMessage` function, which prints a welcome message to the console. The `getQuestion` function is used to prompt the user for a question using the `inquirer` library. The chatbot then enters a loop, where it processes the user's question, generates a response using the chat chain, and displays the response as Markdown in the terminal.\n\nIf an error occurs during the processing of a question, the chatbot will display an error message and continue to prompt the user for a new question. The loop continues until the user types 'exit', at which point the chatbot terminates.\n\nHere's an example of how the `query` function might be used:\n\n```javascript\nimport { query } from './autodoc';\n\nconst repoConfig = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n output: 'path/to/output',\n contentType: 'code',\n chatPrompt: 'Ask me anything about MyProject',\n targetAudience: 'developers',\n};\n\nconst userConfig = {\n llms: 'path/to/llms',\n};\n\nquery(repoConfig, userConfig);\n```\n\nThis example would initialize the chatbot with the specified repository and user configurations, and start the chatbot interface for the user to ask questions about the \"MyProject\" codebase.", + "questions": "1. **What is the purpose of the `query` function in this code?**\n\n The `query` function is responsible for handling user interactions with the chatbot. It takes in an AutodocRepoConfig object and an AutodocUserConfig object, sets up the necessary data structures, and then enters a loop where it prompts the user for questions, processes them, and displays the results.\n\n2. **How does the code handle rendering Markdown text in the terminal?**\n\n The code uses the `marked` library along with a custom `TerminalRenderer` to render Markdown text in the terminal. The `marked` library is configured with the custom renderer using `marked.setOptions({ renderer: new TerminalRenderer() });`.\n\n3. **What is the purpose of the `chatHistory` variable and how is it used?**\n\n The `chatHistory` variable is an array that stores the history of questions and answers in the chat session. It is used to keep track of the conversation between the user and the chatbot. When a new question is asked, the chat history is passed to the `chain.call()` function, and the new question and its corresponding answer are added to the `chatHistory` array.", + "checksum": "19807a33957666422f31136970c37245" } ], "folders": [], - "summary": "The `query` folder in the Autodoc project contains code for creating a chatbot interface that allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation.\n\nIn `createChatChain.ts`, the `makeChain` function is defined, which creates a chatbot using the `ChatVectorDBQAChain` class. This class combines two separate language models: a question generator and a document chain. The question generator is an instance of the `LLMChain` class, which uses the OpenAIChat API to generate standalone questions based on a given conversation history. The document chain is created using the `loadQAChain` function, which takes an instance of the OpenAIChat API and a prompt template as input.\n\nExample usage of `makeChain`:\n\n```javascript\nconst chatbot = makeChain(\n \"autodoc\",\n \"https://github.com/autodoc/autodoc\",\n \"code\",\n \"\",\n \"developer\",\n vectorstore,\n [gpt3, gpt4],\n (token) => console.log(token)\n);\n```\n\nIn `index.ts`, the main chatbot interface is defined. It starts by importing necessary libraries and setting up the `marked` library with a custom terminal renderer for displaying Markdown content. The main function, `query`, takes two arguments: `AutodocRepoConfig` and `AutodocUserConfig`. It initializes the `vectorStore` by loading pre-trained embeddings and creates a `chain` object using the `makeChain` function. This chain object is responsible for generating responses based on the user's input.\n\nThe main loop of the chatbot starts by getting the user's question and continues until the user types 'exit'. Inside the loop, the code updates the spinner text to 'Thinking...' and calls the `chain` object with the user's question and chat history. The response is then displayed in Markdown format using the `marked` library.\n\nExample usage of the chatbot interface:\n\n```javascript\nquery(repoConfig, userConfig);\n```\n\nThis chatbot interface can be used in the larger Autodoc project to help users navigate and understand the codebase more efficiently by providing a conversational interface for asking questions and receiving answers.", - "questions": "" + "summary": "The `query` folder in the Autodoc project contains code for creating a chatbot that can answer questions about a specific software project in a conversational manner. The chatbot is trained on the content of the project and provides answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate.\n\nThe main entry point for the chatbot is the `query` function in `index.ts`. It takes two arguments: an `AutodocRepoConfig` object containing information about the code repository, and an `AutodocUserConfig` object containing user-specific settings. The function initializes a vector store and creates a chat chain using the `makeChain` function from `createChatChain.ts`.\n\nHere's an example of how the `query` function might be used:\n\n```javascript\nimport { query } from './autodoc';\n\nconst repoConfig = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n output: 'path/to/output',\n contentType: 'code',\n chatPrompt: 'Ask me anything about MyProject',\n targetAudience: 'developers',\n};\n\nconst userConfig = {\n llms: 'path/to/llms',\n};\n\nquery(repoConfig, userConfig);\n```\n\nThis example initializes the chatbot with the specified repository and user configurations and starts the chatbot interface for the user to ask questions about the \"MyProject\" codebase.\n\nThe `createChatChain.ts` file defines the `makeChain` function, which creates a chatbot for answering questions about a software project. The chatbot is designed to provide conversational answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate. The target audience for the chatbot is specified by the `targetAudience` parameter.\n\nThe `makeChain` function takes several parameters, such as `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and `onTokenStream`. It first creates a question generator using the `LLMChain` class, then creates a `QA_PROMPT` template using the `makeQAPrompt` function, and finally creates and returns a new instance of the `ChatVectorDBQAChain` class, which combines the question generator and the document chain to create a chatbot that can answer questions about the software project.\n\nIn summary, the code in the `query` folder is responsible for creating a chatbot that can answer questions about a specific software project in a conversational manner. The chatbot uses a combination of natural language processing techniques and efficient nearest neighbor search to generate accurate and relevant answers for the user.", + "questions": "", + "checksum": "9e0d0f111bf588e2df66862dce9db288" }, { "folderName": "user", - "folderPath": ".autodoc/docs/json/src/cli/commands/user", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/user", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\user", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\user", "files": [ { "fileName": "index.ts", - "filePath": "src/cli/commands/user/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/user/index.ts", - "summary": "This code is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3.5 Turbo, GPT-4 8K, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the specified LLMs or default to GPT-3.5 Turbo if none are provided. This function is used to generate the initial configuration object for the user.\n\nThe `user` function is an asynchronous function that handles the user configuration process. It first checks if a user configuration file already exists. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user decides to continue or if no configuration file exists, the function proceeds to create the necessary directories for the configuration file. It then prompts the user to select the LLMs they have access to using the `inquirer` library. The user can choose from three options:\n\n1. GPT-3.5 Turbo\n2. GPT-3.5 Turbo, GPT-4 8K (Early Access)\n3. GPT-3.5 Turbo, GPT-4 8K (Early Access), GPT-4 32K (Early Access)\n\nAfter the user makes their selection, the new configuration object is created using the `makeConfigTemplate` function with the selected LLMs. The configuration object is then saved to the user configuration file in JSON format.\n\nFinally, the user is informed that the configuration has been saved and they can start querying by running the `doc q` command.", - "questions": "1. **Question:** What is the purpose of the `makeConfigTemplate` function and what does it return?\n **Answer:** The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc user. It takes an optional `config` parameter of type `AutodocUserConfig` and returns a new configuration object with the `llms` property set to the provided value or a default value of `[LLMModels.GPT3]`.\n\n2. **Question:** How does the `user` function handle existing user configuration files?\n **Answer:** The `user` function checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the function prompts the user with a confirmation message to overwrite the existing configuration. If the user chooses not to overwrite, the process exits; otherwise, the function proceeds to create a new configuration.\n\n3. **Question:** What are the available choices for the LLMs in the `user` function, and how are they used to create the new configuration?\n **Answer:** The available choices for LLMs are GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The user can select one of these options, and the corresponding LLM models will be set as the value of the `llms` property in the new configuration object." + "filePath": "src\\cli\\commands\\user\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\user\\index.ts", + "summary": "This code is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3, GPT-4, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the provided `config` parameter or with GPT-3 as the default LLM. This function is used to generate a new configuration object when needed.\n\nThe main function, `user`, is an asynchronous function that takes an optional `config` parameter. It first checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user configuration file does not exist, the code attempts to create the necessary directories for the file. If there's an error during this process, it logs the error and exits with a non-zero status code.\n\nNext, the user is prompted to select which LLMs they have access to. The available options are GPT-3.5 Turbo, GPT-3.5 Turbo with GPT-4 8K (Early Access), and GPT-3.5 Turbo with GPT-4 8K and GPT-4 32K (Early Access). The user's selection is then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the user configuration file in JSON format. A success message is displayed to the user, indicating that the configuration has been saved and they can start querying using the `doc q` command.\n\nExample usage:\n\n```javascript\nimport { user } from './path/to/this/file';\n\n// Create a new user configuration with default settings\nawait user();\n\n// Update the user configuration with a custom config object\nawait user({ llms: [LLMModels.GPT3, LLMModels.GPT4] });\n```", + "questions": "1. **What is the purpose of the `makeConfigTemplate` function?**\n\n The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc user. It takes an optional `config` parameter and returns an object with a `llms` property, which is an array of LLM models.\n\n2. **How does the `user` function handle existing user configuration files?**\n\n The `user` function checks if a user configuration file already exists using `fsSync.existsSync`. If it does, the user is prompted with a confirmation message to overwrite the existing configuration. If the user chooses not to overwrite, the process exits with a status code of 0.\n\n3. **What are the available choices for LLM models in the `user` function?**\n\n The available choices for LLM models are GPT-3.5 Turbo, GPT-3.5 Turbo and GPT-4 8K (Early Access), and GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The user can select one of these options, and the selected value is stored in the `llms` property of the new configuration object.", + "checksum": "76bc1e6d5d61e24907832c4cac443225" } ], "folders": [], - "summary": "The `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3.5 Turbo, GPT-4 8K, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the specified LLMs or default to GPT-3.5 Turbo if none are provided. This function is used to generate the initial configuration object for the user.\n\n```typescript\nfunction makeConfigTemplate(llms: string[]): ConfigTemplate {\n // ...\n}\n```\n\nThe `user` function is an asynchronous function that handles the user configuration process. It first checks if a user configuration file already exists. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\n```typescript\nasync function user(): Promise {\n // ...\n}\n```\n\nIf the user decides to continue or if no configuration file exists, the function proceeds to create the necessary directories for the configuration file. It then prompts the user to select the LLMs they have access to using the `inquirer` library. The user can choose from three options:\n\n1. GPT-3.5 Turbo\n2. GPT-3.5 Turbo, GPT-4 8K (Early Access)\n3. GPT-3.5 Turbo, GPT-4 8K (Early Access), GPT-4 32K (Early Access)\n\nAfter the user makes their selection, the new configuration object is created using the `makeConfigTemplate` function with the selected LLMs. The configuration object is then saved to the user configuration file in JSON format.\n\n```typescript\nconst configTemplate = makeConfigTemplate(selectedLLMs);\nawait fs.promises.writeFile(configPath, JSON.stringify(configTemplate, null, 2));\n```\n\nFinally, the user is informed that the configuration has been saved and they can start querying by running the `doc q` command.\n\nThis code is essential for setting up the user's environment and preferences for the Autodoc project. It ensures that the user has the correct configuration file in place, which is necessary for the proper functioning of the project. The user configuration file is used by other parts of the project to determine which LLMs the user has access to and can query.\n\nFor example, when a user runs the `doc q` command, the project will read the user configuration file to determine which LLMs are available for querying. This ensures that the user only queries the LLMs they have access to, preventing any unauthorized access or usage.\n\nIn summary, the `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project, ensuring that the user has the correct configuration file in place, and allowing the user to select the LLMs they have access to. This is essential for the proper functioning of the project and for maintaining the user's preferences and access to different LLMs.", - "questions": "" + "summary": "The `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project. It allows users to create, update, and save their configuration file, which stores information about their access to different Language Learning Models (LLMs) such as GPT-3, GPT-4, and GPT-4 32K.\n\nThe `makeConfigTemplate` function creates a default configuration object with either the provided `config` parameter or GPT-3 as the default LLM. This function is useful for generating a new configuration object when needed.\n\nThe main function, `user`, is an asynchronous function that takes an optional `config` parameter. It first checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user configuration file does not exist, the code attempts to create the necessary directories for the file. If there's an error during this process, it logs the error and exits with a non-zero status code.\n\nNext, the user is prompted to select which LLMs they have access to. The available options are GPT-3.5 Turbo, GPT-3.5 Turbo with GPT-4 8K (Early Access), and GPT-3.5 Turbo with GPT-4 8K and GPT-4 32K (Early Access). The user's selection is then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the user configuration file in JSON format. A success message is displayed to the user, indicating that the configuration has been saved and they can start querying using the `doc q` command.\n\nThis code is essential for the Autodoc project as it allows users to manage their access to different LLMs and store their preferences in a configuration file. This configuration file can then be used by other parts of the project to determine which LLMs the user has access to and tailor the querying process accordingly.\n\nExample usage:\n\n```javascript\nimport { user } from './path/to/this/file';\n\n// Create a new user configuration with default settings\nawait user();\n\n// Update the user configuration with a custom config object\nawait user({ llms: [LLMModels.GPT3, LLMModels.GPT4] });\n```\n\nIn summary, the `index.ts` file in the `user` folder is a crucial part of the Autodoc project, allowing users to manage their LLM access and preferences. This configuration is then used by other parts of the project to provide a tailored experience based on the user's access to different LLMs.", + "questions": "", + "checksum": "4b8fd2b2abaec4959873fc3396c414d8" } ], - "summary": "The code in the `src/cli/commands` folder is responsible for handling various command-line tasks in the Autodoc project. It contains several subfolders, each dedicated to a specific command or functionality, such as estimating costs, processing repositories, initializing the project, querying the chatbot, and managing user configurations.\n\nFor instance, the `estimate` subfolder contains a function that allows users to estimate the cost of indexing a given repository before actually processing it. This function takes an `AutodocRepoConfig` object as input and performs a dry run of the `processRepository` function. It then calculates the total estimated cost and displays it to the user. This helps users make informed decisions about whether to proceed with the indexing process or not.\n\n```javascript\nimport { estimate } from './autodoc/estimate';\n\nconst config = {\n // ...configuration options...\n};\n\nestimate(config);\n```\n\nThe `index` subfolder contains code for processing a given code repository, generating documentation in JSON and Markdown formats, and creating vector files for the documentation. It provides several functions and utilities to achieve these tasks, such as traversing the file system, calling language models, and converting JSON files to Markdown.\n\n```javascript\nimport autodoc from './autodoc';\n\nconst config = {\n // ...configuration options...\n};\n\nautodoc.index(config);\n```\n\nThe `init` subfolder is responsible for initializing and configuring the `autodoc` project. It provides an essential function called `init` that creates a configuration file named `autodoc.config.json` with user inputs and default values.\n\n```javascript\nimport { init } from './autodoc';\n\n(async () => {\n await init();\n})();\n```\n\nThe `query` subfolder contains code for creating a chatbot interface that allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation.\n\n```javascript\nquery(repoConfig, userConfig);\n```\n\nThe `user` subfolder is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs).\n\n```typescript\nasync function user(): Promise {\n // ...\n}\n```\n\nIn summary, the code in the `src/cli/commands` folder plays a crucial role in the Autodoc project by providing various command-line functionalities, such as estimating costs, processing repositories, initializing the project, querying the chatbot, and managing user configurations. These functionalities help developers to easily generate and maintain documentation for their projects, making it more accessible and understandable for other developers and users.", - "questions": "" + "summary": "The code in the `.autodoc\\docs\\json\\src\\cli\\commands` folder is responsible for various tasks related to the Autodoc project, such as initializing the configuration, processing repositories, generating documentation, and creating a chatbot for answering questions about a specific software project. The folder contains several subfolders, each with a specific purpose.\n\n### estimate\n\nThe `estimate` function provides an estimated cost of processing a given repository. It takes an `AutodocRepoConfig` object as input and performs a dry run of the repository processing to calculate the estimated cost. Example usage:\n\n```javascript\nimport { estimate } from './path/to/this/file';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\n### index\n\nThe code in this folder processes a given repository and generates documentation in JSON, Markdown, and vector formats. It takes an `AutodocRepoConfig` object as input and performs three main tasks: processing the repository, creating Markdown files, and creating vector files. Example usage:\n\n```javascript\nindex({\n name: \"myProject\",\n root: \"./input\",\n output: \"./output\",\n filePrompt: true,\n folderPrompt: true,\n contentType: \"code\",\n targetAudience: \"developers\",\n linkHosted: \"https://github.com/user/myProject\",\n});\n```\n\n### init\n\nThe `init` function initializes the configuration of the Autodoc project. It prompts the user to input necessary information to set up the project and creates the `autodoc.config.json` file in the project root. Example usage:\n\n```javascript\nimport { init } from './path/to/this/file';\n\n// Initialize the configuration with default values\nawait init();\n\n// Initialize the configuration with custom values\nawait init({\n name: 'My Custom Repository',\n repositoryUrl: 'https://github.com/user/repo',\n});\n```\n\n### query\n\nThe `query` folder contains code for creating a chatbot that can answer questions about a specific software project. The main entry point is the `query` function, which takes an `AutodocRepoConfig` object and an `AutodocUserConfig` object as input. Example usage:\n\n```javascript\nimport { query } from './autodoc';\n\nconst repoConfig = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n output: 'path/to/output',\n contentType: 'code',\n chatPrompt: 'Ask me anything about MyProject',\n targetAudience: 'developers',\n};\n\nconst userConfig = {\n llms: 'path/to/llms',\n};\n\nquery(repoConfig, userConfig);\n```\n\n### user\n\nThe `user` folder manages the user configuration for the Autodoc project. It allows users to create, update, and save their configuration file, which stores information about their access to different Language Learning Models (LLMs). Example usage:\n\n```javascript\nimport { user } from './path/to/this/file';\n\n// Create a new user configuration with default settings\nawait user();\n\n// Update the user configuration with a custom config object\nawait user({ llms: [LLMModels.GPT3, LLMModels.GPT4] });\n```\n\nIn summary, the code in this folder is essential for various tasks related to the Autodoc project, such as initializing the configuration, processing repositories, generating documentation, and creating a chatbot for answering questions about a specific software project.", + "questions": "", + "checksum": "d11f941351fb51140313ada9b52bbf1a" }, { "folderName": "utils", - "folderPath": ".autodoc/docs/json/src/cli/utils", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/utils", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\utils", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\utils", "files": [ { "fileName": "APIRateLimit.ts", - "filePath": "src/cli/utils/APIRateLimit.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/APIRateLimit.ts", - "summary": "The `APIRateLimit` class in this code snippet is designed to manage and limit the number of concurrent API calls made by the application. This is useful in situations where the API being called has a rate limit or when the application needs to control the number of simultaneous requests to avoid overloading the server.\n\nThe class has a constructor that takes an optional `maxConcurrentCalls` parameter, which defaults to 50. This parameter determines the maximum number of API calls that can be made concurrently.\n\nThe main method of this class is `callApi(apiFunction: () => Promise): Promise`. This method takes a function `apiFunction` that returns a promise and wraps it in a rate-limited execution. The method returns a promise that resolves with the result of the API call or rejects with an error if the call fails.\n\nWhen `callApi` is called, it adds the `executeCall` function to the `queue`. The `executeCall` function is responsible for executing the API call, resolving or rejecting the promise, and managing the `inProgress` counter. After adding the `executeCall` function to the queue, the code checks if there are available slots for concurrent calls by comparing `inProgress` with `maxConcurrentCalls`. If there are available slots, it calls the `dequeueAndExecute` method.\n\nThe `dequeueAndExecute` method is responsible for executing the queued API calls while ensuring that the number of concurrent calls does not exceed the `maxConcurrentCalls` limit. It dequeues the next API call from the queue and executes it if there are available slots for concurrent calls.\n\nHere's an example of how this class can be used in the larger project:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\n\nasync function fetchData(id) {\n // Simulate an API call\n return new Promise((resolve) => setTimeout(() => resolve(`Data for ${id}`), 1000));\n}\n\nasync function getData(id) {\n return apiRateLimiter.callApi(() => fetchData(id));\n}\n\n// Usage\ngetData(1).then(console.log); // Fetches data for ID 1, rate-limited\n```\n\nIn this example, the `APIRateLimit` class is used to limit the number of concurrent calls to the `fetchData` function, which simulates an API call.", - "questions": "1. **What is the purpose of the `APIRateLimit` class?**\n\n The `APIRateLimit` class is designed to manage and limit the number of concurrent API calls to a specified maximum, preventing the application from overwhelming the API with too many requests at once.\n\n2. **How does the `callApi` method work and what is its return type?**\n\n The `callApi` method takes an `apiFunction` as an argument, which is a function that returns a Promise. It adds the API call to a queue and manages the execution of queued calls based on the available slots for concurrent calls. The method returns a Promise of type `T`, where `T` is the expected return type of the `apiFunction`.\n\n3. **How does the `dequeueAndExecute` method work?**\n\n The `dequeueAndExecute` method is responsible for executing the queued API calls. It checks if there are any calls in the queue and if there are available slots for concurrent calls. If both conditions are met, it dequeues the next call from the queue and executes it. This method is called whenever a new API call is added to the queue or when an in-progress call is completed." + "filePath": "src\\cli\\utils\\APIRateLimit.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\APIRateLimit.ts", + "summary": "The `APIRateLimit` class in this code snippet is designed to manage and limit the number of concurrent API calls made by the application. This is useful in situations where the API being called has a rate limit or when the application needs to prevent overwhelming the server with too many requests at once.\n\nThe class constructor takes an optional parameter `maxConcurrentCalls`, which defaults to 50, to set the maximum number of concurrent API calls allowed. It maintains a queue of API calls and keeps track of the number of calls in progress.\n\nThe main method of this class is `callApi(apiFunction: () => Promise): Promise`. It takes a function `apiFunction` that returns a promise and wraps it in a new promise. The purpose of this wrapping is to control the execution of the API calls and ensure that they do not exceed the specified rate limit.\n\nWhen `callApi` is called, the provided `apiFunction` is added to the queue and the `dequeueAndExecute` method is triggered if there are available slots for concurrent calls. The `dequeueAndExecute` method checks if there are any API calls in the queue and if the number of in-progress calls is below the maximum limit. If both conditions are met, it dequeues the next API call and executes it.\n\nThe `executeCall` function inside `callApi` is responsible for actually calling the API function, resolving or rejecting the promise based on the result, and updating the number of in-progress calls. Once an API call is completed, the `dequeueAndExecute` method is called again to process any remaining calls in the queue.\n\nHere's an example of how this class can be used in the larger project:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\n\nasync function fetchSomeData(id) {\n // Call the API using the rate limiter\n const result = await apiRateLimiter.callApi(() => fetch(`https://api.example.com/data/${id}`));\n return result;\n}\n```\n\nIn this example, the `APIRateLimit` class is used to limit the number of concurrent calls to the `fetch` function, ensuring that no more than 10 calls are made at once.", + "questions": "1. **What is the purpose of the `APIRateLimit` class?**\n\n The `APIRateLimit` class is designed to manage and limit the number of concurrent API calls to a specified maximum, preventing the application from overwhelming the API with too many requests at once.\n\n2. **How does the `callApi` method work and what is its return type?**\n\n The `callApi` method takes an `apiFunction` as an argument, which is a function that returns a Promise. It adds the API call to a queue and executes it when there are available slots for concurrent calls. The method returns a Promise of type `T`, where `T` is the expected return type of the `apiFunction`.\n\n3. **How can the maximum number of concurrent calls be configured?**\n\n The maximum number of concurrent calls can be configured by passing a value to the `maxConcurrentCalls` parameter in the constructor of the `APIRateLimit` class. If no value is provided, the default value is set to 50.", + "checksum": "8862552c9cfd8b6db454d45e565081ef" }, { "fileName": "FileUtil.ts", - "filePath": "src/cli/utils/FileUtil.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/FileUtil.ts", - "summary": "This code provides utility functions for handling file and folder paths in the autodoc project. The main purpose of these functions is to generate file names and GitHub URLs for files and folders.\n\n1. `getFileName(input: string, delimiter = '.', extension = '.md'): string`: This function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns a new file name with the given extension. If the delimiter is not found in the input string, the function appends the extension to the input string. If the delimiter is found, the function replaces the part after the last delimiter with the extension. For example:\n\n ```javascript\n getFileName(\"example.txt\"); // returns \"example.md\"\n getFileName(\"example\"); // returns \"example.md\"\n ```\n\n2. `githubFileUrl(githubRoot: string, inputRoot: string, filePath: string, linkHosted: boolean): string`: This function generates a GitHub URL for a file. It takes the GitHub root URL, the input root path, the file path, and a boolean flag `linkHosted`. If `linkHosted` is true, the function returns a URL pointing to the hosted version of the file. If `linkHosted` is false, the function returns a URL pointing to the file in the GitHub repository. For example:\n\n ```javascript\n githubFileUrl(\"https://github.com/user/repo\", \"/input\", \"/input/example.md\", true); // returns \"https://github.com/user/repo/example.md\"\n githubFileUrl(\"https://github.com/user/repo\", \"/input\", \"/input/example.md\", false); // returns \"https://github.com/user/repo/blob/master/example.md\"\n ```\n\n3. `githubFolderUrl(githubRoot: string, inputRoot: string, folderPath: string, linkHosted: boolean): string`: This function is similar to `githubFileUrl`, but it generates a GitHub URL for a folder instead of a file. If `linkHosted` is true, the function returns a URL pointing to the hosted version of the folder. If `linkHosted` is false, the function returns a URL pointing to the folder in the GitHub repository. For example:\n\n ```javascript\n githubFolderUrl(\"https://github.com/user/repo\", \"/input\", \"/input/folder\", true); // returns \"https://github.com/user/repo/folder\"\n githubFolderUrl(\"https://github.com/user/repo\", \"/input\", \"/input/folder\", false); // returns \"https://github.com/user/repo/tree/master/folder\"\n ```\n\nThese utility functions can be used in the autodoc project to generate file names and URLs for documentation files and folders, making it easier to manage and navigate the documentation structure.", - "questions": "1. **What does the `getFileName` function do?**\n\n The `getFileName` function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns the input string with the specified extension, replacing the part after the last occurrence of the delimiter if it exists.\n\n2. **What is the purpose of the `githubFileUrl` and `githubFolderUrl` functions?**\n\n Both `githubFileUrl` and `githubFolderUrl` functions are used to generate URLs for files and folders, respectively, in a GitHub repository. They take a `githubRoot`, `inputRoot`, a `filePath` or `folderPath`, and a `linkHosted` boolean flag. If `linkHosted` is true, the generated URL will point to the hosted version of the file or folder; otherwise, it will point to the file or folder in the GitHub repository.\n\n3. **Why is the `inputRoot.length - 1` used in the `substring` method for both `githubFileUrl` and `githubFolderUrl` functions?**\n\n The `inputRoot.length - 1` is used to remove the `inputRoot` part from the `filePath` or `folderPath` when generating the final URL. This ensures that the generated URL only contains the relevant path relative to the GitHub repository root." + "filePath": "src\\cli\\utils\\FileUtil.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\FileUtil.ts", + "summary": "This code provides utility functions for handling file and folder paths in the autodoc project. The main purpose of these functions is to generate file names and GitHub URLs for documentation files.\n\n1. `getFileName(input, delimiter, extension)`: This function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns a new string with the given extension. If the delimiter is found in the input string, the function removes the part of the string after the last occurrence of the delimiter and appends the extension. If the delimiter is not found, the function simply appends the extension to the input string. This function can be used to generate file names for documentation files with the desired extension.\n\n Example usage:\n\n ```\n getFileName('example.txt'); // returns 'example.md'\n getFileName('example', '_', '.html'); // returns 'example.html'\n ```\n\n2. `githubFileUrl(githubRoot, inputRoot, filePath, linkHosted)`: This function generates a GitHub URL for a file. It takes the GitHub repository root URL, the input root folder path, the file path, and a boolean flag indicating whether the URL should be for the hosted version of the file or the source code. It returns a string with the generated URL.\n\n Example usage:\n\n ```\n githubFileUrl('https://github.com/user/repo', '/input', '/input/example.md', true);\n // returns 'https://github.com/user/repo/example.md'\n ```\n\n3. `githubFolderUrl(githubRoot, inputRoot, folderPath, linkHosted)`: This function is similar to `githubFileUrl`, but it generates a GitHub URL for a folder instead of a file. It takes the same arguments as `githubFileUrl` and returns a string with the generated URL.\n\n Example usage:\n\n ```\n githubFolderUrl('https://github.com/user/repo', '/input', '/input/folder', true);\n // returns 'https://github.com/user/repo/folder'\n ```\n\nThese utility functions can be used throughout the autodoc project to generate file names and GitHub URLs for documentation files and folders, ensuring consistent naming and URL generation across the project.", + "questions": "1. **What is the purpose of the `getFileName` function?**\n\n The `getFileName` function takes an input string, an optional delimiter, and an optional extension, and returns a new string with the given extension. If the delimiter is not found in the input string, the extension is simply appended to the input string. If the delimiter is found, the input string is sliced up to the last delimiter index and the extension is appended.\n\n2. **What are the differences between the `githubFileUrl` and `githubFolderUrl` functions?**\n\n Both functions take the same parameters: `githubRoot`, `inputRoot`, a path (either `filePath` or `folderPath`), and a `linkHosted` boolean. The main difference is in the returned URL: `githubFileUrl` returns a URL pointing to a file in the GitHub repository, while `githubFolderUrl` returns a URL pointing to a folder in the GitHub repository. The URL structure differs slightly, with `/blob/master/` for files and `/tree/master/` for folders.\n\n3. **What is the purpose of the `linkHosted` parameter in the `githubFileUrl` and `githubFolderUrl` functions?**\n\n The `linkHosted` parameter is a boolean that determines whether the returned URL should point to the hosted version of the file or folder on GitHub Pages (if `true`) or to the file or folder within the GitHub repository itself (if `false`). Depending on the value of `linkHosted`, the functions will return different URL structures.", + "checksum": "d1f26fc674b4a9b4a2053642771871c8" }, { "fileName": "LLMUtil.ts", - "filePath": "src/cli/utils/LLMUtil.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/LLMUtil.ts", - "summary": "This code defines and manages different language models (LLMs) and their associated costs for a project. It imports the `OpenAIChat` class from the `langchain/llms` module and the `LLMModelDetails` and `LLMModels` types from the `../../types.js` file.\n\nThe `models` object contains three LLMs: GPT3, GPT4, and GPT432k. Each model has a set of properties, such as `name`, `inputCostPer1KTokens`, `outputCostPer1KTokens`, `maxLength`, and an instance of `OpenAIChat` with specific configurations. The `inputTokens`, `outputTokens`, `succeeded`, `failed`, and `total` properties are initialized to 0.\n\n```javascript\n{\n name: LLMModels.GPT3,\n inputCostPer1KTokens: 0.002,\n outputCostPer1KTokens: 0.002,\n maxLength: 3050,\n llm: new OpenAIChat({ ... }),\n inputTokens: 0,\n outputTokens: 0,\n succeeded: 0,\n failed: 0,\n total: 0,\n}\n```\n\nThe `printModelDetails` function takes an array of `LLMModelDetails` and prints a summary table to the console. It calculates the total cost for each model based on the number of input and output tokens and their respective costs per 1,000 tokens. It also calculates the total file count, succeeded, failed, tokens, and cost across all models.\n\nThe `totalIndexCostEstimate` function calculates the total cost for all models in the input array. It uses the same cost calculation as in `printModelDetails` but returns the total cost as a number.\n\nThese functions can be used in the larger project to manage and analyze the usage and costs of different language models. For example, the `printModelDetails` function can provide a summary of the project's LLM usage, while the `totalIndexCostEstimate` function can help estimate the overall cost of using these models.", - "questions": "1. **Question**: What is the purpose of the `models` object and what are the different models available?\n **Answer**: The `models` object is a record that maps the available LLMModels (GPT3, GPT4, and GPT432k) to their respective details, such as name, input and output costs, maxLength, and an instance of OpenAIChat with the corresponding model.\n\n2. **Question**: How does the `printModelDetails` function work and what information does it display?\n **Answer**: The `printModelDetails` function takes an array of LLMModelDetails and generates an output object containing the model name, file count, succeeded, failed, tokens, and cost. It then calculates the totals for each property and displays the information in a console table.\n\n3. **Question**: What is the purpose of the `totalIndexCostEstimate` function and how does it calculate the total cost?\n **Answer**: The `totalIndexCostEstimate` function calculates the total cost of indexing the given models by iterating through the models array and summing up the input and output costs per 1K tokens for each model." + "filePath": "src\\cli\\utils\\LLMUtil.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\LLMUtil.ts", + "summary": "This code defines and manages different language models (LLMs) and their associated costs for a project that utilizes OpenAI's GPT models. It imports the `OpenAIChat` class from the `langchain/llms` module and the `LLMModelDetails` and `LLMModels` types from the `../../types.js` file.\n\nThe `models` object contains three LLMs: GPT3, GPT4, and GPT432k. Each model has its own properties, such as `name`, `inputCostPer1KTokens`, `outputCostPer1KTokens`, `maxLength`, and an instance of the `OpenAIChat` class with the respective model name and API key. Additionally, each model has counters for input tokens, output tokens, succeeded, failed, and total files processed.\n\nThe `printModelDetails` function takes an array of `LLMModelDetails` and prints a summary table to the console. It calculates the total cost for each model based on the input and output tokens and their respective costs per 1,000 tokens. It also calculates the total file count, succeeded, failed, tokens, and cost across all models.\n\nThe `totalIndexCostEstimate` function calculates the total cost of indexing all models in the input array. It uses the same cost calculation as in `printModelDetails` but returns the total cost as a number.\n\nThese functions can be used in the larger project to manage and analyze the usage and costs of different LLMs. For example, the `printModelDetails` function can be called to display a summary of the models' usage and costs:\n\n```javascript\nimport { models, printModelDetails } from './path/to/this/file';\n\n// Process files with models...\n// Update models' properties...\n\nprintModelDetails(Object.values(models));\n```\n\nAnd the `totalIndexCostEstimate` function can be used to estimate the total cost of indexing all models:\n\n```javascript\nimport { models, totalIndexCostEstimate } from './path/to/this/file';\n\n// Process files with models...\n// Update models' properties...\n\nconst totalCost = totalIndexCostEstimate(Object.values(models));\nconsole.log(`Total cost: ${totalCost}`);\n```", + "questions": "1. **Question:** What is the purpose of the `models` object and how are the different GPT models being used?\n **Answer:** The `models` object is a record that maps different GPT models (GPT3, GPT4, and GPT432k) to their respective details, such as cost per tokens, maximum length, and an instance of `OpenAIChat` with the corresponding model configuration.\n\n2. **Question:** How does the `printModelDetails` function work and what information does it display?\n **Answer:** The `printModelDetails` function takes an array of `LLMModelDetails` as input, processes the information for each model, and then prints a summary table to the console. The table includes the model name, file count, succeeded and failed counts, total tokens, and cost.\n\n3. **Question:** What is the purpose of the `totalIndexCostEstimate` function and how is it calculating the total cost?\n **Answer:** The `totalIndexCostEstimate` function calculates the total cost of processing the given models by iterating through the input `models` array and summing up the costs based on the input and output tokens and their respective costs per 1K tokens.", + "checksum": "f4464cf197f4af827ac0eac950d568fc" }, { - "fileName": "WaitUtil.ts", - "filePath": "src/cli/utils/WaitUtil.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/WaitUtil.ts", - "summary": "The code in this file provides two utility functions, `wait` and `forTrue`, which are designed to help manage asynchronous operations in the larger project. Both functions return a `Promise`, which is a JavaScript object that represents the eventual completion (or failure) of an asynchronous operation and its resulting value.\n\n### wait function\n\nThe `wait` function takes two arguments: `timeoutMs`, which is the number of milliseconds to wait before resolving the promise, and `value`, which is an optional value to be returned when the promise resolves. The function creates a new `Promise` and uses `setTimeout` to resolve it with the given `value` after the specified `timeoutMs` has passed.\n\nExample usage:\n\n```javascript\n// Wait for 2 seconds and then log \"Hello, world!\"\nwait(2000, \"Hello, world!\").then(console.log);\n```\n\n### forTrue function\n\nThe `forTrue` function takes a single argument, `fn`, which is a function that returns a boolean value. The purpose of this function is to repeatedly check if the given function `fn` returns `true`. If it does, the promise resolves with `true`. If the function does not return `true` after 200 checks, the promise is rejected.\n\nThe function uses `setInterval` to repeatedly call the given function `fn` every 50 milliseconds. If `fn` returns `true`, the interval is cleared, and the promise is resolved. If the function has been called 200 times without returning `true`, the promise is rejected.\n\nExample usage:\n\n```javascript\n// Check if a certain element is visible on the page\nconst isElementVisible = () => document.querySelector(\"#my-element\").offsetParent !== null;\n\n// Wait for the element to become visible, then log \"Element is visible!\"\nforTrue(isElementVisible).then(() => console.log(\"Element is visible!\"));\n```\n\nIn summary, these utility functions help manage asynchronous operations by providing a way to wait for a certain amount of time or for a specific condition to be met. They can be used in various parts of the larger project to handle timing and conditional logic in an asynchronous manner.", - "questions": "1. **What is the purpose of the `wait` function?**\n\n The `wait` function is an asynchronous utility function that resolves a promise after a specified timeout in milliseconds. It can be used to introduce a delay in the execution of asynchronous code.\n\n2. **How does the `forTrue` function work and what is its use case?**\n\n The `forTrue` function takes a function `fn` as an argument, which returns a boolean value. It repeatedly checks the result of `fn` every 50 milliseconds until it returns `true` or the maximum number of checks (200) is reached. This function can be used to wait for a specific condition to be met before proceeding with the execution of asynchronous code.\n\n3. **Is there any error handling or customization for the `forTrue` function, such as customizing the interval or maximum number of checks?**\n\n Currently, there is no error handling or customization options for the `forTrue` function. The interval is hardcoded to 50 milliseconds, and the maximum number of checks is hardcoded to 200. To add customization, additional parameters could be added to the function signature and used in the implementation." + "fileName": "traverseFileSystem.ts", + "filePath": "src\\cli\\utils\\traverseFileSystem.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\traverseFileSystem.ts", + "summary": "The `traverseFileSystem` function in this code is an asynchronous function that recursively traverses a given file system, processing files and folders based on the provided parameters. It is designed to be used in the larger project for generating documentation or performing other tasks that require processing files and folders in a directory structure.\n\nThe function takes an object of type `TraverseFileSystemParams` as its input, which contains various properties to control the traversal and processing behavior. These properties include:\n\n- `inputPath`: The root path to start the traversal from.\n- `projectName`: The name of the project being processed.\n- `processFile`: An optional callback function to process a file.\n- `processFolder`: An optional callback function to process a folder.\n- `ignore`: An array of patterns to ignore during traversal.\n- `filePrompt`, `folderPrompt`: Optional prompts for user interaction.\n- `contentType`, `targetAudience`, `linkHosted`: Additional metadata for processing.\n\nThe function first checks if the provided `inputPath` exists using `fs.access`. If the path does not exist, it logs an error message and returns. It then defines a helper function `shouldIgnore` that checks if a given file or folder should be ignored based on the `ignore` patterns.\n\nThe main logic of the function is implemented in the `dfs` (depth-first search) function, which is called recursively to traverse the file system. It reads the contents of the current directory using `fs.readdir`, filters out ignored items, and processes the remaining items.\n\nFor each item, if it is a directory, the `dfs` function is called recursively, and the `processFolder` callback is invoked if provided. If it is a file and its content is text (checked using `isText`), the `processFile` callback is invoked if provided.\n\nThe traversal is performed using `Promise.all` to process items concurrently, improving performance. If an error occurs during traversal, it is logged and rethrown.\n\nHere's an example of how this function might be used in the larger project:\n\n```javascript\nawait traverseFileSystem({\n inputPath: './src',\n projectName: 'myProject',\n processFile: (params) => {\n // Process file logic here\n },\n processFolder: (params) => {\n // Process folder logic here\n },\n ignore: ['node_modules/**', '.git/**'],\n});\n```", + "questions": "1. **What is the purpose of the `traverseFileSystem` function?**\n\n The `traverseFileSystem` function is an asynchronous function that traverses a given file system, processes folders and files based on the provided parameters, and ignores files and folders based on the given ignore patterns.\n\n2. **How does the `shouldIgnore` function work?**\n\n The `shouldIgnore` function takes a file name as input and returns a boolean value indicating whether the file should be ignored or not. It checks if the file name matches any of the ignore patterns provided in the `ignore` parameter using the `minimatch` library.\n\n3. **What is the role of the `dfs` function inside `traverseFileSystem`?**\n\n The `dfs` function is an asynchronous function that performs a depth-first search on the file system starting from the given `currentPath`. It processes folders and files based on the provided parameters and recursively calls itself for each subdirectory found.", + "checksum": "b9e957c10ee6c009864c90aa2fa93763" }, { - "fileName": "traverseFileSystem.ts", - "filePath": "src/cli/utils/traverseFileSystem.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/traverseFileSystem.ts", - "summary": "The `traverseFileSystem` function in this code is an asynchronous function that recursively traverses a given file system, processes folders and files, and filters out ignored files based on provided patterns. It is designed to be used in the larger project for processing and generating documentation for a given project.\n\nThe function takes an object of type `TraverseFileSystemParams` as its input, which contains the following properties:\n\n- `inputPath`: The root folder path to start traversing.\n- `projectName`: The name of the project being documented.\n- `processFile`: An optional callback function to process files.\n- `processFolder`: An optional callback function to process folders.\n- `ignore`: An array of patterns to ignore files and folders.\n- `filePrompt`: An optional prompt for processing files.\n- `folderPrompt`: An optional prompt for processing folders.\n- `contentType`: The type of content being processed.\n- `targetAudience`: The target audience for the documentation.\n- `linkHosted`: A flag indicating if the documentation should be linked to a hosted version.\n\nThe function first checks if the provided `inputPath` exists. If not, it logs an error message and returns. It then defines a helper function `shouldIgnore` that checks if a given file or folder should be ignored based on the `ignore` patterns.\n\nThe main logic of the function is implemented in the `dfs` (depth-first search) function, which recursively traverses the file system. It reads the contents of the current folder, filters out ignored files and folders, and processes them accordingly. If an entry is a directory, it calls `dfs` recursively and then calls the `processFolder` callback if provided. If an entry is a file and is a text file, it calls the `processFile` callback if provided.\n\nHere's an example of how this function might be used in the larger project:\n\n```javascript\nimport { traverseFileSystem } from './autodoc';\n\nconst params = {\n inputPath: './myProject',\n projectName: 'My Project',\n ignore: ['node_modules/**', '.git/**'],\n processFile: async (fileInfo) => {\n // Process the file, e.g., generate documentation\n },\n processFolder: async (folderInfo) => {\n // Process the folder, e.g., create a folder in the output directory\n },\n};\n\ntraverseFileSystem(params);\n```\n\nThis example would traverse the `myProject` folder, ignoring any files and folders within `node_modules` and `.git`, and process the remaining files and folders using the provided callback functions.", - "questions": "1. **What is the purpose of the `traverseFileSystem` function?**\n\n The `traverseFileSystem` function is an asynchronous function that traverses a given file system, processes files and folders based on the provided parameters, and ignores files and folders that match the specified ignore patterns.\n\n2. **How does the `shouldIgnore` function work?**\n\n The `shouldIgnore` function takes a file or folder name as input and returns a boolean value indicating whether the file or folder should be ignored based on the provided ignore patterns. It uses the `minimatch` library to check if the file or folder name matches any of the ignore patterns.\n\n3. **What is the role of the `dfs` function inside `traverseFileSystem`?**\n\n The `dfs` function is an asynchronous function that performs a depth-first search on the file system starting from the given `currentPath`. It processes folders and files based on the provided parameters and recursively calls itself for each subdirectory." + "fileName": "WaitUtil.ts", + "filePath": "src\\cli\\utils\\WaitUtil.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\WaitUtil.ts", + "summary": "The code in this file provides two utility functions, `wait` and `forTrue`, which are designed to help manage asynchronous operations in the larger project. Both functions return a `Promise`, making them suitable for use with `async/await` syntax.\n\n### wait\n\nThe `wait` function takes two arguments: `timeoutMs`, a number representing the desired waiting time in milliseconds, and an optional `value` that defaults to `null`. It returns a `Promise` that resolves with the provided `value` after the specified `timeoutMs` has elapsed. This function can be used to introduce a delay in the execution of asynchronous code.\n\nExample usage:\n\n```javascript\nasync function delayedEcho() {\n console.log(\"Start\");\n await wait(1000, \"Hello\");\n console.log(\"End\");\n}\n\ndelayedEcho(); // Output: Start -> (1 second delay) -> End\n```\n\n### forTrue\n\nThe `forTrue` function takes a single argument, `fn`, which is a function that returns a boolean value. It returns a `Promise` that resolves with `true` when the provided function `fn` returns `true`. The function `fn` is checked every 50 milliseconds, up to a maximum of 200 times (i.e., 10 seconds). If `fn` does not return `true` within this time, the `Promise` is rejected.\n\nThis function can be used to wait for a specific condition to be met before continuing the execution of asynchronous code.\n\nExample usage:\n\n```javascript\nlet condition = false;\n\nsetTimeout(() => {\n condition = true;\n}, 3000);\n\nasync function waitForCondition() {\n console.log(\"Waiting for condition...\");\n await forTrue(() => condition);\n console.log(\"Condition met!\");\n}\n\nwaitForCondition(); // Output: Waiting for condition... -> (3 second delay) -> Condition met!\n```\n\nIn summary, this file provides two utility functions that help manage asynchronous operations by introducing delays and waiting for specific conditions to be met. These functions can be used in the larger project to control the flow of asynchronous code execution.", + "questions": "1. **What is the purpose of the `wait` function?**\n\n The `wait` function is an asynchronous utility function that resolves a promise after a specified timeout in milliseconds, optionally returning a value when the promise is resolved.\n\n2. **How does the `forTrue` function work?**\n\n The `forTrue` function takes a function `fn` as an argument, which should return a boolean value. It checks the result of `fn` every 50 milliseconds and resolves the promise when `fn` returns `true`. If `fn` does not return `true` after 200 attempts, the promise is rejected.\n\n3. **What is the use case for the `forTrue` function?**\n\n The `forTrue` function can be used to wait for a certain condition to be met before proceeding with the execution of the code. This can be useful in situations where you need to wait for an asynchronous operation to complete or a specific state to be reached before continuing.", + "checksum": "bf4acebb6c2736274af75a8c8441c9d2" } ], "folders": [], - "summary": "The code in the `.autodoc/docs/json/src/cli/utils` folder provides utility functions and classes that help manage various aspects of the autodoc project, such as rate-limiting API calls, handling file and folder paths, managing language models, and traversing file systems.\n\n`APIRateLimit.ts` contains the `APIRateLimit` class, which is designed to manage and limit the number of concurrent API calls made by the application. This is useful when the API being called has a rate limit or when the application needs to control the number of simultaneous requests to avoid overloading the server. For example:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\nasync function getData(id) {\n return apiRateLimiter.callApi(() => fetchData(id));\n}\ngetData(1).then(console.log); // Fetches data for ID 1, rate-limited\n```\n\n`FileUtil.ts` provides utility functions for handling file and folder paths, such as generating file names and GitHub URLs for files and folders. These functions can be used to manage and navigate the documentation structure. For example:\n\n```javascript\ngetFileName(\"example.txt\"); // returns \"example.md\"\ngithubFileUrl(\"https://github.com/user/repo\", \"/input\", \"/input/example.md\", true); // returns \"https://github.com/user/repo/example.md\"\n```\n\n`LLMUtil.ts` defines and manages different language models (LLMs) and their associated costs for a project. It provides functions like `printModelDetails` and `totalIndexCostEstimate` to manage and analyze the usage and costs of different language models. For example, the `printModelDetails` function can provide a summary of the project's LLM usage, while the `totalIndexCostEstimate` function can help estimate the overall cost of using these models.\n\n`WaitUtil.ts` provides two utility functions, `wait` and `forTrue`, which help manage asynchronous operations in the larger project. They can be used in various parts of the project to handle timing and conditional logic in an asynchronous manner. For example:\n\n```javascript\nwait(2000, \"Hello, world!\").then(console.log); // Waits for 2 seconds and then logs \"Hello, world!\"\nforTrue(isElementVisible).then(() => console.log(\"Element is visible!\")); // Waits for an element to become visible, then logs \"Element is visible!\"\n```\n\n`traverseFileSystem.ts` contains the `traverseFileSystem` function, which recursively traverses a given file system, processes folders and files, and filters out ignored files based on provided patterns. It is designed to be used for processing and generating documentation for a given project. For example:\n\n```javascript\nconst params = {\n inputPath: './myProject',\n projectName: 'My Project',\n ignore: ['node_modules/**', '.git/**'],\n processFile: async (fileInfo) => {\n // Process the file, e.g., generate documentation\n },\n processFolder: async (folderInfo) => {\n // Process the folder, e.g., create a folder in the output directory\n },\n};\ntraverseFileSystem(params);\n```\n\nIn summary, the code in this folder provides various utility functions and classes that help manage different aspects of the autodoc project, making it easier to handle tasks such as rate-limiting, file and folder management, language model management, asynchronous operations, and file system traversal.", - "questions": "" + "summary": "The `.autodoc\\docs\\json\\src\\cli\\utils` folder contains utility functions and classes that assist in managing API rate limits, handling file and folder paths, managing language models, traversing file systems, and controlling asynchronous operations. These utilities can be used throughout the autodoc project to ensure consistent behavior and improve code organization.\n\n`APIRateLimit.ts` provides the `APIRateLimit` class, which manages and limits the number of concurrent API calls made by the application. This is useful when working with rate-limited APIs or preventing server overload. Example usage:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\nasync function fetchSomeData(id) {\n const result = await apiRateLimiter.callApi(() => fetch(`https://api.example.com/data/${id}`));\n return result;\n}\n```\n\n`FileUtil.ts` offers utility functions for generating file names and GitHub URLs for documentation files. These functions ensure consistent naming and URL generation across the project. Example usage:\n\n```javascript\ngetFileName('example.txt'); // returns 'example.md'\ngithubFileUrl('https://github.com/user/repo', '/input', '/input/example.md', true); // returns 'https://github.com/user/repo/example.md'\n```\n\n`LLMUtil.ts` defines and manages different language models (LLMs) and their associated costs for a project utilizing OpenAI's GPT models. Functions like `printModelDetails` and `totalIndexCostEstimate` can be used to manage and analyze the usage and costs of different LLMs. Example usage:\n\n```javascript\nimport { models, printModelDetails } from './path/to/this/file';\nprintModelDetails(Object.values(models));\nconst totalCost = totalIndexCostEstimate(Object.values(models));\nconsole.log(`Total cost: ${totalCost}`);\n```\n\n`traverseFileSystem.ts` contains the `traverseFileSystem` function, which recursively traverses a given file system, processing files and folders based on provided parameters. This is useful for generating documentation or performing tasks that require processing files and folders in a directory structure. Example usage:\n\n```javascript\nawait traverseFileSystem({\n inputPath: './src',\n projectName: 'myProject',\n processFile: (params) => { /* Process file logic */ },\n processFolder: (params) => { /* Process folder logic */ },\n ignore: ['node_modules/**', '.git/**'],\n});\n```\n\n`WaitUtil.ts` provides two utility functions, `wait` and `forTrue`, which help manage asynchronous operations by introducing delays and waiting for specific conditions to be met. These functions can be used to control the flow of asynchronous code execution. Example usage:\n\n```javascript\nasync function delayedEcho() {\n console.log(\"Start\");\n await wait(1000, \"Hello\");\n console.log(\"End\");\n}\n\nasync function waitForCondition() {\n console.log(\"Waiting for condition...\");\n await forTrue(() => condition);\n console.log(\"Condition met!\");\n}\n```\n\nIn summary, the utilities in this folder enhance the autodoc project by providing consistent behavior, improving code organization, and managing various aspects of the project, such as API rate limits, file and folder paths, language models, file system traversal, and asynchronous operations.", + "questions": "", + "checksum": "a4b7088863601cd326edbec7726eefe7" } ], - "summary": "The `spinner.ts` file in the `.autodoc/docs/json/src/cli` folder provides a utility for managing a command-line spinner using the `ora` library. The spinner is a visual indicator that displays a series of characters in a loop, giving the user feedback that a process is running in the background. The code exports several functions to control the spinner's behavior, such as updating the text, stopping the spinner, and displaying success, error, or informational messages.\n\nThe `spinner` object is created as a singleton to ensure that there is only one instance of the spinner at any given time. This prevents multiple spinners from being displayed simultaneously, which could cause confusion for the user. The spinner is configured to use the 'dots' style.\n\nThe `updateSpinnerText` function is used to update the spinner's text. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the given message. For example:\n\n```javascript\nupdateSpinnerText('Loading data...');\n```\n\nThe `stopSpinner` function stops the spinner if it is currently spinning:\n\n```javascript\nstopSpinner();\n```\n\nThe `spinnerError`, `spinnerSuccess`, and `spinnerInfo` functions are used to display error, success, and informational messages, respectively. These functions first check if the spinner is spinning and then call the appropriate `ora` method to display the message with the corresponding status symbol (e.g., a red cross for errors, a green checkmark for success, etc.):\n\n```javascript\nspinnerError('An error occurred');\nspinnerSuccess('Operation completed successfully');\nspinnerInfo('Please wait...');\n```\n\nIn the larger project, this utility can be used to provide a consistent and user-friendly interface for displaying progress and status messages during long-running tasks or processes.", - "questions": "" + "summary": "The code in the `spinner.ts` file, located in the `.autodoc\\docs\\json\\src\\cli` folder, is responsible for managing a spinner, a visual element that indicates a background process is running. The spinner is created using the `ora` library, which provides a simple and customizable way to create spinners for command-line interfaces.\n\nThe module exports several functions to interact with the spinner:\n\n1. `updateSpinnerText(message: string)`: Updates the spinner's text with the provided message. If the spinner is already spinning, it simply updates the text; otherwise, it starts the spinner with the new message.\n\n Example usage:\n ```javascript\n updateSpinnerText('Loading data...');\n ```\n\n2. `stopSpinner()`: Stops the spinner if it is currently spinning.\n\n Example usage:\n ```javascript\n stopSpinner();\n ```\n\n3. `spinnerError(message?: string)`: Stops the spinner and marks it as failed with an optional error message. It only takes effect if the spinner is currently spinning.\n\n Example usage:\n ```javascript\n spinnerError('Failed to load data');\n ```\n\n4. `spinnerSuccess(message?: string)`: Stops the spinner and marks it as successful with an optional success message. It only takes effect if the spinner is currently spinning.\n\n Example usage:\n ```javascript\n spinnerSuccess('Data loaded successfully');\n ```\n\n5. `spinnerInfo(message: string)`: Displays an informational message without affecting the spinner's state.\n\n Example usage:\n ```javascript\n spinnerInfo('Connecting to server...');\n ```\n\nIn the larger project, this module can be used to provide visual feedback to users when a background process is running, such as loading data, connecting to a server, or performing a complex calculation. By using the exported functions, developers can easily update the spinner's text, stop it, or change its state to indicate success, failure, or display informational messages.", + "questions": "", + "checksum": "e9d728bc3244f1081af08994f5fb1cd0" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/utils/APIRateLimit.json b/.autodoc/docs/json/src/cli/utils/APIRateLimit.json index 1beed52..0e5e81c 100644 --- a/.autodoc/docs/json/src/cli/utils/APIRateLimit.json +++ b/.autodoc/docs/json/src/cli/utils/APIRateLimit.json @@ -1,7 +1,8 @@ { "fileName": "APIRateLimit.ts", - "filePath": "src/cli/utils/APIRateLimit.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/APIRateLimit.ts", - "summary": "The `APIRateLimit` class in this code snippet is designed to manage and limit the number of concurrent API calls made by the application. This is useful in situations where the API being called has a rate limit or when the application needs to control the number of simultaneous requests to avoid overloading the server.\n\nThe class has a constructor that takes an optional `maxConcurrentCalls` parameter, which defaults to 50. This parameter determines the maximum number of API calls that can be made concurrently.\n\nThe main method of this class is `callApi(apiFunction: () => Promise): Promise`. This method takes a function `apiFunction` that returns a promise and wraps it in a rate-limited execution. The method returns a promise that resolves with the result of the API call or rejects with an error if the call fails.\n\nWhen `callApi` is called, it adds the `executeCall` function to the `queue`. The `executeCall` function is responsible for executing the API call, resolving or rejecting the promise, and managing the `inProgress` counter. After adding the `executeCall` function to the queue, the code checks if there are available slots for concurrent calls by comparing `inProgress` with `maxConcurrentCalls`. If there are available slots, it calls the `dequeueAndExecute` method.\n\nThe `dequeueAndExecute` method is responsible for executing the queued API calls while ensuring that the number of concurrent calls does not exceed the `maxConcurrentCalls` limit. It dequeues the next API call from the queue and executes it if there are available slots for concurrent calls.\n\nHere's an example of how this class can be used in the larger project:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\n\nasync function fetchData(id) {\n // Simulate an API call\n return new Promise((resolve) => setTimeout(() => resolve(`Data for ${id}`), 1000));\n}\n\nasync function getData(id) {\n return apiRateLimiter.callApi(() => fetchData(id));\n}\n\n// Usage\ngetData(1).then(console.log); // Fetches data for ID 1, rate-limited\n```\n\nIn this example, the `APIRateLimit` class is used to limit the number of concurrent calls to the `fetchData` function, which simulates an API call.", - "questions": "1. **What is the purpose of the `APIRateLimit` class?**\n\n The `APIRateLimit` class is designed to manage and limit the number of concurrent API calls to a specified maximum, preventing the application from overwhelming the API with too many requests at once.\n\n2. **How does the `callApi` method work and what is its return type?**\n\n The `callApi` method takes an `apiFunction` as an argument, which is a function that returns a Promise. It adds the API call to a queue and manages the execution of queued calls based on the available slots for concurrent calls. The method returns a Promise of type `T`, where `T` is the expected return type of the `apiFunction`.\n\n3. **How does the `dequeueAndExecute` method work?**\n\n The `dequeueAndExecute` method is responsible for executing the queued API calls. It checks if there are any calls in the queue and if there are available slots for concurrent calls. If both conditions are met, it dequeues the next call from the queue and executes it. This method is called whenever a new API call is added to the queue or when an in-progress call is completed." + "filePath": "src\\cli\\utils\\APIRateLimit.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\APIRateLimit.ts", + "summary": "The `APIRateLimit` class in this code snippet is designed to manage and limit the number of concurrent API calls made by the application. This is useful in situations where the API being called has a rate limit or when the application needs to prevent overwhelming the server with too many requests at once.\n\nThe class constructor takes an optional parameter `maxConcurrentCalls`, which defaults to 50, to set the maximum number of concurrent API calls allowed. It maintains a queue of API calls and keeps track of the number of calls in progress.\n\nThe main method of this class is `callApi(apiFunction: () => Promise): Promise`. It takes a function `apiFunction` that returns a promise and wraps it in a new promise. The purpose of this wrapping is to control the execution of the API calls and ensure that they do not exceed the specified rate limit.\n\nWhen `callApi` is called, the provided `apiFunction` is added to the queue and the `dequeueAndExecute` method is triggered if there are available slots for concurrent calls. The `dequeueAndExecute` method checks if there are any API calls in the queue and if the number of in-progress calls is below the maximum limit. If both conditions are met, it dequeues the next API call and executes it.\n\nThe `executeCall` function inside `callApi` is responsible for actually calling the API function, resolving or rejecting the promise based on the result, and updating the number of in-progress calls. Once an API call is completed, the `dequeueAndExecute` method is called again to process any remaining calls in the queue.\n\nHere's an example of how this class can be used in the larger project:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\n\nasync function fetchSomeData(id) {\n // Call the API using the rate limiter\n const result = await apiRateLimiter.callApi(() => fetch(`https://api.example.com/data/${id}`));\n return result;\n}\n```\n\nIn this example, the `APIRateLimit` class is used to limit the number of concurrent calls to the `fetch` function, ensuring that no more than 10 calls are made at once.", + "questions": "1. **What is the purpose of the `APIRateLimit` class?**\n\n The `APIRateLimit` class is designed to manage and limit the number of concurrent API calls to a specified maximum, preventing the application from overwhelming the API with too many requests at once.\n\n2. **How does the `callApi` method work and what is its return type?**\n\n The `callApi` method takes an `apiFunction` as an argument, which is a function that returns a Promise. It adds the API call to a queue and executes it when there are available slots for concurrent calls. The method returns a Promise of type `T`, where `T` is the expected return type of the `apiFunction`.\n\n3. **How can the maximum number of concurrent calls be configured?**\n\n The maximum number of concurrent calls can be configured by passing a value to the `maxConcurrentCalls` parameter in the constructor of the `APIRateLimit` class. If no value is provided, the default value is set to 50.", + "checksum": "8862552c9cfd8b6db454d45e565081ef" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/utils/FileUtil.json b/.autodoc/docs/json/src/cli/utils/FileUtil.json index a4178d9..e9ed28b 100644 --- a/.autodoc/docs/json/src/cli/utils/FileUtil.json +++ b/.autodoc/docs/json/src/cli/utils/FileUtil.json @@ -1,7 +1,8 @@ { "fileName": "FileUtil.ts", - "filePath": "src/cli/utils/FileUtil.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/FileUtil.ts", - "summary": "This code provides utility functions for handling file and folder paths in the autodoc project. The main purpose of these functions is to generate file names and GitHub URLs for files and folders.\n\n1. `getFileName(input: string, delimiter = '.', extension = '.md'): string`: This function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns a new file name with the given extension. If the delimiter is not found in the input string, the function appends the extension to the input string. If the delimiter is found, the function replaces the part after the last delimiter with the extension. For example:\n\n ```javascript\n getFileName(\"example.txt\"); // returns \"example.md\"\n getFileName(\"example\"); // returns \"example.md\"\n ```\n\n2. `githubFileUrl(githubRoot: string, inputRoot: string, filePath: string, linkHosted: boolean): string`: This function generates a GitHub URL for a file. It takes the GitHub root URL, the input root path, the file path, and a boolean flag `linkHosted`. If `linkHosted` is true, the function returns a URL pointing to the hosted version of the file. If `linkHosted` is false, the function returns a URL pointing to the file in the GitHub repository. For example:\n\n ```javascript\n githubFileUrl(\"https://github.com/user/repo\", \"/input\", \"/input/example.md\", true); // returns \"https://github.com/user/repo/example.md\"\n githubFileUrl(\"https://github.com/user/repo\", \"/input\", \"/input/example.md\", false); // returns \"https://github.com/user/repo/blob/master/example.md\"\n ```\n\n3. `githubFolderUrl(githubRoot: string, inputRoot: string, folderPath: string, linkHosted: boolean): string`: This function is similar to `githubFileUrl`, but it generates a GitHub URL for a folder instead of a file. If `linkHosted` is true, the function returns a URL pointing to the hosted version of the folder. If `linkHosted` is false, the function returns a URL pointing to the folder in the GitHub repository. For example:\n\n ```javascript\n githubFolderUrl(\"https://github.com/user/repo\", \"/input\", \"/input/folder\", true); // returns \"https://github.com/user/repo/folder\"\n githubFolderUrl(\"https://github.com/user/repo\", \"/input\", \"/input/folder\", false); // returns \"https://github.com/user/repo/tree/master/folder\"\n ```\n\nThese utility functions can be used in the autodoc project to generate file names and URLs for documentation files and folders, making it easier to manage and navigate the documentation structure.", - "questions": "1. **What does the `getFileName` function do?**\n\n The `getFileName` function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns the input string with the specified extension, replacing the part after the last occurrence of the delimiter if it exists.\n\n2. **What is the purpose of the `githubFileUrl` and `githubFolderUrl` functions?**\n\n Both `githubFileUrl` and `githubFolderUrl` functions are used to generate URLs for files and folders, respectively, in a GitHub repository. They take a `githubRoot`, `inputRoot`, a `filePath` or `folderPath`, and a `linkHosted` boolean flag. If `linkHosted` is true, the generated URL will point to the hosted version of the file or folder; otherwise, it will point to the file or folder in the GitHub repository.\n\n3. **Why is the `inputRoot.length - 1` used in the `substring` method for both `githubFileUrl` and `githubFolderUrl` functions?**\n\n The `inputRoot.length - 1` is used to remove the `inputRoot` part from the `filePath` or `folderPath` when generating the final URL. This ensures that the generated URL only contains the relevant path relative to the GitHub repository root." + "filePath": "src\\cli\\utils\\FileUtil.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\FileUtil.ts", + "summary": "This code provides utility functions for handling file and folder paths in the autodoc project. The main purpose of these functions is to generate file names and GitHub URLs for documentation files.\n\n1. `getFileName(input, delimiter, extension)`: This function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns a new string with the given extension. If the delimiter is found in the input string, the function removes the part of the string after the last occurrence of the delimiter and appends the extension. If the delimiter is not found, the function simply appends the extension to the input string. This function can be used to generate file names for documentation files with the desired extension.\n\n Example usage:\n\n ```\n getFileName('example.txt'); // returns 'example.md'\n getFileName('example', '_', '.html'); // returns 'example.html'\n ```\n\n2. `githubFileUrl(githubRoot, inputRoot, filePath, linkHosted)`: This function generates a GitHub URL for a file. It takes the GitHub repository root URL, the input root folder path, the file path, and a boolean flag indicating whether the URL should be for the hosted version of the file or the source code. It returns a string with the generated URL.\n\n Example usage:\n\n ```\n githubFileUrl('https://github.com/user/repo', '/input', '/input/example.md', true);\n // returns 'https://github.com/user/repo/example.md'\n ```\n\n3. `githubFolderUrl(githubRoot, inputRoot, folderPath, linkHosted)`: This function is similar to `githubFileUrl`, but it generates a GitHub URL for a folder instead of a file. It takes the same arguments as `githubFileUrl` and returns a string with the generated URL.\n\n Example usage:\n\n ```\n githubFolderUrl('https://github.com/user/repo', '/input', '/input/folder', true);\n // returns 'https://github.com/user/repo/folder'\n ```\n\nThese utility functions can be used throughout the autodoc project to generate file names and GitHub URLs for documentation files and folders, ensuring consistent naming and URL generation across the project.", + "questions": "1. **What is the purpose of the `getFileName` function?**\n\n The `getFileName` function takes an input string, an optional delimiter, and an optional extension, and returns a new string with the given extension. If the delimiter is not found in the input string, the extension is simply appended to the input string. If the delimiter is found, the input string is sliced up to the last delimiter index and the extension is appended.\n\n2. **What are the differences between the `githubFileUrl` and `githubFolderUrl` functions?**\n\n Both functions take the same parameters: `githubRoot`, `inputRoot`, a path (either `filePath` or `folderPath`), and a `linkHosted` boolean. The main difference is in the returned URL: `githubFileUrl` returns a URL pointing to a file in the GitHub repository, while `githubFolderUrl` returns a URL pointing to a folder in the GitHub repository. The URL structure differs slightly, with `/blob/master/` for files and `/tree/master/` for folders.\n\n3. **What is the purpose of the `linkHosted` parameter in the `githubFileUrl` and `githubFolderUrl` functions?**\n\n The `linkHosted` parameter is a boolean that determines whether the returned URL should point to the hosted version of the file or folder on GitHub Pages (if `true`) or to the file or folder within the GitHub repository itself (if `false`). Depending on the value of `linkHosted`, the functions will return different URL structures.", + "checksum": "d1f26fc674b4a9b4a2053642771871c8" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/utils/LLMUtil.json b/.autodoc/docs/json/src/cli/utils/LLMUtil.json index 02cdb15..cfa3bba 100644 --- a/.autodoc/docs/json/src/cli/utils/LLMUtil.json +++ b/.autodoc/docs/json/src/cli/utils/LLMUtil.json @@ -1,7 +1,8 @@ { "fileName": "LLMUtil.ts", - "filePath": "src/cli/utils/LLMUtil.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/LLMUtil.ts", - "summary": "This code defines and manages different language models (LLMs) and their associated costs for a project. It imports the `OpenAIChat` class from the `langchain/llms` module and the `LLMModelDetails` and `LLMModels` types from the `../../types.js` file.\n\nThe `models` object contains three LLMs: GPT3, GPT4, and GPT432k. Each model has a set of properties, such as `name`, `inputCostPer1KTokens`, `outputCostPer1KTokens`, `maxLength`, and an instance of `OpenAIChat` with specific configurations. The `inputTokens`, `outputTokens`, `succeeded`, `failed`, and `total` properties are initialized to 0.\n\n```javascript\n{\n name: LLMModels.GPT3,\n inputCostPer1KTokens: 0.002,\n outputCostPer1KTokens: 0.002,\n maxLength: 3050,\n llm: new OpenAIChat({ ... }),\n inputTokens: 0,\n outputTokens: 0,\n succeeded: 0,\n failed: 0,\n total: 0,\n}\n```\n\nThe `printModelDetails` function takes an array of `LLMModelDetails` and prints a summary table to the console. It calculates the total cost for each model based on the number of input and output tokens and their respective costs per 1,000 tokens. It also calculates the total file count, succeeded, failed, tokens, and cost across all models.\n\nThe `totalIndexCostEstimate` function calculates the total cost for all models in the input array. It uses the same cost calculation as in `printModelDetails` but returns the total cost as a number.\n\nThese functions can be used in the larger project to manage and analyze the usage and costs of different language models. For example, the `printModelDetails` function can provide a summary of the project's LLM usage, while the `totalIndexCostEstimate` function can help estimate the overall cost of using these models.", - "questions": "1. **Question**: What is the purpose of the `models` object and what are the different models available?\n **Answer**: The `models` object is a record that maps the available LLMModels (GPT3, GPT4, and GPT432k) to their respective details, such as name, input and output costs, maxLength, and an instance of OpenAIChat with the corresponding model.\n\n2. **Question**: How does the `printModelDetails` function work and what information does it display?\n **Answer**: The `printModelDetails` function takes an array of LLMModelDetails and generates an output object containing the model name, file count, succeeded, failed, tokens, and cost. It then calculates the totals for each property and displays the information in a console table.\n\n3. **Question**: What is the purpose of the `totalIndexCostEstimate` function and how does it calculate the total cost?\n **Answer**: The `totalIndexCostEstimate` function calculates the total cost of indexing the given models by iterating through the models array and summing up the input and output costs per 1K tokens for each model." + "filePath": "src\\cli\\utils\\LLMUtil.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\LLMUtil.ts", + "summary": "This code defines and manages different language models (LLMs) and their associated costs for a project that utilizes OpenAI's GPT models. It imports the `OpenAIChat` class from the `langchain/llms` module and the `LLMModelDetails` and `LLMModels` types from the `../../types.js` file.\n\nThe `models` object contains three LLMs: GPT3, GPT4, and GPT432k. Each model has its own properties, such as `name`, `inputCostPer1KTokens`, `outputCostPer1KTokens`, `maxLength`, and an instance of the `OpenAIChat` class with the respective model name and API key. Additionally, each model has counters for input tokens, output tokens, succeeded, failed, and total files processed.\n\nThe `printModelDetails` function takes an array of `LLMModelDetails` and prints a summary table to the console. It calculates the total cost for each model based on the input and output tokens and their respective costs per 1,000 tokens. It also calculates the total file count, succeeded, failed, tokens, and cost across all models.\n\nThe `totalIndexCostEstimate` function calculates the total cost of indexing all models in the input array. It uses the same cost calculation as in `printModelDetails` but returns the total cost as a number.\n\nThese functions can be used in the larger project to manage and analyze the usage and costs of different LLMs. For example, the `printModelDetails` function can be called to display a summary of the models' usage and costs:\n\n```javascript\nimport { models, printModelDetails } from './path/to/this/file';\n\n// Process files with models...\n// Update models' properties...\n\nprintModelDetails(Object.values(models));\n```\n\nAnd the `totalIndexCostEstimate` function can be used to estimate the total cost of indexing all models:\n\n```javascript\nimport { models, totalIndexCostEstimate } from './path/to/this/file';\n\n// Process files with models...\n// Update models' properties...\n\nconst totalCost = totalIndexCostEstimate(Object.values(models));\nconsole.log(`Total cost: ${totalCost}`);\n```", + "questions": "1. **Question:** What is the purpose of the `models` object and how are the different GPT models being used?\n **Answer:** The `models` object is a record that maps different GPT models (GPT3, GPT4, and GPT432k) to their respective details, such as cost per tokens, maximum length, and an instance of `OpenAIChat` with the corresponding model configuration.\n\n2. **Question:** How does the `printModelDetails` function work and what information does it display?\n **Answer:** The `printModelDetails` function takes an array of `LLMModelDetails` as input, processes the information for each model, and then prints a summary table to the console. The table includes the model name, file count, succeeded and failed counts, total tokens, and cost.\n\n3. **Question:** What is the purpose of the `totalIndexCostEstimate` function and how is it calculating the total cost?\n **Answer:** The `totalIndexCostEstimate` function calculates the total cost of processing the given models by iterating through the input `models` array and summing up the costs based on the input and output tokens and their respective costs per 1K tokens.", + "checksum": "f4464cf197f4af827ac0eac950d568fc" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/utils/WaitUtil.json b/.autodoc/docs/json/src/cli/utils/WaitUtil.json index e476c3e..da72db7 100644 --- a/.autodoc/docs/json/src/cli/utils/WaitUtil.json +++ b/.autodoc/docs/json/src/cli/utils/WaitUtil.json @@ -1,7 +1,8 @@ { "fileName": "WaitUtil.ts", - "filePath": "src/cli/utils/WaitUtil.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/WaitUtil.ts", - "summary": "The code in this file provides two utility functions, `wait` and `forTrue`, which are designed to help manage asynchronous operations in the larger project. Both functions return a `Promise`, which is a JavaScript object that represents the eventual completion (or failure) of an asynchronous operation and its resulting value.\n\n### wait function\n\nThe `wait` function takes two arguments: `timeoutMs`, which is the number of milliseconds to wait before resolving the promise, and `value`, which is an optional value to be returned when the promise resolves. The function creates a new `Promise` and uses `setTimeout` to resolve it with the given `value` after the specified `timeoutMs` has passed.\n\nExample usage:\n\n```javascript\n// Wait for 2 seconds and then log \"Hello, world!\"\nwait(2000, \"Hello, world!\").then(console.log);\n```\n\n### forTrue function\n\nThe `forTrue` function takes a single argument, `fn`, which is a function that returns a boolean value. The purpose of this function is to repeatedly check if the given function `fn` returns `true`. If it does, the promise resolves with `true`. If the function does not return `true` after 200 checks, the promise is rejected.\n\nThe function uses `setInterval` to repeatedly call the given function `fn` every 50 milliseconds. If `fn` returns `true`, the interval is cleared, and the promise is resolved. If the function has been called 200 times without returning `true`, the promise is rejected.\n\nExample usage:\n\n```javascript\n// Check if a certain element is visible on the page\nconst isElementVisible = () => document.querySelector(\"#my-element\").offsetParent !== null;\n\n// Wait for the element to become visible, then log \"Element is visible!\"\nforTrue(isElementVisible).then(() => console.log(\"Element is visible!\"));\n```\n\nIn summary, these utility functions help manage asynchronous operations by providing a way to wait for a certain amount of time or for a specific condition to be met. They can be used in various parts of the larger project to handle timing and conditional logic in an asynchronous manner.", - "questions": "1. **What is the purpose of the `wait` function?**\n\n The `wait` function is an asynchronous utility function that resolves a promise after a specified timeout in milliseconds. It can be used to introduce a delay in the execution of asynchronous code.\n\n2. **How does the `forTrue` function work and what is its use case?**\n\n The `forTrue` function takes a function `fn` as an argument, which returns a boolean value. It repeatedly checks the result of `fn` every 50 milliseconds until it returns `true` or the maximum number of checks (200) is reached. This function can be used to wait for a specific condition to be met before proceeding with the execution of asynchronous code.\n\n3. **Is there any error handling or customization for the `forTrue` function, such as customizing the interval or maximum number of checks?**\n\n Currently, there is no error handling or customization options for the `forTrue` function. The interval is hardcoded to 50 milliseconds, and the maximum number of checks is hardcoded to 200. To add customization, additional parameters could be added to the function signature and used in the implementation." + "filePath": "src\\cli\\utils\\WaitUtil.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\WaitUtil.ts", + "summary": "The code in this file provides two utility functions, `wait` and `forTrue`, which are designed to help manage asynchronous operations in the larger project. Both functions return a `Promise`, making them suitable for use with `async/await` syntax.\n\n### wait\n\nThe `wait` function takes two arguments: `timeoutMs`, a number representing the desired waiting time in milliseconds, and an optional `value` that defaults to `null`. It returns a `Promise` that resolves with the provided `value` after the specified `timeoutMs` has elapsed. This function can be used to introduce a delay in the execution of asynchronous code.\n\nExample usage:\n\n```javascript\nasync function delayedEcho() {\n console.log(\"Start\");\n await wait(1000, \"Hello\");\n console.log(\"End\");\n}\n\ndelayedEcho(); // Output: Start -> (1 second delay) -> End\n```\n\n### forTrue\n\nThe `forTrue` function takes a single argument, `fn`, which is a function that returns a boolean value. It returns a `Promise` that resolves with `true` when the provided function `fn` returns `true`. The function `fn` is checked every 50 milliseconds, up to a maximum of 200 times (i.e., 10 seconds). If `fn` does not return `true` within this time, the `Promise` is rejected.\n\nThis function can be used to wait for a specific condition to be met before continuing the execution of asynchronous code.\n\nExample usage:\n\n```javascript\nlet condition = false;\n\nsetTimeout(() => {\n condition = true;\n}, 3000);\n\nasync function waitForCondition() {\n console.log(\"Waiting for condition...\");\n await forTrue(() => condition);\n console.log(\"Condition met!\");\n}\n\nwaitForCondition(); // Output: Waiting for condition... -> (3 second delay) -> Condition met!\n```\n\nIn summary, this file provides two utility functions that help manage asynchronous operations by introducing delays and waiting for specific conditions to be met. These functions can be used in the larger project to control the flow of asynchronous code execution.", + "questions": "1. **What is the purpose of the `wait` function?**\n\n The `wait` function is an asynchronous utility function that resolves a promise after a specified timeout in milliseconds, optionally returning a value when the promise is resolved.\n\n2. **How does the `forTrue` function work?**\n\n The `forTrue` function takes a function `fn` as an argument, which should return a boolean value. It checks the result of `fn` every 50 milliseconds and resolves the promise when `fn` returns `true`. If `fn` does not return `true` after 200 attempts, the promise is rejected.\n\n3. **What is the use case for the `forTrue` function?**\n\n The `forTrue` function can be used to wait for a certain condition to be met before proceeding with the execution of the code. This can be useful in situations where you need to wait for an asynchronous operation to complete or a specific state to be reached before continuing.", + "checksum": "bf4acebb6c2736274af75a8c8441c9d2" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/utils/summary.json b/.autodoc/docs/json/src/cli/utils/summary.json index 2025542..eda89bb 100644 --- a/.autodoc/docs/json/src/cli/utils/summary.json +++ b/.autodoc/docs/json/src/cli/utils/summary.json @@ -1,45 +1,51 @@ { "folderName": "utils", - "folderPath": ".autodoc/docs/json/src/cli/utils", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/utils", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\utils", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\utils", "files": [ { "fileName": "APIRateLimit.ts", - "filePath": "src/cli/utils/APIRateLimit.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/APIRateLimit.ts", - "summary": "The `APIRateLimit` class in this code snippet is designed to manage and limit the number of concurrent API calls made by the application. This is useful in situations where the API being called has a rate limit or when the application needs to control the number of simultaneous requests to avoid overloading the server.\n\nThe class has a constructor that takes an optional `maxConcurrentCalls` parameter, which defaults to 50. This parameter determines the maximum number of API calls that can be made concurrently.\n\nThe main method of this class is `callApi(apiFunction: () => Promise): Promise`. This method takes a function `apiFunction` that returns a promise and wraps it in a rate-limited execution. The method returns a promise that resolves with the result of the API call or rejects with an error if the call fails.\n\nWhen `callApi` is called, it adds the `executeCall` function to the `queue`. The `executeCall` function is responsible for executing the API call, resolving or rejecting the promise, and managing the `inProgress` counter. After adding the `executeCall` function to the queue, the code checks if there are available slots for concurrent calls by comparing `inProgress` with `maxConcurrentCalls`. If there are available slots, it calls the `dequeueAndExecute` method.\n\nThe `dequeueAndExecute` method is responsible for executing the queued API calls while ensuring that the number of concurrent calls does not exceed the `maxConcurrentCalls` limit. It dequeues the next API call from the queue and executes it if there are available slots for concurrent calls.\n\nHere's an example of how this class can be used in the larger project:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\n\nasync function fetchData(id) {\n // Simulate an API call\n return new Promise((resolve) => setTimeout(() => resolve(`Data for ${id}`), 1000));\n}\n\nasync function getData(id) {\n return apiRateLimiter.callApi(() => fetchData(id));\n}\n\n// Usage\ngetData(1).then(console.log); // Fetches data for ID 1, rate-limited\n```\n\nIn this example, the `APIRateLimit` class is used to limit the number of concurrent calls to the `fetchData` function, which simulates an API call.", - "questions": "1. **What is the purpose of the `APIRateLimit` class?**\n\n The `APIRateLimit` class is designed to manage and limit the number of concurrent API calls to a specified maximum, preventing the application from overwhelming the API with too many requests at once.\n\n2. **How does the `callApi` method work and what is its return type?**\n\n The `callApi` method takes an `apiFunction` as an argument, which is a function that returns a Promise. It adds the API call to a queue and manages the execution of queued calls based on the available slots for concurrent calls. The method returns a Promise of type `T`, where `T` is the expected return type of the `apiFunction`.\n\n3. **How does the `dequeueAndExecute` method work?**\n\n The `dequeueAndExecute` method is responsible for executing the queued API calls. It checks if there are any calls in the queue and if there are available slots for concurrent calls. If both conditions are met, it dequeues the next call from the queue and executes it. This method is called whenever a new API call is added to the queue or when an in-progress call is completed." + "filePath": "src\\cli\\utils\\APIRateLimit.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\APIRateLimit.ts", + "summary": "The `APIRateLimit` class in this code snippet is designed to manage and limit the number of concurrent API calls made by the application. This is useful in situations where the API being called has a rate limit or when the application needs to prevent overwhelming the server with too many requests at once.\n\nThe class constructor takes an optional parameter `maxConcurrentCalls`, which defaults to 50, to set the maximum number of concurrent API calls allowed. It maintains a queue of API calls and keeps track of the number of calls in progress.\n\nThe main method of this class is `callApi(apiFunction: () => Promise): Promise`. It takes a function `apiFunction` that returns a promise and wraps it in a new promise. The purpose of this wrapping is to control the execution of the API calls and ensure that they do not exceed the specified rate limit.\n\nWhen `callApi` is called, the provided `apiFunction` is added to the queue and the `dequeueAndExecute` method is triggered if there are available slots for concurrent calls. The `dequeueAndExecute` method checks if there are any API calls in the queue and if the number of in-progress calls is below the maximum limit. If both conditions are met, it dequeues the next API call and executes it.\n\nThe `executeCall` function inside `callApi` is responsible for actually calling the API function, resolving or rejecting the promise based on the result, and updating the number of in-progress calls. Once an API call is completed, the `dequeueAndExecute` method is called again to process any remaining calls in the queue.\n\nHere's an example of how this class can be used in the larger project:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\n\nasync function fetchSomeData(id) {\n // Call the API using the rate limiter\n const result = await apiRateLimiter.callApi(() => fetch(`https://api.example.com/data/${id}`));\n return result;\n}\n```\n\nIn this example, the `APIRateLimit` class is used to limit the number of concurrent calls to the `fetch` function, ensuring that no more than 10 calls are made at once.", + "questions": "1. **What is the purpose of the `APIRateLimit` class?**\n\n The `APIRateLimit` class is designed to manage and limit the number of concurrent API calls to a specified maximum, preventing the application from overwhelming the API with too many requests at once.\n\n2. **How does the `callApi` method work and what is its return type?**\n\n The `callApi` method takes an `apiFunction` as an argument, which is a function that returns a Promise. It adds the API call to a queue and executes it when there are available slots for concurrent calls. The method returns a Promise of type `T`, where `T` is the expected return type of the `apiFunction`.\n\n3. **How can the maximum number of concurrent calls be configured?**\n\n The maximum number of concurrent calls can be configured by passing a value to the `maxConcurrentCalls` parameter in the constructor of the `APIRateLimit` class. If no value is provided, the default value is set to 50.", + "checksum": "8862552c9cfd8b6db454d45e565081ef" }, { "fileName": "FileUtil.ts", - "filePath": "src/cli/utils/FileUtil.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/FileUtil.ts", - "summary": "This code provides utility functions for handling file and folder paths in the autodoc project. The main purpose of these functions is to generate file names and GitHub URLs for files and folders.\n\n1. `getFileName(input: string, delimiter = '.', extension = '.md'): string`: This function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns a new file name with the given extension. If the delimiter is not found in the input string, the function appends the extension to the input string. If the delimiter is found, the function replaces the part after the last delimiter with the extension. For example:\n\n ```javascript\n getFileName(\"example.txt\"); // returns \"example.md\"\n getFileName(\"example\"); // returns \"example.md\"\n ```\n\n2. `githubFileUrl(githubRoot: string, inputRoot: string, filePath: string, linkHosted: boolean): string`: This function generates a GitHub URL for a file. It takes the GitHub root URL, the input root path, the file path, and a boolean flag `linkHosted`. If `linkHosted` is true, the function returns a URL pointing to the hosted version of the file. If `linkHosted` is false, the function returns a URL pointing to the file in the GitHub repository. For example:\n\n ```javascript\n githubFileUrl(\"https://github.com/user/repo\", \"/input\", \"/input/example.md\", true); // returns \"https://github.com/user/repo/example.md\"\n githubFileUrl(\"https://github.com/user/repo\", \"/input\", \"/input/example.md\", false); // returns \"https://github.com/user/repo/blob/master/example.md\"\n ```\n\n3. `githubFolderUrl(githubRoot: string, inputRoot: string, folderPath: string, linkHosted: boolean): string`: This function is similar to `githubFileUrl`, but it generates a GitHub URL for a folder instead of a file. If `linkHosted` is true, the function returns a URL pointing to the hosted version of the folder. If `linkHosted` is false, the function returns a URL pointing to the folder in the GitHub repository. For example:\n\n ```javascript\n githubFolderUrl(\"https://github.com/user/repo\", \"/input\", \"/input/folder\", true); // returns \"https://github.com/user/repo/folder\"\n githubFolderUrl(\"https://github.com/user/repo\", \"/input\", \"/input/folder\", false); // returns \"https://github.com/user/repo/tree/master/folder\"\n ```\n\nThese utility functions can be used in the autodoc project to generate file names and URLs for documentation files and folders, making it easier to manage and navigate the documentation structure.", - "questions": "1. **What does the `getFileName` function do?**\n\n The `getFileName` function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns the input string with the specified extension, replacing the part after the last occurrence of the delimiter if it exists.\n\n2. **What is the purpose of the `githubFileUrl` and `githubFolderUrl` functions?**\n\n Both `githubFileUrl` and `githubFolderUrl` functions are used to generate URLs for files and folders, respectively, in a GitHub repository. They take a `githubRoot`, `inputRoot`, a `filePath` or `folderPath`, and a `linkHosted` boolean flag. If `linkHosted` is true, the generated URL will point to the hosted version of the file or folder; otherwise, it will point to the file or folder in the GitHub repository.\n\n3. **Why is the `inputRoot.length - 1` used in the `substring` method for both `githubFileUrl` and `githubFolderUrl` functions?**\n\n The `inputRoot.length - 1` is used to remove the `inputRoot` part from the `filePath` or `folderPath` when generating the final URL. This ensures that the generated URL only contains the relevant path relative to the GitHub repository root." + "filePath": "src\\cli\\utils\\FileUtil.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\FileUtil.ts", + "summary": "This code provides utility functions for handling file and folder paths in the autodoc project. The main purpose of these functions is to generate file names and GitHub URLs for documentation files.\n\n1. `getFileName(input, delimiter, extension)`: This function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns a new string with the given extension. If the delimiter is found in the input string, the function removes the part of the string after the last occurrence of the delimiter and appends the extension. If the delimiter is not found, the function simply appends the extension to the input string. This function can be used to generate file names for documentation files with the desired extension.\n\n Example usage:\n\n ```\n getFileName('example.txt'); // returns 'example.md'\n getFileName('example', '_', '.html'); // returns 'example.html'\n ```\n\n2. `githubFileUrl(githubRoot, inputRoot, filePath, linkHosted)`: This function generates a GitHub URL for a file. It takes the GitHub repository root URL, the input root folder path, the file path, and a boolean flag indicating whether the URL should be for the hosted version of the file or the source code. It returns a string with the generated URL.\n\n Example usage:\n\n ```\n githubFileUrl('https://github.com/user/repo', '/input', '/input/example.md', true);\n // returns 'https://github.com/user/repo/example.md'\n ```\n\n3. `githubFolderUrl(githubRoot, inputRoot, folderPath, linkHosted)`: This function is similar to `githubFileUrl`, but it generates a GitHub URL for a folder instead of a file. It takes the same arguments as `githubFileUrl` and returns a string with the generated URL.\n\n Example usage:\n\n ```\n githubFolderUrl('https://github.com/user/repo', '/input', '/input/folder', true);\n // returns 'https://github.com/user/repo/folder'\n ```\n\nThese utility functions can be used throughout the autodoc project to generate file names and GitHub URLs for documentation files and folders, ensuring consistent naming and URL generation across the project.", + "questions": "1. **What is the purpose of the `getFileName` function?**\n\n The `getFileName` function takes an input string, an optional delimiter, and an optional extension, and returns a new string with the given extension. If the delimiter is not found in the input string, the extension is simply appended to the input string. If the delimiter is found, the input string is sliced up to the last delimiter index and the extension is appended.\n\n2. **What are the differences between the `githubFileUrl` and `githubFolderUrl` functions?**\n\n Both functions take the same parameters: `githubRoot`, `inputRoot`, a path (either `filePath` or `folderPath`), and a `linkHosted` boolean. The main difference is in the returned URL: `githubFileUrl` returns a URL pointing to a file in the GitHub repository, while `githubFolderUrl` returns a URL pointing to a folder in the GitHub repository. The URL structure differs slightly, with `/blob/master/` for files and `/tree/master/` for folders.\n\n3. **What is the purpose of the `linkHosted` parameter in the `githubFileUrl` and `githubFolderUrl` functions?**\n\n The `linkHosted` parameter is a boolean that determines whether the returned URL should point to the hosted version of the file or folder on GitHub Pages (if `true`) or to the file or folder within the GitHub repository itself (if `false`). Depending on the value of `linkHosted`, the functions will return different URL structures.", + "checksum": "d1f26fc674b4a9b4a2053642771871c8" }, { "fileName": "LLMUtil.ts", - "filePath": "src/cli/utils/LLMUtil.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/LLMUtil.ts", - "summary": "This code defines and manages different language models (LLMs) and their associated costs for a project. It imports the `OpenAIChat` class from the `langchain/llms` module and the `LLMModelDetails` and `LLMModels` types from the `../../types.js` file.\n\nThe `models` object contains three LLMs: GPT3, GPT4, and GPT432k. Each model has a set of properties, such as `name`, `inputCostPer1KTokens`, `outputCostPer1KTokens`, `maxLength`, and an instance of `OpenAIChat` with specific configurations. The `inputTokens`, `outputTokens`, `succeeded`, `failed`, and `total` properties are initialized to 0.\n\n```javascript\n{\n name: LLMModels.GPT3,\n inputCostPer1KTokens: 0.002,\n outputCostPer1KTokens: 0.002,\n maxLength: 3050,\n llm: new OpenAIChat({ ... }),\n inputTokens: 0,\n outputTokens: 0,\n succeeded: 0,\n failed: 0,\n total: 0,\n}\n```\n\nThe `printModelDetails` function takes an array of `LLMModelDetails` and prints a summary table to the console. It calculates the total cost for each model based on the number of input and output tokens and their respective costs per 1,000 tokens. It also calculates the total file count, succeeded, failed, tokens, and cost across all models.\n\nThe `totalIndexCostEstimate` function calculates the total cost for all models in the input array. It uses the same cost calculation as in `printModelDetails` but returns the total cost as a number.\n\nThese functions can be used in the larger project to manage and analyze the usage and costs of different language models. For example, the `printModelDetails` function can provide a summary of the project's LLM usage, while the `totalIndexCostEstimate` function can help estimate the overall cost of using these models.", - "questions": "1. **Question**: What is the purpose of the `models` object and what are the different models available?\n **Answer**: The `models` object is a record that maps the available LLMModels (GPT3, GPT4, and GPT432k) to their respective details, such as name, input and output costs, maxLength, and an instance of OpenAIChat with the corresponding model.\n\n2. **Question**: How does the `printModelDetails` function work and what information does it display?\n **Answer**: The `printModelDetails` function takes an array of LLMModelDetails and generates an output object containing the model name, file count, succeeded, failed, tokens, and cost. It then calculates the totals for each property and displays the information in a console table.\n\n3. **Question**: What is the purpose of the `totalIndexCostEstimate` function and how does it calculate the total cost?\n **Answer**: The `totalIndexCostEstimate` function calculates the total cost of indexing the given models by iterating through the models array and summing up the input and output costs per 1K tokens for each model." + "filePath": "src\\cli\\utils\\LLMUtil.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\LLMUtil.ts", + "summary": "This code defines and manages different language models (LLMs) and their associated costs for a project that utilizes OpenAI's GPT models. It imports the `OpenAIChat` class from the `langchain/llms` module and the `LLMModelDetails` and `LLMModels` types from the `../../types.js` file.\n\nThe `models` object contains three LLMs: GPT3, GPT4, and GPT432k. Each model has its own properties, such as `name`, `inputCostPer1KTokens`, `outputCostPer1KTokens`, `maxLength`, and an instance of the `OpenAIChat` class with the respective model name and API key. Additionally, each model has counters for input tokens, output tokens, succeeded, failed, and total files processed.\n\nThe `printModelDetails` function takes an array of `LLMModelDetails` and prints a summary table to the console. It calculates the total cost for each model based on the input and output tokens and their respective costs per 1,000 tokens. It also calculates the total file count, succeeded, failed, tokens, and cost across all models.\n\nThe `totalIndexCostEstimate` function calculates the total cost of indexing all models in the input array. It uses the same cost calculation as in `printModelDetails` but returns the total cost as a number.\n\nThese functions can be used in the larger project to manage and analyze the usage and costs of different LLMs. For example, the `printModelDetails` function can be called to display a summary of the models' usage and costs:\n\n```javascript\nimport { models, printModelDetails } from './path/to/this/file';\n\n// Process files with models...\n// Update models' properties...\n\nprintModelDetails(Object.values(models));\n```\n\nAnd the `totalIndexCostEstimate` function can be used to estimate the total cost of indexing all models:\n\n```javascript\nimport { models, totalIndexCostEstimate } from './path/to/this/file';\n\n// Process files with models...\n// Update models' properties...\n\nconst totalCost = totalIndexCostEstimate(Object.values(models));\nconsole.log(`Total cost: ${totalCost}`);\n```", + "questions": "1. **Question:** What is the purpose of the `models` object and how are the different GPT models being used?\n **Answer:** The `models` object is a record that maps different GPT models (GPT3, GPT4, and GPT432k) to their respective details, such as cost per tokens, maximum length, and an instance of `OpenAIChat` with the corresponding model configuration.\n\n2. **Question:** How does the `printModelDetails` function work and what information does it display?\n **Answer:** The `printModelDetails` function takes an array of `LLMModelDetails` as input, processes the information for each model, and then prints a summary table to the console. The table includes the model name, file count, succeeded and failed counts, total tokens, and cost.\n\n3. **Question:** What is the purpose of the `totalIndexCostEstimate` function and how is it calculating the total cost?\n **Answer:** The `totalIndexCostEstimate` function calculates the total cost of processing the given models by iterating through the input `models` array and summing up the costs based on the input and output tokens and their respective costs per 1K tokens.", + "checksum": "f4464cf197f4af827ac0eac950d568fc" }, { - "fileName": "WaitUtil.ts", - "filePath": "src/cli/utils/WaitUtil.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/WaitUtil.ts", - "summary": "The code in this file provides two utility functions, `wait` and `forTrue`, which are designed to help manage asynchronous operations in the larger project. Both functions return a `Promise`, which is a JavaScript object that represents the eventual completion (or failure) of an asynchronous operation and its resulting value.\n\n### wait function\n\nThe `wait` function takes two arguments: `timeoutMs`, which is the number of milliseconds to wait before resolving the promise, and `value`, which is an optional value to be returned when the promise resolves. The function creates a new `Promise` and uses `setTimeout` to resolve it with the given `value` after the specified `timeoutMs` has passed.\n\nExample usage:\n\n```javascript\n// Wait for 2 seconds and then log \"Hello, world!\"\nwait(2000, \"Hello, world!\").then(console.log);\n```\n\n### forTrue function\n\nThe `forTrue` function takes a single argument, `fn`, which is a function that returns a boolean value. The purpose of this function is to repeatedly check if the given function `fn` returns `true`. If it does, the promise resolves with `true`. If the function does not return `true` after 200 checks, the promise is rejected.\n\nThe function uses `setInterval` to repeatedly call the given function `fn` every 50 milliseconds. If `fn` returns `true`, the interval is cleared, and the promise is resolved. If the function has been called 200 times without returning `true`, the promise is rejected.\n\nExample usage:\n\n```javascript\n// Check if a certain element is visible on the page\nconst isElementVisible = () => document.querySelector(\"#my-element\").offsetParent !== null;\n\n// Wait for the element to become visible, then log \"Element is visible!\"\nforTrue(isElementVisible).then(() => console.log(\"Element is visible!\"));\n```\n\nIn summary, these utility functions help manage asynchronous operations by providing a way to wait for a certain amount of time or for a specific condition to be met. They can be used in various parts of the larger project to handle timing and conditional logic in an asynchronous manner.", - "questions": "1. **What is the purpose of the `wait` function?**\n\n The `wait` function is an asynchronous utility function that resolves a promise after a specified timeout in milliseconds. It can be used to introduce a delay in the execution of asynchronous code.\n\n2. **How does the `forTrue` function work and what is its use case?**\n\n The `forTrue` function takes a function `fn` as an argument, which returns a boolean value. It repeatedly checks the result of `fn` every 50 milliseconds until it returns `true` or the maximum number of checks (200) is reached. This function can be used to wait for a specific condition to be met before proceeding with the execution of asynchronous code.\n\n3. **Is there any error handling or customization for the `forTrue` function, such as customizing the interval or maximum number of checks?**\n\n Currently, there is no error handling or customization options for the `forTrue` function. The interval is hardcoded to 50 milliseconds, and the maximum number of checks is hardcoded to 200. To add customization, additional parameters could be added to the function signature and used in the implementation." + "fileName": "traverseFileSystem.ts", + "filePath": "src\\cli\\utils\\traverseFileSystem.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\traverseFileSystem.ts", + "summary": "The `traverseFileSystem` function in this code is an asynchronous function that recursively traverses a given file system, processing files and folders based on the provided parameters. It is designed to be used in the larger project for generating documentation or performing other tasks that require processing files and folders in a directory structure.\n\nThe function takes an object of type `TraverseFileSystemParams` as its input, which contains various properties to control the traversal and processing behavior. These properties include:\n\n- `inputPath`: The root path to start the traversal from.\n- `projectName`: The name of the project being processed.\n- `processFile`: An optional callback function to process a file.\n- `processFolder`: An optional callback function to process a folder.\n- `ignore`: An array of patterns to ignore during traversal.\n- `filePrompt`, `folderPrompt`: Optional prompts for user interaction.\n- `contentType`, `targetAudience`, `linkHosted`: Additional metadata for processing.\n\nThe function first checks if the provided `inputPath` exists using `fs.access`. If the path does not exist, it logs an error message and returns. It then defines a helper function `shouldIgnore` that checks if a given file or folder should be ignored based on the `ignore` patterns.\n\nThe main logic of the function is implemented in the `dfs` (depth-first search) function, which is called recursively to traverse the file system. It reads the contents of the current directory using `fs.readdir`, filters out ignored items, and processes the remaining items.\n\nFor each item, if it is a directory, the `dfs` function is called recursively, and the `processFolder` callback is invoked if provided. If it is a file and its content is text (checked using `isText`), the `processFile` callback is invoked if provided.\n\nThe traversal is performed using `Promise.all` to process items concurrently, improving performance. If an error occurs during traversal, it is logged and rethrown.\n\nHere's an example of how this function might be used in the larger project:\n\n```javascript\nawait traverseFileSystem({\n inputPath: './src',\n projectName: 'myProject',\n processFile: (params) => {\n // Process file logic here\n },\n processFolder: (params) => {\n // Process folder logic here\n },\n ignore: ['node_modules/**', '.git/**'],\n});\n```", + "questions": "1. **What is the purpose of the `traverseFileSystem` function?**\n\n The `traverseFileSystem` function is an asynchronous function that traverses a given file system, processes folders and files based on the provided parameters, and ignores files and folders based on the given ignore patterns.\n\n2. **How does the `shouldIgnore` function work?**\n\n The `shouldIgnore` function takes a file name as input and returns a boolean value indicating whether the file should be ignored or not. It checks if the file name matches any of the ignore patterns provided in the `ignore` parameter using the `minimatch` library.\n\n3. **What is the role of the `dfs` function inside `traverseFileSystem`?**\n\n The `dfs` function is an asynchronous function that performs a depth-first search on the file system starting from the given `currentPath`. It processes folders and files based on the provided parameters and recursively calls itself for each subdirectory found.", + "checksum": "b9e957c10ee6c009864c90aa2fa93763" }, { - "fileName": "traverseFileSystem.ts", - "filePath": "src/cli/utils/traverseFileSystem.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/traverseFileSystem.ts", - "summary": "The `traverseFileSystem` function in this code is an asynchronous function that recursively traverses a given file system, processes folders and files, and filters out ignored files based on provided patterns. It is designed to be used in the larger project for processing and generating documentation for a given project.\n\nThe function takes an object of type `TraverseFileSystemParams` as its input, which contains the following properties:\n\n- `inputPath`: The root folder path to start traversing.\n- `projectName`: The name of the project being documented.\n- `processFile`: An optional callback function to process files.\n- `processFolder`: An optional callback function to process folders.\n- `ignore`: An array of patterns to ignore files and folders.\n- `filePrompt`: An optional prompt for processing files.\n- `folderPrompt`: An optional prompt for processing folders.\n- `contentType`: The type of content being processed.\n- `targetAudience`: The target audience for the documentation.\n- `linkHosted`: A flag indicating if the documentation should be linked to a hosted version.\n\nThe function first checks if the provided `inputPath` exists. If not, it logs an error message and returns. It then defines a helper function `shouldIgnore` that checks if a given file or folder should be ignored based on the `ignore` patterns.\n\nThe main logic of the function is implemented in the `dfs` (depth-first search) function, which recursively traverses the file system. It reads the contents of the current folder, filters out ignored files and folders, and processes them accordingly. If an entry is a directory, it calls `dfs` recursively and then calls the `processFolder` callback if provided. If an entry is a file and is a text file, it calls the `processFile` callback if provided.\n\nHere's an example of how this function might be used in the larger project:\n\n```javascript\nimport { traverseFileSystem } from './autodoc';\n\nconst params = {\n inputPath: './myProject',\n projectName: 'My Project',\n ignore: ['node_modules/**', '.git/**'],\n processFile: async (fileInfo) => {\n // Process the file, e.g., generate documentation\n },\n processFolder: async (folderInfo) => {\n // Process the folder, e.g., create a folder in the output directory\n },\n};\n\ntraverseFileSystem(params);\n```\n\nThis example would traverse the `myProject` folder, ignoring any files and folders within `node_modules` and `.git`, and process the remaining files and folders using the provided callback functions.", - "questions": "1. **What is the purpose of the `traverseFileSystem` function?**\n\n The `traverseFileSystem` function is an asynchronous function that traverses a given file system, processes files and folders based on the provided parameters, and ignores files and folders that match the specified ignore patterns.\n\n2. **How does the `shouldIgnore` function work?**\n\n The `shouldIgnore` function takes a file or folder name as input and returns a boolean value indicating whether the file or folder should be ignored based on the provided ignore patterns. It uses the `minimatch` library to check if the file or folder name matches any of the ignore patterns.\n\n3. **What is the role of the `dfs` function inside `traverseFileSystem`?**\n\n The `dfs` function is an asynchronous function that performs a depth-first search on the file system starting from the given `currentPath`. It processes folders and files based on the provided parameters and recursively calls itself for each subdirectory." + "fileName": "WaitUtil.ts", + "filePath": "src\\cli\\utils\\WaitUtil.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\WaitUtil.ts", + "summary": "The code in this file provides two utility functions, `wait` and `forTrue`, which are designed to help manage asynchronous operations in the larger project. Both functions return a `Promise`, making them suitable for use with `async/await` syntax.\n\n### wait\n\nThe `wait` function takes two arguments: `timeoutMs`, a number representing the desired waiting time in milliseconds, and an optional `value` that defaults to `null`. It returns a `Promise` that resolves with the provided `value` after the specified `timeoutMs` has elapsed. This function can be used to introduce a delay in the execution of asynchronous code.\n\nExample usage:\n\n```javascript\nasync function delayedEcho() {\n console.log(\"Start\");\n await wait(1000, \"Hello\");\n console.log(\"End\");\n}\n\ndelayedEcho(); // Output: Start -> (1 second delay) -> End\n```\n\n### forTrue\n\nThe `forTrue` function takes a single argument, `fn`, which is a function that returns a boolean value. It returns a `Promise` that resolves with `true` when the provided function `fn` returns `true`. The function `fn` is checked every 50 milliseconds, up to a maximum of 200 times (i.e., 10 seconds). If `fn` does not return `true` within this time, the `Promise` is rejected.\n\nThis function can be used to wait for a specific condition to be met before continuing the execution of asynchronous code.\n\nExample usage:\n\n```javascript\nlet condition = false;\n\nsetTimeout(() => {\n condition = true;\n}, 3000);\n\nasync function waitForCondition() {\n console.log(\"Waiting for condition...\");\n await forTrue(() => condition);\n console.log(\"Condition met!\");\n}\n\nwaitForCondition(); // Output: Waiting for condition... -> (3 second delay) -> Condition met!\n```\n\nIn summary, this file provides two utility functions that help manage asynchronous operations by introducing delays and waiting for specific conditions to be met. These functions can be used in the larger project to control the flow of asynchronous code execution.", + "questions": "1. **What is the purpose of the `wait` function?**\n\n The `wait` function is an asynchronous utility function that resolves a promise after a specified timeout in milliseconds, optionally returning a value when the promise is resolved.\n\n2. **How does the `forTrue` function work?**\n\n The `forTrue` function takes a function `fn` as an argument, which should return a boolean value. It checks the result of `fn` every 50 milliseconds and resolves the promise when `fn` returns `true`. If `fn` does not return `true` after 200 attempts, the promise is rejected.\n\n3. **What is the use case for the `forTrue` function?**\n\n The `forTrue` function can be used to wait for a certain condition to be met before proceeding with the execution of the code. This can be useful in situations where you need to wait for an asynchronous operation to complete or a specific state to be reached before continuing.", + "checksum": "bf4acebb6c2736274af75a8c8441c9d2" } ], "folders": [], - "summary": "The code in the `.autodoc/docs/json/src/cli/utils` folder provides utility functions and classes that help manage various aspects of the autodoc project, such as rate-limiting API calls, handling file and folder paths, managing language models, and traversing file systems.\n\n`APIRateLimit.ts` contains the `APIRateLimit` class, which is designed to manage and limit the number of concurrent API calls made by the application. This is useful when the API being called has a rate limit or when the application needs to control the number of simultaneous requests to avoid overloading the server. For example:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\nasync function getData(id) {\n return apiRateLimiter.callApi(() => fetchData(id));\n}\ngetData(1).then(console.log); // Fetches data for ID 1, rate-limited\n```\n\n`FileUtil.ts` provides utility functions for handling file and folder paths, such as generating file names and GitHub URLs for files and folders. These functions can be used to manage and navigate the documentation structure. For example:\n\n```javascript\ngetFileName(\"example.txt\"); // returns \"example.md\"\ngithubFileUrl(\"https://github.com/user/repo\", \"/input\", \"/input/example.md\", true); // returns \"https://github.com/user/repo/example.md\"\n```\n\n`LLMUtil.ts` defines and manages different language models (LLMs) and their associated costs for a project. It provides functions like `printModelDetails` and `totalIndexCostEstimate` to manage and analyze the usage and costs of different language models. For example, the `printModelDetails` function can provide a summary of the project's LLM usage, while the `totalIndexCostEstimate` function can help estimate the overall cost of using these models.\n\n`WaitUtil.ts` provides two utility functions, `wait` and `forTrue`, which help manage asynchronous operations in the larger project. They can be used in various parts of the project to handle timing and conditional logic in an asynchronous manner. For example:\n\n```javascript\nwait(2000, \"Hello, world!\").then(console.log); // Waits for 2 seconds and then logs \"Hello, world!\"\nforTrue(isElementVisible).then(() => console.log(\"Element is visible!\")); // Waits for an element to become visible, then logs \"Element is visible!\"\n```\n\n`traverseFileSystem.ts` contains the `traverseFileSystem` function, which recursively traverses a given file system, processes folders and files, and filters out ignored files based on provided patterns. It is designed to be used for processing and generating documentation for a given project. For example:\n\n```javascript\nconst params = {\n inputPath: './myProject',\n projectName: 'My Project',\n ignore: ['node_modules/**', '.git/**'],\n processFile: async (fileInfo) => {\n // Process the file, e.g., generate documentation\n },\n processFolder: async (folderInfo) => {\n // Process the folder, e.g., create a folder in the output directory\n },\n};\ntraverseFileSystem(params);\n```\n\nIn summary, the code in this folder provides various utility functions and classes that help manage different aspects of the autodoc project, making it easier to handle tasks such as rate-limiting, file and folder management, language model management, asynchronous operations, and file system traversal.", - "questions": "" + "summary": "The `.autodoc\\docs\\json\\src\\cli\\utils` folder contains utility functions and classes that assist in managing API rate limits, handling file and folder paths, managing language models, traversing file systems, and controlling asynchronous operations. These utilities can be used throughout the autodoc project to ensure consistent behavior and improve code organization.\n\n`APIRateLimit.ts` provides the `APIRateLimit` class, which manages and limits the number of concurrent API calls made by the application. This is useful when working with rate-limited APIs or preventing server overload. Example usage:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\nasync function fetchSomeData(id) {\n const result = await apiRateLimiter.callApi(() => fetch(`https://api.example.com/data/${id}`));\n return result;\n}\n```\n\n`FileUtil.ts` offers utility functions for generating file names and GitHub URLs for documentation files. These functions ensure consistent naming and URL generation across the project. Example usage:\n\n```javascript\ngetFileName('example.txt'); // returns 'example.md'\ngithubFileUrl('https://github.com/user/repo', '/input', '/input/example.md', true); // returns 'https://github.com/user/repo/example.md'\n```\n\n`LLMUtil.ts` defines and manages different language models (LLMs) and their associated costs for a project utilizing OpenAI's GPT models. Functions like `printModelDetails` and `totalIndexCostEstimate` can be used to manage and analyze the usage and costs of different LLMs. Example usage:\n\n```javascript\nimport { models, printModelDetails } from './path/to/this/file';\nprintModelDetails(Object.values(models));\nconst totalCost = totalIndexCostEstimate(Object.values(models));\nconsole.log(`Total cost: ${totalCost}`);\n```\n\n`traverseFileSystem.ts` contains the `traverseFileSystem` function, which recursively traverses a given file system, processing files and folders based on provided parameters. This is useful for generating documentation or performing tasks that require processing files and folders in a directory structure. Example usage:\n\n```javascript\nawait traverseFileSystem({\n inputPath: './src',\n projectName: 'myProject',\n processFile: (params) => { /* Process file logic */ },\n processFolder: (params) => { /* Process folder logic */ },\n ignore: ['node_modules/**', '.git/**'],\n});\n```\n\n`WaitUtil.ts` provides two utility functions, `wait` and `forTrue`, which help manage asynchronous operations by introducing delays and waiting for specific conditions to be met. These functions can be used to control the flow of asynchronous code execution. Example usage:\n\n```javascript\nasync function delayedEcho() {\n console.log(\"Start\");\n await wait(1000, \"Hello\");\n console.log(\"End\");\n}\n\nasync function waitForCondition() {\n console.log(\"Waiting for condition...\");\n await forTrue(() => condition);\n console.log(\"Condition met!\");\n}\n```\n\nIn summary, the utilities in this folder enhance the autodoc project by providing consistent behavior, improving code organization, and managing various aspects of the project, such as API rate limits, file and folder paths, language models, file system traversal, and asynchronous operations.", + "questions": "", + "checksum": "a4b7088863601cd326edbec7726eefe7" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/cli/utils/traverseFileSystem.json b/.autodoc/docs/json/src/cli/utils/traverseFileSystem.json index 62be6e8..159296e 100644 --- a/.autodoc/docs/json/src/cli/utils/traverseFileSystem.json +++ b/.autodoc/docs/json/src/cli/utils/traverseFileSystem.json @@ -1,7 +1,8 @@ { "fileName": "traverseFileSystem.ts", - "filePath": "src/cli/utils/traverseFileSystem.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/traverseFileSystem.ts", - "summary": "The `traverseFileSystem` function in this code is an asynchronous function that recursively traverses a given file system, processes folders and files, and filters out ignored files based on provided patterns. It is designed to be used in the larger project for processing and generating documentation for a given project.\n\nThe function takes an object of type `TraverseFileSystemParams` as its input, which contains the following properties:\n\n- `inputPath`: The root folder path to start traversing.\n- `projectName`: The name of the project being documented.\n- `processFile`: An optional callback function to process files.\n- `processFolder`: An optional callback function to process folders.\n- `ignore`: An array of patterns to ignore files and folders.\n- `filePrompt`: An optional prompt for processing files.\n- `folderPrompt`: An optional prompt for processing folders.\n- `contentType`: The type of content being processed.\n- `targetAudience`: The target audience for the documentation.\n- `linkHosted`: A flag indicating if the documentation should be linked to a hosted version.\n\nThe function first checks if the provided `inputPath` exists. If not, it logs an error message and returns. It then defines a helper function `shouldIgnore` that checks if a given file or folder should be ignored based on the `ignore` patterns.\n\nThe main logic of the function is implemented in the `dfs` (depth-first search) function, which recursively traverses the file system. It reads the contents of the current folder, filters out ignored files and folders, and processes them accordingly. If an entry is a directory, it calls `dfs` recursively and then calls the `processFolder` callback if provided. If an entry is a file and is a text file, it calls the `processFile` callback if provided.\n\nHere's an example of how this function might be used in the larger project:\n\n```javascript\nimport { traverseFileSystem } from './autodoc';\n\nconst params = {\n inputPath: './myProject',\n projectName: 'My Project',\n ignore: ['node_modules/**', '.git/**'],\n processFile: async (fileInfo) => {\n // Process the file, e.g., generate documentation\n },\n processFolder: async (folderInfo) => {\n // Process the folder, e.g., create a folder in the output directory\n },\n};\n\ntraverseFileSystem(params);\n```\n\nThis example would traverse the `myProject` folder, ignoring any files and folders within `node_modules` and `.git`, and process the remaining files and folders using the provided callback functions.", - "questions": "1. **What is the purpose of the `traverseFileSystem` function?**\n\n The `traverseFileSystem` function is an asynchronous function that traverses a given file system, processes files and folders based on the provided parameters, and ignores files and folders that match the specified ignore patterns.\n\n2. **How does the `shouldIgnore` function work?**\n\n The `shouldIgnore` function takes a file or folder name as input and returns a boolean value indicating whether the file or folder should be ignored based on the provided ignore patterns. It uses the `minimatch` library to check if the file or folder name matches any of the ignore patterns.\n\n3. **What is the role of the `dfs` function inside `traverseFileSystem`?**\n\n The `dfs` function is an asynchronous function that performs a depth-first search on the file system starting from the given `currentPath`. It processes folders and files based on the provided parameters and recursively calls itself for each subdirectory." + "filePath": "src\\cli\\utils\\traverseFileSystem.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\traverseFileSystem.ts", + "summary": "The `traverseFileSystem` function in this code is an asynchronous function that recursively traverses a given file system, processing files and folders based on the provided parameters. It is designed to be used in the larger project for generating documentation or performing other tasks that require processing files and folders in a directory structure.\n\nThe function takes an object of type `TraverseFileSystemParams` as its input, which contains various properties to control the traversal and processing behavior. These properties include:\n\n- `inputPath`: The root path to start the traversal from.\n- `projectName`: The name of the project being processed.\n- `processFile`: An optional callback function to process a file.\n- `processFolder`: An optional callback function to process a folder.\n- `ignore`: An array of patterns to ignore during traversal.\n- `filePrompt`, `folderPrompt`: Optional prompts for user interaction.\n- `contentType`, `targetAudience`, `linkHosted`: Additional metadata for processing.\n\nThe function first checks if the provided `inputPath` exists using `fs.access`. If the path does not exist, it logs an error message and returns. It then defines a helper function `shouldIgnore` that checks if a given file or folder should be ignored based on the `ignore` patterns.\n\nThe main logic of the function is implemented in the `dfs` (depth-first search) function, which is called recursively to traverse the file system. It reads the contents of the current directory using `fs.readdir`, filters out ignored items, and processes the remaining items.\n\nFor each item, if it is a directory, the `dfs` function is called recursively, and the `processFolder` callback is invoked if provided. If it is a file and its content is text (checked using `isText`), the `processFile` callback is invoked if provided.\n\nThe traversal is performed using `Promise.all` to process items concurrently, improving performance. If an error occurs during traversal, it is logged and rethrown.\n\nHere's an example of how this function might be used in the larger project:\n\n```javascript\nawait traverseFileSystem({\n inputPath: './src',\n projectName: 'myProject',\n processFile: (params) => {\n // Process file logic here\n },\n processFolder: (params) => {\n // Process folder logic here\n },\n ignore: ['node_modules/**', '.git/**'],\n});\n```", + "questions": "1. **What is the purpose of the `traverseFileSystem` function?**\n\n The `traverseFileSystem` function is an asynchronous function that traverses a given file system, processes folders and files based on the provided parameters, and ignores files and folders based on the given ignore patterns.\n\n2. **How does the `shouldIgnore` function work?**\n\n The `shouldIgnore` function takes a file name as input and returns a boolean value indicating whether the file should be ignored or not. It checks if the file name matches any of the ignore patterns provided in the `ignore` parameter using the `minimatch` library.\n\n3. **What is the role of the `dfs` function inside `traverseFileSystem`?**\n\n The `dfs` function is an asynchronous function that performs a depth-first search on the file system starting from the given `currentPath`. It processes folders and files based on the provided parameters and recursively calls itself for each subdirectory found.", + "checksum": "b9e957c10ee6c009864c90aa2fa93763" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/const.json b/.autodoc/docs/json/src/const.json index 6e9c89e..e37d363 100644 --- a/.autodoc/docs/json/src/const.json +++ b/.autodoc/docs/json/src/const.json @@ -1,7 +1,8 @@ { "fileName": "const.ts", - "filePath": "src/const.ts", - "url": "https://github.com/context-labs/autodoc/src/const.ts", - "summary": "The code in this file is responsible for managing the user configuration file for the Autodoc project. It imports two Node.js built-in modules, `path` and `os`, which are used to handle file paths and operating system-related utility functions, respectively.\n\nThe `userConfigFileName` constant is defined as `'autodoc.user.json'`. This constant represents the name of the user configuration file that will be used by the Autodoc project.\n\nThe `userConfigFilePath` constant is created using the `path.resolve()` function, which resolves a sequence of paths into an absolute path. It takes three arguments:\n\n1. `os.homedir()`: This function returns the current user's home directory. It ensures that the user configuration file is stored in the user's home directory, making it user-specific.\n2. `'./.config/autodoc/'`: This string specifies the subdirectory within the user's home directory where the configuration file will be stored. The `.config` directory is a common location for storing configuration files on Unix-based systems, and the `autodoc` subdirectory is used to keep the Autodoc configuration files organized.\n3. `userConfigFileName`: This constant is used as the file name for the user configuration file.\n\nThe `userConfigFilePath` constant will store the absolute path to the user configuration file, which can be used by other parts of the Autodoc project to read or write user-specific settings.\n\nIn summary, this code is responsible for defining the location and name of the user configuration file for the Autodoc project. It ensures that the configuration file is stored in a user-specific directory and follows a standard naming convention. This allows the Autodoc project to easily manage user-specific settings and preferences.", - "questions": "1. **What is the purpose of the `userConfigFileName` and `userConfigFilePath` constants?**\n\n The `userConfigFileName` constant defines the name of the user configuration file for the autodoc project, while the `userConfigFilePath` constant defines the absolute path to this file, which is located in the user's home directory under the `.config/autodoc/` folder.\n\n2. **Why are the `node:path` and `node:os` modules imported?**\n\n The `node:path` module is imported to provide utilities for working with file and directory paths, such as the `path.resolve()` function used to construct the `userConfigFilePath`. The `node:os` module is imported to provide operating system-related utility methods, such as `os.homedir()` which returns the current user's home directory.\n\n3. **Is this code compatible with different operating systems?**\n\n Yes, this code is compatible with different operating systems. The `os.homedir()` function from the `node:os` module returns the correct home directory path for the current user, regardless of the operating system. Additionally, the `path.resolve()` function from the `node:path` module handles path separators and other OS-specific details, ensuring the correct file path is generated." + "filePath": "src\\const.ts", + "url": "https://github.com/context-labs/autodoc/src\\const.ts", + "summary": "The code in this file is responsible for managing the user configuration file for the autodoc project. It imports two Node.js built-in modules, `path` and `os`, which are used to handle file paths and operating system-related utility functions, respectively.\n\nThe `userConfigFileName` constant is defined as `'autodoc.user.json'`, which represents the name of the user configuration file. This file is expected to store user-specific settings for the autodoc project in JSON format.\n\nThe `userConfigFilePath` constant is created using the `path.resolve()` function, which combines the provided arguments into an absolute file path. The `os.homedir()` function is used to get the current user's home directory, and `./.config/autodoc/` is appended to it as the folder where the user configuration file should be stored. Finally, the `userConfigFileName` constant is appended to the path, resulting in the complete file path for the user configuration file.\n\nBy exporting both `userConfigFileName` and `userConfigFilePath`, other parts of the autodoc project can easily access and use these constants to read or write user-specific settings. For example, when the autodoc application starts, it can read the user configuration file from the specified path, and apply the settings accordingly.\n\nHere's a code example of how these constants might be used in another part of the autodoc project:\n\n```javascript\nimport { userConfigFilePath } from './path/to/this/file';\n\n// Read user configuration from the file\nconst userConfig = JSON.parse(fs.readFileSync(userConfigFilePath, 'utf-8'));\n\n// Apply user settings\napplyUserSettings(userConfig);\n```\n\nIn summary, this code is responsible for defining the name and file path of the user configuration file for the autodoc project, allowing other parts of the project to easily access and manage user-specific settings.", + "questions": "1. **What is the purpose of the `userConfigFileName` and `userConfigFilePath` constants?**\n\n The `userConfigFileName` constant defines the name of the user configuration file for the autodoc project, while the `userConfigFilePath` constant defines the absolute path to this file, which is located in the user's home directory under the `.config/autodoc/` folder.\n\n2. **Why are the `node:path` and `node:os` modules being imported?**\n\n The `node:path` module is imported to provide utilities for working with file and directory paths, such as resolving the absolute path to the user configuration file. The `node:os` module is imported to provide operating system-related utility methods, such as getting the user's home directory.\n\n3. **Is this code compatible with different operating systems?**\n\n Yes, this code is compatible with different operating systems. The `os.homedir()` method returns the home directory of the current user, which is platform-specific, and the `path.resolve()` method takes care of handling the correct path separators for the current operating system.", + "checksum": "ce40980fffc58e17b13690b9e37a6015" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/index.json b/.autodoc/docs/json/src/index.json index dff5d2c..ba5c65f 100644 --- a/.autodoc/docs/json/src/index.json +++ b/.autodoc/docs/json/src/index.json @@ -1,7 +1,8 @@ { "fileName": "index.ts", - "filePath": "src/index.ts", - "url": "https://github.com/context-labs/autodoc/src/index.ts", - "summary": "The code is a CLI (Command Line Interface) tool for the Autodoc project, which helps in generating documentation for a codebase. It uses the `commander` package to define and manage commands, and `inquirer` for interactive prompts. The main commands supported are `init`, `estimate`, `index`, `user`, and `q`.\n\n1. `init`: Initializes the repository by creating an `autodoc.config.json` file in the current directory. If the file already exists, it uses the existing configuration.\n ```bash\n autodoc init\n ```\n\n2. `estimate`: Estimates the cost of running the `index` command on the repository. It requires the `autodoc.config.json` file to be present.\n ```bash\n autodoc estimate\n ```\n\n3. `index`: Traverses the codebase, writes documentation using LLM (Language Model), and creates a locally stored index. It prompts the user to confirm before starting the indexing process.\n ```bash\n autodoc index\n ```\n\n4. `user`: Sets the Autodoc user configuration. If a user configuration file exists, it uses the existing configuration; otherwise, it creates a new one.\n ```bash\n autodoc user\n ```\n\n5. `q`: Queries an Autodoc index. It requires both `autodoc.config.json` and user configuration files to be present.\n ```bash\n autodoc q\n ```\n\nThe code also handles unhandled promise rejections by logging the error stack, showing an error spinner, stopping the spinner, and exiting with an error code.\n\nOverall, this CLI tool simplifies the process of generating documentation for a codebase by providing an easy-to-use interface for managing configurations and running the Autodoc project's core functionalities.", - "questions": "1. **Question:** What is the purpose of the `autodoc.config.json` file and how is it used in the code?\n **Answer:** The `autodoc.config.json` file is used to store the configuration for the Autodoc repository. It is read and parsed in various commands like `init`, `estimate`, `index`, and `q` to provide the necessary configuration for each command's execution.\n\n2. **Question:** How does the `estimate` command work and what does it do?\n **Answer:** The `estimate` command reads the `autodoc.config.json` file, parses it into a configuration object, and then calls the `estimate` function with the configuration. The purpose of this command is to estimate the cost of running the `index` command on the repository.\n\n3. **Question:** What is the purpose of the `user` command and how does it handle user configuration?\n **Answer:** The `user` command is used to set the Autodoc user configuration. It reads the user configuration file specified by `userConfigFilePath`, parses it into a configuration object, and then calls the `user` function with the configuration. If the configuration file is not found, it calls the `user` function without any configuration, allowing the user to set up their configuration." + "filePath": "src\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\index.ts", + "summary": "This code is the main entry point for the Autodoc CLI tool, which provides a set of commands to help developers automatically generate documentation for their codebase. The tool uses the `commander` library to define and handle commands, and `inquirer` for interactive prompts.\n\nThe available commands are:\n\n1. `init`: Initializes the repository by creating an `autodoc.config.json` file in the current directory. If the file already exists, it uses the existing configuration.\n ```bash\n autodoc init\n ```\n\n2. `estimate`: Estimates the cost of running the `index` command on the repository. It requires the `autodoc.config.json` file to be present.\n ```bash\n autodoc estimate\n ```\n\n3. `index`: Traverses the codebase, writes documentation using LLM, and creates a locally stored index. Before starting the indexing process, it prompts the user for confirmation. It requires the `autodoc.config.json` file to be present.\n ```bash\n autodoc index\n ```\n\n4. `user`: Sets the Autodoc user configuration. If a user configuration file exists, it uses the existing configuration.\n ```bash\n autodoc user\n ```\n\n5. `q`: Queries an Autodoc index. It requires both the `autodoc.config.json` and user configuration files to be present.\n ```bash\n autodoc q\n ```\n\nThe code also listens for unhandled promise rejections and handles them gracefully by showing an error spinner, stopping the spinner, and exiting with an error code.\n\nIn the larger project, this CLI tool serves as the primary interface for users to interact with Autodoc, allowing them to easily generate and manage documentation for their codebase.", + "questions": "1. **What is the purpose of the Autodoc CLI Tool?**\n\n The Autodoc CLI Tool is designed to help developers automatically generate documentation for their codebase by traversing the code, writing docs via LLM, and creating a locally stored index.\n\n2. **How does the `estimate` command work and what does it return?**\n\n The `estimate` command reads the `autodoc.config.json` file and estimates the cost of running the `index` command on the repository. It provides an estimation of the resources required to generate the documentation.\n\n3. **What is the role of the `user` command and how does it interact with the user configuration file?**\n\n The `user` command is responsible for setting the Autodoc user configuration. It reads the user configuration file (if it exists) and allows the user to update or create a new configuration. This configuration is then used in other commands, such as the `query` command, to interact with the Autodoc index.", + "checksum": "7bc160e4c4ef027d4968e3650a305a7d" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/langchain/hnswlib.json b/.autodoc/docs/json/src/langchain/hnswlib.json index 029a66b..12d9cdb 100644 --- a/.autodoc/docs/json/src/langchain/hnswlib.json +++ b/.autodoc/docs/json/src/langchain/hnswlib.json @@ -1,7 +1,8 @@ { "fileName": "hnswlib.ts", - "filePath": "src/langchain/hnswlib.ts", - "url": "https://github.com/context-labs/autodoc/src/langchain/hnswlib.ts", - "summary": "The `HNSWLib` class in this code is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. It extends the `SaveableVectorStore` class and provides methods for adding documents, searching for similar documents, and saving/loading the index.\n\nThe constructor takes an `Embeddings` object and an `HNSWLibArgs` object as arguments. The `Embeddings` object is used to convert text documents into numerical vectors, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing document metadata.\n\nThe `addDocuments` method takes an array of `Document` objects, converts their text content into numerical vectors using the `Embeddings` object, and adds the vectors to the HNSW index. The `addVectors` method is responsible for initializing the index, resizing it if necessary, and adding the vectors and their corresponding metadata to the `InMemoryDocstore`.\n\nThe `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents in the index along with their similarity scores. It checks if the query vector has the correct dimensions and if `k` is within the valid range before performing the search.\n\nThe `save` and `load` methods allow the HNSW index and its associated metadata to be saved to and loaded from a specified directory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of text strings or `Document` objects, respectively.\n\nExample usage:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst hnswLib = await HNSWLib.fromTexts(texts, metadatas, embeddings);\n\nconst queryVector = await embeddings.embedText(\"example query\");\nconst similarDocuments = await hnswLib.similaritySearchVectorWithScore(queryVector, 5);\n```\n\nIn the larger project, this class can be used to efficiently store and search for similar documents based on their embeddings, which can be useful for tasks such as document clustering, nearest neighbor search, and recommendation systems.", - "questions": "1. **Question:** What is the purpose of the `HNSWLib` class and how does it relate to the `SaveableVectorStore` class?\n **Answer:** The `HNSWLib` class is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. It extends the `SaveableVectorStore` class, which provides a base class for vector stores that can be saved and loaded from disk.\n\n2. **Question:** How does the `addDocuments` method work and what is its purpose?\n **Answer:** The `addDocuments` method takes an array of `Document` objects, extracts their `pageContent`, and embeds them into vectors using the `embedDocuments` method from the `embeddings` object. It then adds these vectors and the corresponding documents to the HNSW index and the `docstore` respectively.\n\n3. **Question:** How does the `similaritySearchVectorWithScore` method work and what does it return?\n **Answer:** The `similaritySearchVectorWithScore` method takes a query vector and a number `k` as input. It checks if the query vector has the same length as the number of dimensions and if `k` is not greater than the number of elements in the index. It then performs a k-nearest neighbors search on the HNSW index using the query vector and returns an array of `[Document, number]` tuples, where each tuple contains a document from the `docstore` and its corresponding distance score to the query vector." + "filePath": "src\\langchain\\hnswlib.ts", + "url": "https://github.com/context-labs/autodoc/src\\langchain\\hnswlib.ts", + "summary": "The `HNSWLib` class in this code is a specialized vector store that uses the Hierarchical Navigable Small World (HNSW) algorithm for efficient similarity search. It is built on top of the `hnswlib-node` library and extends the `SaveableVectorStore` class. The main purpose of this class is to store and search for documents based on their embeddings, which are high-dimensional vectors representing the documents' content.\n\nThe constructor of the `HNSWLib` class takes an `Embeddings` object and an `HNSWLibArgs` object as arguments. The `Embeddings` object is used to convert documents into their corresponding vector representations, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing the documents.\n\nThe `addDocuments` method takes an array of `Document` objects, converts them into embeddings using the `Embeddings` object, and adds them to the HNSW index. The `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents along with their similarity scores.\n\nThe `save` and `load` methods allow for persisting the HNSW index, document store, and configuration options to disk and loading them back into memory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of texts or documents, respectively.\n\nHere's an example of how to use the `HNSWLib` class:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst args = { space: 'cosine' };\nconst hnswLib = new HNSWLib(embeddings, args);\n\n// Add documents to the index\nawait hnswLib.addDocuments(documents);\n\n// Perform a similarity search\nconst queryVector = /* ... */;\nconst k = 10;\nconst results = await hnswLib.similaritySearchVectorWithScore(queryVector, k);\n```\n\nIn the larger project, the `HNSWLib` class can be used to efficiently store and search for documents based on their content similarity, which can be useful for tasks such as document clustering, recommendation systems, or information retrieval.", + "questions": "1. **Question**: What is the purpose of the `HNSWLib` class and how does it relate to the `SaveableVectorStore` class?\n **Answer**: The `HNSWLib` class is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. It extends the `SaveableVectorStore` class, which provides a base class for vector stores that can be saved and loaded from disk.\n\n2. **Question**: How does the `addDocuments` method work and what is its purpose?\n **Answer**: The `addDocuments` method takes an array of `Document` objects, extracts their `pageContent`, and embeds them using the provided `Embeddings` instance. It then adds the resulting vectors and documents to the HNSW index and the `InMemoryDocstore`, respectively.\n\n3. **Question**: How does the `similaritySearchVectorWithScore` method work and what does it return?\n **Answer**: The `similaritySearchVectorWithScore` method takes a query vector and a number `k` as input, and searches for the `k` most similar vectors in the HNSW index. It returns an array of tuples, where each tuple contains a `Document` object and its corresponding similarity score to the query vector.", + "checksum": "4725f6bfddda88355b55a980a1eae582" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/langchain/summary.json b/.autodoc/docs/json/src/langchain/summary.json index 4ec7faf..d88bc7c 100644 --- a/.autodoc/docs/json/src/langchain/summary.json +++ b/.autodoc/docs/json/src/langchain/summary.json @@ -1,17 +1,19 @@ { "folderName": "langchain", - "folderPath": ".autodoc/docs/json/src/langchain", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/langchain", + "folderPath": ".autodoc\\docs\\json\\src\\langchain", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\langchain", "files": [ { "fileName": "hnswlib.ts", - "filePath": "src/langchain/hnswlib.ts", - "url": "https://github.com/context-labs/autodoc/src/langchain/hnswlib.ts", - "summary": "The `HNSWLib` class in this code is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. It extends the `SaveableVectorStore` class and provides methods for adding documents, searching for similar documents, and saving/loading the index.\n\nThe constructor takes an `Embeddings` object and an `HNSWLibArgs` object as arguments. The `Embeddings` object is used to convert text documents into numerical vectors, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing document metadata.\n\nThe `addDocuments` method takes an array of `Document` objects, converts their text content into numerical vectors using the `Embeddings` object, and adds the vectors to the HNSW index. The `addVectors` method is responsible for initializing the index, resizing it if necessary, and adding the vectors and their corresponding metadata to the `InMemoryDocstore`.\n\nThe `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents in the index along with their similarity scores. It checks if the query vector has the correct dimensions and if `k` is within the valid range before performing the search.\n\nThe `save` and `load` methods allow the HNSW index and its associated metadata to be saved to and loaded from a specified directory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of text strings or `Document` objects, respectively.\n\nExample usage:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst hnswLib = await HNSWLib.fromTexts(texts, metadatas, embeddings);\n\nconst queryVector = await embeddings.embedText(\"example query\");\nconst similarDocuments = await hnswLib.similaritySearchVectorWithScore(queryVector, 5);\n```\n\nIn the larger project, this class can be used to efficiently store and search for similar documents based on their embeddings, which can be useful for tasks such as document clustering, nearest neighbor search, and recommendation systems.", - "questions": "1. **Question:** What is the purpose of the `HNSWLib` class and how does it relate to the `SaveableVectorStore` class?\n **Answer:** The `HNSWLib` class is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. It extends the `SaveableVectorStore` class, which provides a base class for vector stores that can be saved and loaded from disk.\n\n2. **Question:** How does the `addDocuments` method work and what is its purpose?\n **Answer:** The `addDocuments` method takes an array of `Document` objects, extracts their `pageContent`, and embeds them into vectors using the `embedDocuments` method from the `embeddings` object. It then adds these vectors and the corresponding documents to the HNSW index and the `docstore` respectively.\n\n3. **Question:** How does the `similaritySearchVectorWithScore` method work and what does it return?\n **Answer:** The `similaritySearchVectorWithScore` method takes a query vector and a number `k` as input. It checks if the query vector has the same length as the number of dimensions and if `k` is not greater than the number of elements in the index. It then performs a k-nearest neighbors search on the HNSW index using the query vector and returns an array of `[Document, number]` tuples, where each tuple contains a document from the `docstore` and its corresponding distance score to the query vector." + "filePath": "src\\langchain\\hnswlib.ts", + "url": "https://github.com/context-labs/autodoc/src\\langchain\\hnswlib.ts", + "summary": "The `HNSWLib` class in this code is a specialized vector store that uses the Hierarchical Navigable Small World (HNSW) algorithm for efficient similarity search. It is built on top of the `hnswlib-node` library and extends the `SaveableVectorStore` class. The main purpose of this class is to store and search for documents based on their embeddings, which are high-dimensional vectors representing the documents' content.\n\nThe constructor of the `HNSWLib` class takes an `Embeddings` object and an `HNSWLibArgs` object as arguments. The `Embeddings` object is used to convert documents into their corresponding vector representations, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing the documents.\n\nThe `addDocuments` method takes an array of `Document` objects, converts them into embeddings using the `Embeddings` object, and adds them to the HNSW index. The `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents along with their similarity scores.\n\nThe `save` and `load` methods allow for persisting the HNSW index, document store, and configuration options to disk and loading them back into memory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of texts or documents, respectively.\n\nHere's an example of how to use the `HNSWLib` class:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst args = { space: 'cosine' };\nconst hnswLib = new HNSWLib(embeddings, args);\n\n// Add documents to the index\nawait hnswLib.addDocuments(documents);\n\n// Perform a similarity search\nconst queryVector = /* ... */;\nconst k = 10;\nconst results = await hnswLib.similaritySearchVectorWithScore(queryVector, k);\n```\n\nIn the larger project, the `HNSWLib` class can be used to efficiently store and search for documents based on their content similarity, which can be useful for tasks such as document clustering, recommendation systems, or information retrieval.", + "questions": "1. **Question**: What is the purpose of the `HNSWLib` class and how does it relate to the `SaveableVectorStore` class?\n **Answer**: The `HNSWLib` class is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. It extends the `SaveableVectorStore` class, which provides a base class for vector stores that can be saved and loaded from disk.\n\n2. **Question**: How does the `addDocuments` method work and what is its purpose?\n **Answer**: The `addDocuments` method takes an array of `Document` objects, extracts their `pageContent`, and embeds them using the provided `Embeddings` instance. It then adds the resulting vectors and documents to the HNSW index and the `InMemoryDocstore`, respectively.\n\n3. **Question**: How does the `similaritySearchVectorWithScore` method work and what does it return?\n **Answer**: The `similaritySearchVectorWithScore` method takes a query vector and a number `k` as input, and searches for the `k` most similar vectors in the HNSW index. It returns an array of tuples, where each tuple contains a `Document` object and its corresponding similarity score to the query vector.", + "checksum": "4725f6bfddda88355b55a980a1eae582" } ], "folders": [], - "summary": "The `hnswlib.ts` file in the `.autodoc/docs/json/src/langchain` folder contains the `HNSWLib` class, which is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. This class is designed to efficiently store and search for similar documents based on their embeddings, making it useful for tasks such as document clustering, nearest neighbor search, and recommendation systems.\n\nThe `HNSWLib` class extends the `SaveableVectorStore` class and provides methods for adding documents, searching for similar documents, and saving/loading the index. It takes an `Embeddings` object and an `HNSWLibArgs` object as arguments in its constructor. The `Embeddings` object is responsible for converting text documents into numerical vectors, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing document metadata.\n\nThe `addDocuments` method accepts an array of `Document` objects, converts their text content into numerical vectors using the `Embeddings` object, and adds the vectors to the HNSW index. The `addVectors` method initializes the index, resizes it if necessary, and adds the vectors and their corresponding metadata to the `InMemoryDocstore`.\n\nThe `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents in the index along with their similarity scores. It checks if the query vector has the correct dimensions and if `k` is within the valid range before performing the search.\n\nThe `save` and `load` methods allow the HNSW index and its associated metadata to be saved to and loaded from a specified directory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of text strings or `Document` objects, respectively.\n\nHere's an example of how this code might be used:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst hnswLib = await HNSWLib.fromTexts(texts, metadatas, embeddings);\n\nconst queryVector = await embeddings.embedText(\"example query\");\nconst similarDocuments = await hnswLib.similaritySearchVectorWithScore(queryVector, 5);\n```\n\nIn the larger project, the `HNSWLib` class can be integrated with other components to build efficient and scalable systems for document similarity search, clustering, and recommendations based on text embeddings.", - "questions": "" + "summary": "The `hnswlib.ts` file in the `.autodoc\\docs\\json\\src\\langchain` folder contains the `HNSWLib` class, which is a specialized vector store utilizing the Hierarchical Navigable Small World (HNSW) algorithm for efficient similarity search. This class is built on top of the `hnswlib-node` library and extends the `SaveableVectorStore` class. Its primary purpose is to store and search for documents based on their embeddings, which are high-dimensional vectors representing the documents' content.\n\nThe `HNSWLib` class constructor takes an `Embeddings` object and an `HNSWLibArgs` object as arguments. The `Embeddings` object is responsible for converting documents into their corresponding vector representations, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing the documents.\n\nThe `addDocuments` method accepts an array of `Document` objects, converts them into embeddings using the `Embeddings` object, and adds them to the HNSW index. The `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents along with their similarity scores.\n\nThe `save` and `load` methods enable persisting the HNSW index, document store, and configuration options to disk and loading them back into memory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of texts or documents, respectively.\n\nIn the larger project, the `HNSWLib` class can be employed to efficiently store and search for documents based on their content similarity, which can be beneficial for tasks such as document clustering, recommendation systems, or information retrieval.\n\nHere's an example of how to use the `HNSWLib` class:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst args = { space: 'cosine' };\nconst hnswLib = new HNSWLib(embeddings, args);\n\n// Add documents to the index\nawait hnswLib.addDocuments(documents);\n\n// Perform a similarity search\nconst queryVector = /* ... */;\nconst k = 10;\nconst results = await hnswLib.similaritySearchVectorWithScore(queryVector, k);\n```\n\nThis code snippet demonstrates how to create an `HNSWLib` instance, add documents to the index, and perform a similarity search. The results can then be used for various purposes, such as finding related documents or generating recommendations based on content similarity.", + "questions": "", + "checksum": "ccbe47bddb9d048f35d29fb2d8c04d7f" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/summary.json b/.autodoc/docs/json/src/summary.json index 0e74775..7b33b05 100644 --- a/.autodoc/docs/json/src/summary.json +++ b/.autodoc/docs/json/src/summary.json @@ -1,242 +1,272 @@ { "folderName": "src", - "folderPath": ".autodoc/docs/json/src", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src", + "folderPath": ".autodoc\\docs\\json\\src", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src", "files": [ { "fileName": "const.ts", - "filePath": "src/const.ts", - "url": "https://github.com/context-labs/autodoc/src/const.ts", - "summary": "The code in this file is responsible for managing the user configuration file for the Autodoc project. It imports two Node.js built-in modules, `path` and `os`, which are used to handle file paths and operating system-related utility functions, respectively.\n\nThe `userConfigFileName` constant is defined as `'autodoc.user.json'`. This constant represents the name of the user configuration file that will be used by the Autodoc project.\n\nThe `userConfigFilePath` constant is created using the `path.resolve()` function, which resolves a sequence of paths into an absolute path. It takes three arguments:\n\n1. `os.homedir()`: This function returns the current user's home directory. It ensures that the user configuration file is stored in the user's home directory, making it user-specific.\n2. `'./.config/autodoc/'`: This string specifies the subdirectory within the user's home directory where the configuration file will be stored. The `.config` directory is a common location for storing configuration files on Unix-based systems, and the `autodoc` subdirectory is used to keep the Autodoc configuration files organized.\n3. `userConfigFileName`: This constant is used as the file name for the user configuration file.\n\nThe `userConfigFilePath` constant will store the absolute path to the user configuration file, which can be used by other parts of the Autodoc project to read or write user-specific settings.\n\nIn summary, this code is responsible for defining the location and name of the user configuration file for the Autodoc project. It ensures that the configuration file is stored in a user-specific directory and follows a standard naming convention. This allows the Autodoc project to easily manage user-specific settings and preferences.", - "questions": "1. **What is the purpose of the `userConfigFileName` and `userConfigFilePath` constants?**\n\n The `userConfigFileName` constant defines the name of the user configuration file for the autodoc project, while the `userConfigFilePath` constant defines the absolute path to this file, which is located in the user's home directory under the `.config/autodoc/` folder.\n\n2. **Why are the `node:path` and `node:os` modules imported?**\n\n The `node:path` module is imported to provide utilities for working with file and directory paths, such as the `path.resolve()` function used to construct the `userConfigFilePath`. The `node:os` module is imported to provide operating system-related utility methods, such as `os.homedir()` which returns the current user's home directory.\n\n3. **Is this code compatible with different operating systems?**\n\n Yes, this code is compatible with different operating systems. The `os.homedir()` function from the `node:os` module returns the correct home directory path for the current user, regardless of the operating system. Additionally, the `path.resolve()` function from the `node:path` module handles path separators and other OS-specific details, ensuring the correct file path is generated." + "filePath": "src\\const.ts", + "url": "https://github.com/context-labs/autodoc/src\\const.ts", + "summary": "The code in this file is responsible for managing the user configuration file for the autodoc project. It imports two Node.js built-in modules, `path` and `os`, which are used to handle file paths and operating system-related utility functions, respectively.\n\nThe `userConfigFileName` constant is defined as `'autodoc.user.json'`, which represents the name of the user configuration file. This file is expected to store user-specific settings for the autodoc project in JSON format.\n\nThe `userConfigFilePath` constant is created using the `path.resolve()` function, which combines the provided arguments into an absolute file path. The `os.homedir()` function is used to get the current user's home directory, and `./.config/autodoc/` is appended to it as the folder where the user configuration file should be stored. Finally, the `userConfigFileName` constant is appended to the path, resulting in the complete file path for the user configuration file.\n\nBy exporting both `userConfigFileName` and `userConfigFilePath`, other parts of the autodoc project can easily access and use these constants to read or write user-specific settings. For example, when the autodoc application starts, it can read the user configuration file from the specified path, and apply the settings accordingly.\n\nHere's a code example of how these constants might be used in another part of the autodoc project:\n\n```javascript\nimport { userConfigFilePath } from './path/to/this/file';\n\n// Read user configuration from the file\nconst userConfig = JSON.parse(fs.readFileSync(userConfigFilePath, 'utf-8'));\n\n// Apply user settings\napplyUserSettings(userConfig);\n```\n\nIn summary, this code is responsible for defining the name and file path of the user configuration file for the autodoc project, allowing other parts of the project to easily access and manage user-specific settings.", + "questions": "1. **What is the purpose of the `userConfigFileName` and `userConfigFilePath` constants?**\n\n The `userConfigFileName` constant defines the name of the user configuration file for the autodoc project, while the `userConfigFilePath` constant defines the absolute path to this file, which is located in the user's home directory under the `.config/autodoc/` folder.\n\n2. **Why are the `node:path` and `node:os` modules being imported?**\n\n The `node:path` module is imported to provide utilities for working with file and directory paths, such as resolving the absolute path to the user configuration file. The `node:os` module is imported to provide operating system-related utility methods, such as getting the user's home directory.\n\n3. **Is this code compatible with different operating systems?**\n\n Yes, this code is compatible with different operating systems. The `os.homedir()` method returns the home directory of the current user, which is platform-specific, and the `path.resolve()` method takes care of handling the correct path separators for the current operating system.", + "checksum": "ce40980fffc58e17b13690b9e37a6015" }, { "fileName": "index.ts", - "filePath": "src/index.ts", - "url": "https://github.com/context-labs/autodoc/src/index.ts", - "summary": "The code is a CLI (Command Line Interface) tool for the Autodoc project, which helps in generating documentation for a codebase. It uses the `commander` package to define and manage commands, and `inquirer` for interactive prompts. The main commands supported are `init`, `estimate`, `index`, `user`, and `q`.\n\n1. `init`: Initializes the repository by creating an `autodoc.config.json` file in the current directory. If the file already exists, it uses the existing configuration.\n ```bash\n autodoc init\n ```\n\n2. `estimate`: Estimates the cost of running the `index` command on the repository. It requires the `autodoc.config.json` file to be present.\n ```bash\n autodoc estimate\n ```\n\n3. `index`: Traverses the codebase, writes documentation using LLM (Language Model), and creates a locally stored index. It prompts the user to confirm before starting the indexing process.\n ```bash\n autodoc index\n ```\n\n4. `user`: Sets the Autodoc user configuration. If a user configuration file exists, it uses the existing configuration; otherwise, it creates a new one.\n ```bash\n autodoc user\n ```\n\n5. `q`: Queries an Autodoc index. It requires both `autodoc.config.json` and user configuration files to be present.\n ```bash\n autodoc q\n ```\n\nThe code also handles unhandled promise rejections by logging the error stack, showing an error spinner, stopping the spinner, and exiting with an error code.\n\nOverall, this CLI tool simplifies the process of generating documentation for a codebase by providing an easy-to-use interface for managing configurations and running the Autodoc project's core functionalities.", - "questions": "1. **Question:** What is the purpose of the `autodoc.config.json` file and how is it used in the code?\n **Answer:** The `autodoc.config.json` file is used to store the configuration for the Autodoc repository. It is read and parsed in various commands like `init`, `estimate`, `index`, and `q` to provide the necessary configuration for each command's execution.\n\n2. **Question:** How does the `estimate` command work and what does it do?\n **Answer:** The `estimate` command reads the `autodoc.config.json` file, parses it into a configuration object, and then calls the `estimate` function with the configuration. The purpose of this command is to estimate the cost of running the `index` command on the repository.\n\n3. **Question:** What is the purpose of the `user` command and how does it handle user configuration?\n **Answer:** The `user` command is used to set the Autodoc user configuration. It reads the user configuration file specified by `userConfigFilePath`, parses it into a configuration object, and then calls the `user` function with the configuration. If the configuration file is not found, it calls the `user` function without any configuration, allowing the user to set up their configuration." + "filePath": "src\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\index.ts", + "summary": "This code is the main entry point for the Autodoc CLI tool, which provides a set of commands to help developers automatically generate documentation for their codebase. The tool uses the `commander` library to define and handle commands, and `inquirer` for interactive prompts.\n\nThe available commands are:\n\n1. `init`: Initializes the repository by creating an `autodoc.config.json` file in the current directory. If the file already exists, it uses the existing configuration.\n ```bash\n autodoc init\n ```\n\n2. `estimate`: Estimates the cost of running the `index` command on the repository. It requires the `autodoc.config.json` file to be present.\n ```bash\n autodoc estimate\n ```\n\n3. `index`: Traverses the codebase, writes documentation using LLM, and creates a locally stored index. Before starting the indexing process, it prompts the user for confirmation. It requires the `autodoc.config.json` file to be present.\n ```bash\n autodoc index\n ```\n\n4. `user`: Sets the Autodoc user configuration. If a user configuration file exists, it uses the existing configuration.\n ```bash\n autodoc user\n ```\n\n5. `q`: Queries an Autodoc index. It requires both the `autodoc.config.json` and user configuration files to be present.\n ```bash\n autodoc q\n ```\n\nThe code also listens for unhandled promise rejections and handles them gracefully by showing an error spinner, stopping the spinner, and exiting with an error code.\n\nIn the larger project, this CLI tool serves as the primary interface for users to interact with Autodoc, allowing them to easily generate and manage documentation for their codebase.", + "questions": "1. **What is the purpose of the Autodoc CLI Tool?**\n\n The Autodoc CLI Tool is designed to help developers automatically generate documentation for their codebase by traversing the code, writing docs via LLM, and creating a locally stored index.\n\n2. **How does the `estimate` command work and what does it return?**\n\n The `estimate` command reads the `autodoc.config.json` file and estimates the cost of running the `index` command on the repository. It provides an estimation of the resources required to generate the documentation.\n\n3. **What is the role of the `user` command and how does it interact with the user configuration file?**\n\n The `user` command is responsible for setting the Autodoc user configuration. It reads the user configuration file (if it exists) and allows the user to update or create a new configuration. This configuration is then used in other commands, such as the `query` command, to interact with the Autodoc index.", + "checksum": "7bc160e4c4ef027d4968e3650a305a7d" }, { "fileName": "types.ts", - "filePath": "src/types.ts", - "url": "https://github.com/context-labs/autodoc/src/types.ts", - "summary": "This code defines the types and interfaces for the `autodoc` project, which aims to automatically generate documentation for a given code repository. The project uses OpenAI's language models (LLMs) to process and generate summaries, questions, and other relevant information for files and folders within the repository.\n\nThe code starts by importing `OpenAIChat` from the `langchain/llms` package. It then defines several types and interfaces that are used throughout the project:\n\n- `AutodocUserConfig`: Represents the user configuration for the autodoc project, including the LLM models to be used.\n- `AutodocRepoConfig`: Represents the configuration for a specific repository, including its name, URL, root directory, output directory, LLM models, and other settings.\n- `FileSummary` and `FolderSummary`: Represent the summaries and questions generated for files and folders, respectively.\n- `ProcessFileParams`, `ProcessFolderParams`, and `TraverseFileSystemParams`: Define the parameters for processing files, folders, and traversing the file system, respectively.\n- `ProcessFile` and `ProcessFolder`: Define the function types for processing files and folders, respectively.\n- `LLMModels`: Enumerates the available LLM models, such as GPT-3.5-turbo, GPT-4, and GPT-4-32k.\n- `LLMModelDetails`: Represents the details of an LLM model, including its name, cost per 1K tokens, maximum length, and other statistics.\n\nFor example, when using this code in the larger project, you might define a `ProcessFile` function that takes a `ProcessFileParams` object as input and generates a summary and questions for the file using the specified LLM model. Similarly, you could define a `ProcessFolder` function that processes all files and subfolders within a folder, generating summaries and questions for each.\n\nThe `TraverseFileSystemParams` type allows you to configure how the file system is traversed, including specifying which files and folders to ignore, and what prompts to use for generating summaries and questions.\n\nOverall, this code provides the foundation for the `autodoc` project by defining the types and interfaces needed to process code repositories and generate documentation using OpenAI's language models.", - "questions": "1. **Question:** What is the purpose of the `LLMModels` enum and how is it used in the code?\n **Answer:** The `LLMModels` enum defines the available language models for the autodoc project. It is used in the `AutodocUserConfig` and `AutodocRepoConfig` types to specify which language models should be used for processing files and folders.\n\n2. **Question:** What are the `ProcessFile` and `ProcessFolder` types and how are they used in the code?\n **Answer:** `ProcessFile` and `ProcessFolder` are types for functions that process a file or a folder, respectively. They are used as optional parameters in the `TraverseFileSystemParams` type, allowing developers to provide custom processing functions when traversing the file system.\n\n3. **Question:** What is the purpose of the `TraverseFileSystemParams` type and how is it used in the code?\n **Answer:** The `TraverseFileSystemParams` type defines the parameters required for traversing the file system. It is used to pass configuration options, such as input path, project name, custom processing functions, and other settings, to a function that will traverse the file system and process files and folders accordingly." + "filePath": "src\\types.ts", + "url": "https://github.com/context-labs/autodoc/src\\types.ts", + "summary": "This code defines the types and interfaces for the `autodoc` project, which aims to automatically generate documentation for a given code repository. The project uses OpenAI's language models (LLMs) to process and generate summaries, questions, and other relevant information for files and folders in the repository.\n\nThe `AutodocUserConfig` and `AutodocRepoConfig` types define the configuration options for the user and repository, respectively. These include settings such as the LLM models to use, repository URL, output directory, and content type.\n\n`FileSummary` and `FolderSummary` types represent the generated summaries for files and folders, including their paths, URLs, and checksums. The `ProcessFileParams` and `ProcessFolderParams` types define the parameters required for processing files and folders, such as the file or folder name, path, and content type.\n\n`ProcessFile` and `ProcessFolder` are function types that take the respective parameters and return a promise. These functions are responsible for processing the files and folders, generating summaries, and updating the documentation.\n\n`TraverseFileSystemParams` type defines the parameters for traversing the file system, including the input path, project name, and optional `processFile` and `processFolder` functions. It also includes settings for ignoring certain files or folders and content type preferences.\n\nThe `LLMModels` enum lists the available language models, such as GPT-3.5 Turbo, GPT-4, and GPT-4 32k. The `LLMModelDetails` type provides information about each model, including the cost per 1K tokens, maximum length, and success/failure statistics.\n\nIn the larger project, these types and interfaces would be used to configure and run the `autodoc` tool, allowing users to automatically generate documentation for their code repositories using OpenAI's language models. For example, a user could provide an `AutodocRepoConfig` object to configure the tool, and then use the `TraverseFileSystem` function to process the repository and generate the documentation.", + "questions": "1. **What is the purpose of the `AutodocUserConfig` and `AutodocRepoConfig` types?**\n\n The `AutodocUserConfig` type is used to define the user configuration for the autodoc project, which includes an array of LLMModels. The `AutodocRepoConfig` type is used to define the repository configuration for the autodoc project, which includes various properties such as name, repository URL, root, output, LLMModels, and more.\n\n2. **What are the different LLMModels available in the `LLMModels` enum?**\n\n The `LLMModels` enum lists the available language models for the autodoc project. Currently, there are three models: GPT3 (gpt-3.5-turbo), GPT4 (gpt-4), and GPT432k (gpt-4-32k).\n\n3. **What is the purpose of the `ProcessFile` and `ProcessFolder` types?**\n\n The `ProcessFile` type is a function type that takes a `ProcessFileParams` object as input and returns a Promise. It is used to process a single file in the autodoc project. The `ProcessFolder` type is a function type that takes a `ProcessFolderParams` object as input and returns a Promise. It is used to process a folder in the autodoc project.", + "checksum": "796822d4da09cce719cb86b540d2fb66" } ], "folders": [ { "folderName": "cli", - "folderPath": ".autodoc/docs/json/src/cli", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli", + "folderPath": ".autodoc\\docs\\json\\src\\cli", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli", "files": [ { "fileName": "spinner.ts", - "filePath": "src/cli/spinner.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/spinner.ts", - "summary": "This code provides a utility for managing a command-line spinner using the `ora` library. The spinner is a visual indicator that displays a series of characters in a loop, giving the user feedback that a process is running in the background. The code exports several functions to control the spinner's behavior, such as updating the text, stopping the spinner, and displaying success, error, or informational messages.\n\nThe `spinner` object is created as a singleton to ensure that there is only one instance of the spinner at any given time. This prevents multiple spinners from being displayed simultaneously, which could cause confusion for the user. The spinner is configured to use the 'dots' style.\n\nThe `updateSpinnerText` function is used to update the spinner's text. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the given message. For example:\n\n```javascript\nupdateSpinnerText('Loading data...');\n```\n\nThe `stopSpinner` function stops the spinner if it is currently spinning:\n\n```javascript\nstopSpinner();\n```\n\nThe `spinnerError`, `spinnerSuccess`, and `spinnerInfo` functions are used to display error, success, and informational messages, respectively. These functions first check if the spinner is spinning and then call the appropriate `ora` method to display the message with the corresponding status symbol (e.g., a red cross for errors, a green checkmark for success, etc.):\n\n```javascript\nspinnerError('An error occurred');\nspinnerSuccess('Operation completed successfully');\nspinnerInfo('Please wait...');\n```\n\nIn the larger project, this utility can be used to provide a consistent and user-friendly interface for displaying progress and status messages during long-running tasks or processes.", - "questions": "1. **What is the purpose of the `ora` package in this code?**\n\n The `ora` package is used to create a spinner in the terminal, providing a visual indication of a running process. In this code, it is used to create a singleton spinner with the 'dots' style.\n\n2. **What are the different states of the spinner and how are they updated?**\n\n The spinner can have different states such as spinning, stopped, failed, succeeded, and displaying information. The functions `updateSpinnerText`, `stopSpinner`, `spinnerError`, `spinnerSuccess`, and `spinnerInfo` are used to update the spinner's state and text accordingly.\n\n3. **How does the `updateSpinnerText` function work and when should it be used?**\n\n The `updateSpinnerText` function updates the spinner's text with the provided message. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the new message. This function should be used when you want to change the spinner's text while it is spinning or start it with a new message." + "filePath": "src\\cli\\spinner.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\spinner.ts", + "summary": "This code is responsible for managing a spinner, which is a visual element that indicates a process is running in the background. The spinner is created using the `ora` library, which provides a simple and customizable way to create spinners for command-line interfaces.\n\nThe code starts by importing the `ora` library and creating a singleton spinner instance with the 'dots' style. This ensures that there will only be one spinner active at any given time.\n\nThere are several functions exported by this module to interact with the spinner:\n\n1. `updateSpinnerText(message: string)`: This function updates the spinner's text with the provided message. If the spinner is already spinning, it simply updates the text; otherwise, it starts the spinner with the new message.\n\n Example usage:\n ```javascript\n updateSpinnerText('Loading data...');\n ```\n\n2. `stopSpinner()`: This function stops the spinner if it is currently spinning.\n\n Example usage:\n ```javascript\n stopSpinner();\n ```\n\n3. `spinnerError(message?: string)`: This function stops the spinner and marks it as failed with an optional error message. It only takes effect if the spinner is currently spinning.\n\n Example usage:\n ```javascript\n spinnerError('Failed to load data');\n ```\n\n4. `spinnerSuccess(message?: string)`: This function stops the spinner and marks it as successful with an optional success message. It only takes effect if the spinner is currently spinning.\n\n Example usage:\n ```javascript\n spinnerSuccess('Data loaded successfully');\n ```\n\n5. `spinnerInfo(message: string)`: This function displays an informational message without affecting the spinner's state.\n\n Example usage:\n ```javascript\n spinnerInfo('Connecting to server...');\n ```\n\nIn the larger project, this module can be used to provide visual feedback to users when a background process is running, such as loading data, connecting to a server, or performing a complex calculation. By using the exported functions, developers can easily update the spinner's text, stop it, or change its state to indicate success, failure, or display informational messages.", + "questions": "1. **What is the purpose of the `ora` package in this code?**\n\n The `ora` package is used to create a spinner in the command line interface, providing a visual indication of a running process. In this code, it is used to create a singleton spinner with the 'dots' style.\n\n2. **How does the `updateSpinnerText` function work?**\n\n The `updateSpinnerText` function takes a message as an input and updates the spinner's text with the given message. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the new message.\n\n3. **What are the differences between `spinnerError`, `spinnerSuccess`, and `spinnerInfo` functions?**\n\n These functions are used to update the spinner's state and message based on the outcome of a process. `spinnerError` is called when there is an error, and it stops the spinner with a failure message. `spinnerSuccess` is called when the process is successful, and it stops the spinner with a success message. `spinnerInfo` is used to display an informational message without stopping the spinner.", + "checksum": "d93ad7e714ce5446916bb1d63cbb6031" } ], "folders": [ { "folderName": "commands", - "folderPath": ".autodoc/docs/json/src/cli/commands", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands", "files": [], "folders": [ { "folderName": "estimate", - "folderPath": ".autodoc/docs/json/src/cli/commands/estimate", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/estimate", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\estimate", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\estimate", "files": [ { "fileName": "index.ts", - "filePath": "src/cli/commands/estimate/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/estimate/index.ts", - "summary": "The `estimate` function in this code file is responsible for providing an estimated cost of indexing a given repository using the AutodocRepoConfig configuration. This function is particularly useful for users who want to get an idea of the cost involved in processing their repository before actually running the process.\n\nThe function takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe main steps involved in the function are:\n\n1. Set the output path for the JSON files generated during the process.\n2. Update the spinner text to display \"Estimating cost...\".\n3. Perform a dry run of the `processRepository` function with the given configuration options. The dry run does not actually process the repository but instead returns the details of the models that would be processed.\n4. Stop the spinner once the dry run is complete.\n5. Print the details of the models obtained from the dry run using the `printModelDetails` utility function.\n6. Calculate the total estimated cost using the `totalIndexCostEstimate` utility function.\n7. Display the estimated cost in a user-friendly format using the `chalk` library.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './autodoc/estimate';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output/',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example demonstrates how a user can call the `estimate` function with a specific configuration to get an estimated cost for processing their repository.", - "questions": "1. **What is the purpose of the `estimate` function and what parameters does it accept?**\n\n The `estimate` function is used to estimate the cost of processing a repository for indexing. It accepts an `AutodocRepoConfig` object as a parameter, which contains various configuration options such as repository URL, output path, and other settings.\n\n2. **How does the `estimate` function calculate the cost estimate?**\n\n The `estimate` function performs a dry run of the `processRepository` command to get the estimated price for indexing the repository. It then uses the `totalIndexCostEstimate` function to calculate the total cost based on the returned run details.\n\n3. **What is the purpose of the `printModelDetails` function and how is it used in the `estimate` function?**\n\n The `printModelDetails` function is used to display the details of the models used in the estimation process. In the `estimate` function, it is called with the values of the `runDetails` object to print the model details before displaying the total cost estimate." + "filePath": "src\\cli\\commands\\estimate\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\estimate\\index.ts", + "summary": "The `estimate` function in this code is responsible for providing an estimated cost of processing a given repository using the Autodoc project. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe function starts by constructing the path to the JSON output directory, which will be used to store the intermediate results of the processing. It then updates the spinner text to indicate that the cost estimation is in progress.\n\nNext, the `processRepository` function is called with the provided configuration options and a `true` flag to indicate that this is a dry run. This means that the repository will not actually be processed, but the function will return the details of what would happen if it were processed. This is used to calculate the estimated cost of processing the repository.\n\nOnce the dry run is complete, the spinner is updated to show success, and the results are printed using the `printModelDetails` function. The total estimated cost is then calculated using the `totalIndexCostEstimate` function, which takes the values of the `runDetails` object as input.\n\nFinally, the estimated cost is displayed in the console using the `chalk.redBright` function to format the text in a red color. The message also includes a disclaimer that the actual cost may vary and recommends setting a limit in the user's OpenAI account to prevent unexpected charges.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './path/to/this/file';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example would estimate the cost of processing the \"my-repo\" repository with the specified configuration options.", + "questions": "1. **What is the purpose of the `estimate` function?**\n\n The `estimate` function is used to perform a dry run of the `processRepository` command to get an estimated price for indexing the given repository. It then prints the model details and the total estimated cost.\n\n2. **What are the parameters passed to the `processRepository` function?**\n\n The `processRepository` function is called with an object containing the following properties: `name`, `repositoryUrl`, `root`, `output`, `llms`, `ignore`, `filePrompt`, `folderPrompt`, `chatPrompt`, `contentType`, `targetAudience`, and `linkHosted`. Additionally, a second argument `true` is passed to indicate that it's a dry run.\n\n3. **How is the total estimated cost calculated and displayed?**\n\n The total estimated cost is calculated using the `totalIndexCostEstimate` function, which takes an array of values from the `runDetails` object. The cost is then displayed using `console.log` with `chalk.redBright` for formatting, showing the cost with two decimal places and a note that the actual cost may vary.", + "checksum": "2b0b3903432ae423bbc597d04b052ecb" } ], "folders": [], - "summary": "The `estimate` function in `index.ts` is a crucial part of the Autodoc project, as it allows users to estimate the cost of indexing a given repository before actually processing it. This function takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository.\n\nThe main steps involved in the `estimate` function are:\n\n1. Setting the output path for the JSON files generated during the process.\n2. Updating the spinner text to display \"Estimating cost...\".\n3. Performing a dry run of the `processRepository` function with the given configuration options. The dry run does not actually process the repository but instead returns the details of the models that would be processed.\n4. Stopping the spinner once the dry run is complete.\n5. Printing the details of the models obtained from the dry run using the `printModelDetails` utility function.\n6. Calculating the total estimated cost using the `totalIndexCostEstimate` utility function.\n7. Displaying the estimated cost in a user-friendly format using the `chalk` library.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './autodoc/estimate';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output/',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example demonstrates how a user can call the `estimate` function with a specific configuration to get an estimated cost for processing their repository. The function is designed to work seamlessly with other parts of the Autodoc project, such as the `processRepository` function, which is responsible for the actual processing of the repository.\n\nBy providing an estimated cost upfront, the `estimate` function helps users make informed decisions about whether to proceed with the indexing process or not. This can be particularly useful for users with large repositories or those who are working within a budget. Overall, the `estimate` function is an essential tool for users looking to leverage the power of Autodoc while managing their costs effectively.", - "questions": "" + "summary": "The `estimate` function in `index.ts` is a crucial part of the Autodoc project, as it provides an estimated cost of processing a given repository. It takes an `AutodocRepoConfig` object as input, containing various configuration options such as repository name, URL, root directory, output directory, and other settings related to the processing of the repository.\n\nThe function begins by constructing the path to the JSON output directory, which stores intermediate results of the processing. It then updates the spinner text to indicate that cost estimation is in progress. The `processRepository` function is called with the provided configuration options and a `true` flag, signifying a dry run. This dry run returns the details of what would happen if the repository were processed, which is used to calculate the estimated cost.\n\nUpon completion of the dry run, the spinner is updated to show success, and the results are printed using the `printModelDetails` function. The total estimated cost is calculated using the `totalIndexCostEstimate` function, which takes the values of the `runDetails` object as input.\n\nFinally, the estimated cost is displayed in the console using the `chalk.redBright` function to format the text in red. The message also includes a disclaimer that the actual cost may vary and recommends setting a limit in the user's OpenAI account to prevent unexpected charges.\n\nHere's an example of how the `estimate` function might be used in the larger project:\n\n```javascript\nimport { estimate } from './path/to/this/file';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\nThis example would estimate the cost of processing the \"my-repo\" repository with the specified configuration options.", + "questions": "", + "checksum": "4b8fd2b2abaec4959873fc3396c414d8" }, { "folderName": "index", - "folderPath": ".autodoc/docs/json/src/cli/commands/index", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/index", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\index", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\index", "files": [ { "fileName": "convertJsonToMarkdown.ts", - "filePath": "src/cli/commands/index/convertJsonToMarkdown.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/convertJsonToMarkdown.ts", - "summary": "The `convertJsonToMarkdown` function in this code is responsible for converting JSON files containing documentation information into Markdown files. This is done in two main steps: counting the number of files in the project and creating Markdown files for each code file in the project.\n\nFirst, the function uses the `traverseFileSystem` utility to count the number of files in the project. It takes an `AutodocRepoConfig` object as input, which contains information about the project, such as its name, root directory, output directory, and other configuration options. The `traverseFileSystem` utility is called with a `processFile` function that increments the `files` counter for each file encountered.\n\n```javascript\nawait traverseFileSystem({\n inputPath: inputRoot,\n projectName,\n processFile: () => {\n files++;\n return Promise.resolve();\n },\n ignore: [],\n filePrompt,\n folderPrompt,\n contentType,\n targetAudience,\n linkHosted,\n});\n```\n\nNext, the function defines another `processFile` function that reads the content of each JSON file, converts it to a Markdown format, and writes the output to a new Markdown file in the specified output directory. It first checks if the content exists, and if not, it returns early. It then creates the output directory if it doesn't exist, and parses the JSON content into either a `FolderSummary` or a `FileSummary` object, depending on the file name.\n\nThe function then constructs the Markdown content by including a link to the code on GitHub, the summary, and any questions if they exist. Finally, it writes the Markdown content to the output file with the `.md` extension.\n\n```javascript\nconst outputPath = getFileName(markdownFilePath, '.', '.md');\nawait fs.writeFile(outputPath, markdown, 'utf-8');\n```\n\nThe `convertJsonToMarkdown` function is then called again with the new `processFile` function to create the Markdown files for each code file in the project.\n\n```javascript\nawait traverseFileSystem({\n inputPath: inputRoot,\n projectName,\n processFile,\n ignore: [],\n filePrompt,\n folderPrompt,\n contentType,\n targetAudience,\n linkHosted,\n});\n```\n\nIn summary, this code is responsible for converting JSON files containing documentation information into Markdown files, which can be used in the larger Autodoc project to generate documentation for code repositories.", - "questions": "1. **What is the purpose of the `convertJsonToMarkdown` function?**\n\n The `convertJsonToMarkdown` function is responsible for converting JSON files containing summaries and questions about code files in a project into Markdown files. It traverses the file system, reads the JSON files, and creates corresponding Markdown files with the provided information.\n\n2. **How does the `traverseFileSystem` function work and what are its parameters?**\n\n The `traverseFileSystem` function is a utility function that recursively traverses the file system starting from a given input path. It takes an object as a parameter with properties such as `inputPath`, `projectName`, `processFile`, `ignore`, `filePrompt`, `folderPrompt`, `contentType`, `targetAudience`, and `linkHosted`. The function processes each file using the provided `processFile` callback and can be configured to ignore certain files or folders.\n\n3. **What is the purpose of the `processFile` function inside `convertJsonToMarkdown`?**\n\n The `processFile` function is a callback function that is passed to the `traverseFileSystem` function. It is responsible for reading the content of a JSON file, parsing it, and creating a corresponding Markdown file with the summary and questions. It also handles creating the output directory if it doesn't exist and writing the Markdown content to the output file." + "filePath": "src\\cli\\commands\\index\\convertJsonToMarkdown.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\convertJsonToMarkdown.ts", + "summary": "The `convertJsonToMarkdown` function in this code is responsible for converting JSON files containing documentation information into Markdown files. This function is part of the larger Autodoc project, which aims to automate the process of generating documentation for code repositories.\n\nThe function takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the project name, input and output directories, and other settings related to the documentation generation process.\n\nThe code first counts the number of files in the project by traversing the file system using the `traverseFileSystem` utility function. This is done to provide a progress update to the user via the `updateSpinnerText` function.\n\nNext, the `processFile` function is defined, which is responsible for reading the content of each JSON file, parsing it, and converting it into a Markdown format. The function checks if the file has a summary, and if so, it generates the Markdown content with a link to the code on GitHub, the summary, and any questions if present. The output Markdown file is then saved in the specified output directory.\n\nFinally, the `traverseFileSystem` function is called again, this time with the `processFile` function as an argument. This allows the code to process each JSON file in the project and convert it into a Markdown file. Once the process is complete, a success message is displayed to the user using the `spinnerSuccess` function.\n\nExample usage:\n\n```javascript\nconvertJsonToMarkdown({\n name: \"myProject\",\n root: \"./input\",\n output: \"./output\",\n filePrompt: true,\n folderPrompt: true,\n contentType: \"code\",\n targetAudience: \"developers\",\n linkHosted: \"https://github.com/user/myProject\",\n});\n```\n\nThis will convert all JSON files in the `./input` directory into Markdown files and save them in the `./output` directory.", + "questions": "1. **Question:** What is the purpose of the `convertJsonToMarkdown` function and what are the expected inputs?\n **Answer:** The `convertJsonToMarkdown` function is used to convert JSON files to Markdown files for each code file in the project. It takes an `AutodocRepoConfig` object as input, which contains various properties like projectName, root, output, filePrompt, folderPrompt, contentType, targetAudience, and linkHosted.\n\n2. **Question:** How does the `traverseFileSystem` function work and what is its role in this code?\n **Answer:** The `traverseFileSystem` function is a utility function that recursively traverses the file system, starting from the inputPath, and processes each file using the provided `processFile` function. In this code, it is used twice: first to count the number of files in the project, and then to create Markdown files for each code file in the project.\n\n3. **Question:** How are the output directories and Markdown files created, and what is the structure of the generated Markdown content?\n **Answer:** The output directories are created using the `fs.mkdir` function with the `recursive: true` option. The Markdown files are created using the `fs.writeFile` function. The structure of the generated Markdown content includes a link to view the code on GitHub, the summary, and optionally, a list of questions if they exist.", + "checksum": "79c860becf47b9882441682f0213d534" }, { "fileName": "createVectorStore.ts", - "filePath": "src/cli/commands/index/createVectorStore.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/createVectorStore.ts", - "summary": "The code in this file is responsible for processing a directory of text files, splitting the text into chunks, and creating a vector store using the HNSWLib library and OpenAIEmbeddings.\n\nThe `processFile` function takes a file path as input and returns a Promise that resolves to a Document object. It reads the file contents and creates a Document object with the file contents as `pageContent` and the file path as metadata.\n\nThe `processDirectory` function takes a directory path as input and returns a Promise that resolves to an array of Document objects. It reads the files in the directory and calls `processFile` for each file. If a file is a directory, it calls `processDirectory` recursively. The function accumulates all the Document objects in an array and returns it.\n\nThe `RepoLoader` class extends the `BaseDocumentLoader` class and has a constructor that takes a file path as input. It has a `load` method that calls the `processDirectory` function with the file path and returns the resulting array of Document objects.\n\nThe `createVectorStore` function is an async function that takes an AutodocRepoConfig object as input, which contains the root directory and output file path. It creates a RepoLoader instance with the root directory, loads the raw documents, and splits them into chunks using the `RecursiveCharacterTextSplitter` class. It then creates a vector store using the HNSWLib library and OpenAIEmbeddings, and saves the vector store to the output file path.\n\nExample usage:\n\n```javascript\nconst config = {\n root: './data/documents',\n output: './data/vector_store',\n};\n\ncreateVectorStore(config).then(() => {\n console.log('Vector store created successfully');\n});\n```\n\nThis code snippet would process all the text files in the `./data/documents` directory, split the text into chunks, create a vector store using the HNSWLib library and OpenAIEmbeddings, and save the vector store to the `./data/vector_store` file.", - "questions": "1. **Question:** What is the purpose of the `processFile` function and how does it handle errors?\n **Answer:** The `processFile` function reads the content of a file and creates a `Document` object with the file contents and metadata. If there is an error while reading the file, it rejects the promise with the error.\n\n2. **Question:** How does the `processDirectory` function handle nested directories and files?\n **Answer:** The `processDirectory` function iterates through the files in a directory. If it encounters a subdirectory, it calls itself recursively to process the subdirectory. If it encounters a file, it processes the file using the `processFile` function and adds the resulting `Document` object to the `docs` array.\n\n3. **Question:** What is the purpose of the `createVectorStore` function and how does it use the `RepoLoader` class?\n **Answer:** The `createVectorStore` function is responsible for creating a vector store from a given repository. It uses the `RepoLoader` class to load all the documents from the repository, splits the text into chunks using the `RecursiveCharacterTextSplitter`, and then creates a vector store using the `HNSWLib.fromDocuments` method with the `OpenAIEmbeddings`. Finally, it saves the vector store to the specified output path." + "filePath": "src\\cli\\commands\\index\\createVectorStore.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\createVectorStore.ts", + "summary": "The code in this file is responsible for processing a directory of text files, splitting the text into chunks, and creating a vector store using the HNSWLib library and OpenAIEmbeddings. This vector store can be used for efficient similarity search and retrieval of documents in the larger project.\n\nThe `processFile` function reads a file's content and creates a `Document` object with the content and metadata (source file path). It returns a Promise that resolves to the created Document.\n\nThe `processDirectory` function is a recursive function that processes a directory and its subdirectories. It reads the files in the directory, and for each file, it checks if it's a directory or a regular file. If it's a directory, the function calls itself with the new directory path. If it's a file, it calls the `processFile` function to create a Document object. The function returns an array of Document objects.\n\nThe `RepoLoader` class extends the `BaseDocumentLoader` class and has a constructor that takes a file path as an argument. It has a `load` method that calls the `processDirectory` function with the given file path and returns the array of Document objects.\n\nThe `createVectorStore` function is an async function that takes an `AutodocRepoConfig` object as an argument, which contains the root directory and output file path. It creates a `RepoLoader` instance with the root directory and loads the documents using the `load` method. It then creates a `RecursiveCharacterTextSplitter` instance with a specified chunk size and chunk overlap and splits the documents into chunks. Finally, it creates a vector store using the HNSWLib library and OpenAIEmbeddings with the processed documents and saves the vector store to the output file path.\n\nExample usage:\n\n```javascript\nconst config = {\n root: './data/documents',\n output: './data/vector_store',\n};\n\ncreateVectorStore(config).then(() => {\n console.log('Vector store created successfully');\n});\n```", + "questions": "1. **Question:** What is the purpose of the `processFile` function and what does it return?\n **Answer:** The `processFile` function is an asynchronous function that reads the content of a file given its file path, creates a `Document` object with the file contents and metadata (source file path), and returns a Promise that resolves to the created `Document` object.\n\n2. **Question:** How does the `processDirectory` function work and what does it return?\n **Answer:** The `processDirectory` function is an asynchronous function that takes a directory path as input, reads all the files and subdirectories within it, and processes them recursively. It returns a Promise that resolves to an array of `Document` objects created from the files in the directory and its subdirectories.\n\n3. **Question:** What is the purpose of the `createVectorStore` function and how does it work?\n **Answer:** The `createVectorStore` function is an asynchronous function that takes an `AutodocRepoConfig` object as input, which contains the root directory path and output file path. The function loads all the documents from the root directory using the `RepoLoader`, splits the text into chunks using the `RecursiveCharacterTextSplitter`, creates a vector store from the documents using the `HNSWLib` and `OpenAIEmbeddings`, and saves the vector store to the specified output file.", + "checksum": "a3409c4340753a867c72eebef7626fb9" }, { "fileName": "index.ts", - "filePath": "src/cli/commands/index/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/index.ts", - "summary": "The code in this file is responsible for processing a given repository and generating documentation in JSON and Markdown formats, as well as creating vector files for the documentation. It exports a single function `index` that takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository.\n\nThe `index` function performs the following steps:\n\n1. Define the paths for JSON, Markdown, and data output directories within the `output` folder.\n\n2. Process the repository by traversing its files, calling the LLMS (Language Learning Management System) for each file, and creating JSON files with the results. This is done using the `processRepository` function, which takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step.\n\n3. Convert the generated JSON files into Markdown format using the `convertJsonToMarkdown` function. This function also takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step, and a success message is displayed upon completion.\n\n4. Create vector files for the generated Markdown documentation using the `createVectorStore` function. This function also takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step, and a success message is displayed upon completion.\n\nHere's an example of how this code might be used in the larger project:\n\n```javascript\nimport autodoc from './autodoc';\n\nconst config = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n root: './src',\n output: './output',\n llms: 'https://llms.example.com',\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'text',\n targetAudience: 'developers',\n linkHosted: 'https://myproject-docs.example.com',\n};\n\nautodoc.index(config);\n```\n\nThis example would process the `MyProject` repository, generate JSON and Markdown documentation, and create vector files for the documentation, all while providing progress updates through spinner text.", - "questions": "1. **What is the purpose of the `index` function in this code?**\n\n The `index` function is the main entry point for the autodoc project. It processes a given repository, converts the JSON files to markdown, and creates vector files based on the provided configuration options.\n\n2. **What are the different steps involved in processing the repository?**\n\n The processing of the repository involves three main steps: (1) traversing the repository and calling LLMS for each file to create JSON files with the results, (2) converting the JSON files to markdown files, and (3) creating vector files from the markdown files.\n\n3. **What is the role of the `AutodocRepoConfig` type?**\n\n The `AutodocRepoConfig` type is used to define the shape of the configuration object that is passed to the `index` function. It specifies the properties and their types that are required for the function to process the repository, convert JSON to markdown, and create vector files." + "filePath": "src\\cli\\commands\\index\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\index.ts", + "summary": "The code in this file is responsible for processing a given repository and generating documentation in JSON, Markdown, and vector formats. It exports a single function `index` that takes an `AutodocRepoConfig` object as its argument, which contains various configuration options for processing the repository.\n\nThe `index` function performs three main tasks:\n\n1. **Process the repository**: It traverses the repository, calls the LLMS (Language Learning Management System) for each file, and creates JSON files with the results. This is done using the `processRepository` function, which takes the same configuration options as the `index` function. The JSON files are stored in the `output/docs/json/` directory.\n\n ```javascript\n updateSpinnerText('Processing repository...');\n await processRepository({ /* configuration options */ });\n spinnerSuccess();\n ```\n\n2. **Create Markdown files**: It converts the generated JSON files into Markdown files using the `convertJsonToMarkdown` function. This function also takes the same configuration options as the `index` function. The Markdown files are stored in the `output/docs/markdown/` directory.\n\n ```javascript\n updateSpinnerText('Creating markdown files...');\n await convertJsonToMarkdown({ /* configuration options */ });\n spinnerSuccess();\n ```\n\n3. **Create vector files**: It creates vector files from the generated Markdown files using the `createVectorStore` function. This function also takes the same configuration options as the `index` function. The vector files are stored in the `output/docs/data/` directory.\n\n ```javascript\n updateSpinnerText('Create vector files...');\n await createVectorStore({ /* configuration options */ });\n spinnerSuccess();\n ```\n\nThroughout the execution of these tasks, the code uses `updateSpinnerText` and `spinnerSuccess` functions to provide visual feedback on the progress of the tasks.\n\nIn the larger project, this code would be used to automatically generate documentation for a given repository based on the provided configuration options. The generated documentation can then be used for various purposes, such as displaying it on a website or analyzing the content for specific insights.", + "questions": "1. **What does the `index` function do in this code?**\n\n The `index` function is the main entry point for the autodoc project. It takes an `AutodocRepoConfig` object as input and performs three main tasks: processing the repository and creating JSON files, converting JSON files to markdown files, and creating vector files.\n\n2. **What is the purpose of the `processRepository`, `convertJsonToMarkdown`, and `createVectorStore` functions?**\n\n The `processRepository` function traverses the repository, calls LLMS for each file, and creates JSON files with the results. The `convertJsonToMarkdown` function creates markdown files from the generated JSON files. The `createVectorStore` function creates vector files from the markdown files.\n\n3. **What are the different types of prompts (`filePrompt`, `folderPrompt`, `chatPrompt`) used for in this code?**\n\n These prompts are likely used to interact with the user during the processing of the repository. The `filePrompt` might be used to ask the user for input regarding specific files, the `folderPrompt` for input regarding folders, and the `chatPrompt` for general input or feedback during the processing.", + "checksum": "4060b1affae5a6c385cda308b3cd1750" }, { "fileName": "processRepository.ts", - "filePath": "src/cli/commands/index/processRepository.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/processRepository.ts", - "summary": "The `processRepository` function in this code is responsible for processing a given code repository and generating summaries and questions for each file and folder within the repository. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository URL, input and output paths, language models to use, and other settings.\n\nThe function starts by initializing an `APIRateLimit` instance to limit the number of API calls made to the language models. It then defines several helper functions, such as `callLLM` for making API calls, `isModel` for checking if a given model is valid, `processFile` for processing individual files, and `processFolder` for processing folders.\n\nThe `processFile` function reads the content of a file, generates prompts for summaries and questions using the `createCodeFileSummary` and `createCodeQuestions` functions, and selects the best language model to use based on the token length of the prompts. It then calls the language model API to generate the summaries and questions, and saves the results as JSON files in the output directory.\n\nThe `processFolder` function reads the contents of a folder, filters out ignored files, and processes each file and subfolder within the folder. It then generates a summary prompt using the `folderSummaryPrompt` function and calls the language model API to generate a summary for the folder. The folder summary, along with the summaries and questions of its files and subfolders, is saved as a JSON file in the output directory.\n\nThe main part of the `processRepository` function first counts the number of files and folders in the input directory using the `filesAndFolders` function. It then processes each file and folder using the `traverseFileSystem` function, which calls the `processFile` and `processFolder` functions for each file and folder encountered. Finally, the function returns the language models used during processing.\n\nExample usage of the `processRepository` function:\n\n```javascript\nconst autodocConfig = {\n name: 'myProject',\n repositoryUrl: 'https://github.com/user/myProject',\n root: 'src',\n output: 'output',\n llms: [LLMModels.GPT3, LLMModels.GPT4],\n ignore: ['.git', 'node_modules'],\n filePrompt: 'Explain this code file',\n folderPrompt: 'Summarize this folder',\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nprocessRepository(autodocConfig).then((models) => {\n console.log('Processing complete');\n});\n```\n\nThis code would process the `src` directory of the `myProject` repository, generating summaries and questions for each file and folder, and saving the results in the `output` directory.", - "questions": "1. **Question:** What is the purpose of the `processRepository` function and what are its input parameters?\n **Answer:** The `processRepository` function is responsible for processing a code repository by generating summaries and questions for each file and folder in the project. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the project name, repository URL, input and output paths, language models, and other settings. Additionally, it accepts an optional `dryRun` parameter, which, if set to true, will not save the generated summaries and questions to disk.\n\n2. **Question:** How does the code determine the best language model to use for generating summaries and questions?\n **Answer:** The code checks the maximum token length of each available language model (GPT3, GPT4, and GPT432k) and compares it with the token length of the prompts (summary and questions). It selects the first model that can handle the maximum token length and is included in the `llms` array provided in the configuration.\n\n3. **Question:** How does the code handle traversing the file system and processing files and folders?\n **Answer:** The code uses the `traverseFileSystem` utility function to traverse the file system. It takes an object with various configuration options, including the input path, project name, and callbacks for processing files and folders. The `processFile` and `processFolder` functions are passed as callbacks to handle the processing of files and folders, respectively." + "filePath": "src\\cli\\commands\\index\\processRepository.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\processRepository.ts", + "summary": "The `processRepository` function in this code is responsible for generating summaries and questions for code files and folders in a given repository. It takes an `AutodocRepoConfig` object as input, which contains information about the project, repository URL, input and output paths, language models, and other configurations. An optional `dryRun` parameter can be provided to skip actual API calls and file writing.\n\nThe function starts by initializing the encoding and rate limit for API calls. It then defines two main helper functions: `processFile` and `processFolder`. The `processFile` function is responsible for processing individual code files. It reads the file content, calculates a checksum, and checks if reindexing is needed. If reindexing is required, it creates prompts for summaries and questions, selects the appropriate language model based on the input length, and calls the language model API to generate the summaries and questions. The results are then saved to a JSON file in the output directory.\n\nThe `processFolder` function is responsible for processing folders. It reads the folder content, calculates a checksum, and checks if reindexing is needed. If reindexing is required, it reads the summaries and questions of all files and subfolders in the folder, calls the language model API to generate a summary for the folder, and saves the result to a `summary.json` file in the folder.\n\nThe main function then counts the number of files and folders in the project and processes them using the `traverseFileSystem` utility function. It processes all files first, followed by all folders. Finally, it returns the language model usage statistics.\n\nThe `calculateChecksum` function calculates the checksum of a list of file contents, while the `reindexCheck` function checks if reindexing is needed by comparing the new and old checksums of a file or folder.", + "questions": "1. **Question:** What is the purpose of the `processRepository` function and what are its inputs and outputs?\n **Answer:** The `processRepository` function processes a given code repository, generating summaries and questions for each file and folder within the repository. It takes an `AutodocRepoConfig` object and an optional `dryRun` boolean as inputs. The function returns a `Promise` that resolves to an object containing the models used during processing.\n\n2. **Question:** How does the `calculateChecksum` function work and what is its purpose?\n **Answer:** The `calculateChecksum` function takes an array of file contents as input and calculates a checksum for each file using the MD5 hashing algorithm. It then concatenates all the checksums and calculates a final checksum using MD5 again. The purpose of this function is to generate a unique identifier for the contents of the files, which can be used to determine if the files have changed and need to be reprocessed.\n\n3. **Question:** How does the `reindexCheck` function work and when is it used?\n **Answer:** The `reindexCheck` function checks if a summary.json file exists in the given file or folder path and compares the stored checksum with the new checksum to determine if the file or folder needs to be reindexed. It is used in the `processFile` and `processFolder` functions to decide whether to regenerate summaries and questions for a file or folder based on changes in their contents.", + "checksum": "5b3ae9ffad1d4b4a22c6f7fd66bbde6f" }, { "fileName": "prompts.ts", - "filePath": "src/cli/commands/index/prompts.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/index/prompts.ts", - "summary": "The code in this file provides three functions that generate prompts for documentation experts to create summaries and answer questions about code files and folders in a project. These functions are likely used in the larger autodoc project to automate the process of generating documentation for code files and folders.\n\n1. `createCodeFileSummary`: This function takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `filePrompt`. It returns a formatted string prompt for a documentation expert to write a summary of the code file. The prompt includes the file path, project name, content type, and a custom file prompt. For example:\n\n```javascript\ncreateCodeFileSummary('src/example.js', 'autodoc', 'console.log(\"Hello, World!\");', 'JavaScript', 'Write a detailed technical explanation of what this code does.');\n```\n\n2. `createCodeQuestions`: This function takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `targetAudience`. It returns a formatted string prompt for a documentation expert to generate three questions and answers that a target audience might have about the code file. The prompt includes the file path, project name, content type, and target audience. For example:\n\n```javascript\ncreateCodeQuestions('src/example.js', 'autodoc', 'console.log(\"Hello, World!\");', 'JavaScript', 'beginner');\n```\n\n3. `folderSummaryPrompt`: This function takes six parameters: `folderPath`, `projectName`, `files`, `folders`, `contentType`, and `folderPrompt`. It returns a formatted string prompt for a documentation expert to write a summary of the folder and its contents. The prompt includes the folder path, project name, content type, a list of files and their summaries, a list of subfolders and their summaries, and a custom folder prompt. For example:\n\n```javascript\nfolderSummaryPrompt('src/', 'autodoc', [{fileName: 'example.js', summary: 'A simple example file'}], [{folderName: 'utils', summary: 'Utility functions'}], 'JavaScript', 'Write a detailed technical explanation of the folder structure and contents.');\n```\n\nThese functions can be used in the autodoc project to generate prompts for documentation experts, helping to streamline the process of creating documentation for code files and folders.", - "questions": "1. **Question:** What is the purpose of the `createCodeFileSummary` function?\n **Answer:** The `createCodeFileSummary` function generates a string template for a code file summary prompt, which includes the file path, project name, file contents, content type, and a file prompt.\n\n2. **Question:** How does the `createCodeQuestions` function differ from the `createCodeFileSummary` function?\n **Answer:** The `createCodeQuestions` function generates a string template for a code documentation prompt that asks for 3 questions and their answers, while the `createCodeFileSummary` function generates a string template for a code file summary prompt.\n\n3. **Question:** What is the purpose of the `folderSummaryPrompt` function and what parameters does it take?\n **Answer:** The `folderSummaryPrompt` function generates a string template for a folder summary prompt, which includes the folder path, project name, files, folders, content type, and a folder prompt. It takes parameters such as folderPath, projectName, files, folders, contentType, and folderPrompt." + "filePath": "src\\cli\\commands\\index\\prompts.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\index\\prompts.ts", + "summary": "This code defines three utility functions that generate prompts for documentation experts working on a project. These functions are used to create documentation for code files and folders within a project. The generated prompts are in markdown format and include specific instructions for the documentation expert.\n\n1. `createCodeFileSummary`: This function generates a prompt for creating a summary of a code file. It takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `filePrompt`. The function returns a markdown formatted string that includes the file's content and a custom prompt for the documentation expert.\n\nExample usage:\n```javascript\nconst prompt = createCodeFileSummary('path/to/file.js', 'MyProject', 'const x = 10;', 'JavaScript', 'Write a detailed technical explanation of this code.');\n```\n\n2. `createCodeQuestions`: This function generates a prompt for creating a list of questions and answers about a code file. It takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `targetAudience`. The function returns a markdown formatted string that includes the file's content and a custom prompt for the documentation expert to provide questions and answers.\n\nExample usage:\n```javascript\nconst prompt = createCodeQuestions('path/to/file.js', 'MyProject', 'const x = 10;', 'JavaScript', 'beginner');\n```\n\n3. `folderSummaryPrompt`: This function generates a prompt for creating a summary of a folder containing code files and subfolders. It takes six parameters: `folderPath`, `projectName`, `files`, `folders`, `contentType`, and `folderPrompt`. The `files` parameter is an array of `FileSummary` objects, and the `folders` parameter is an array of `FolderSummary` objects. The function returns a markdown formatted string that includes a list of files and folders with their summaries and a custom prompt for the documentation expert.\n\nExample usage:\n```javascript\nconst prompt = folderSummaryPrompt('path/to/folder', 'MyProject', fileSummaries, folderSummaries, 'JavaScript', 'Write a detailed technical explanation of this folder structure.');\n```\n\nThese functions can be used in the larger project to generate documentation tasks for experts, ensuring consistent formatting and instructions across different parts of the project.", + "questions": "1. **What is the purpose of the `createCodeFileSummary` function?**\n\n The `createCodeFileSummary` function generates a string template for a code file summary prompt, which includes the file path, project name, file contents, content type, and a file prompt.\n\n2. **How does the `createCodeQuestions` function differ from the `createCodeFileSummary` function?**\n\n The `createCodeQuestions` function generates a string template for a code documentation prompt that asks for 3 questions and their answers, while the `createCodeFileSummary` function generates a string template for a code file summary prompt.\n\n3. **What is the role of the `folderSummaryPrompt` function?**\n\n The `folderSummaryPrompt` function generates a string template for a folder summary prompt, which includes the folder path, project name, lists of files and folders with their summaries, content type, and a folder prompt.", + "checksum": "e44b82bf4912be69149685a997b6bde3" } ], "folders": [], - "summary": "The code in this folder is responsible for processing a given code repository, generating documentation in JSON and Markdown formats, and creating vector files for the documentation. It provides several functions and utilities to achieve these tasks, such as traversing the file system, calling language models, and converting JSON files to Markdown.\n\nFor example, the `processRepository` function processes a code repository and generates summaries and questions for each file and folder within the repository. It uses helper functions like `callLLM` to make API calls to language models and `processFile` and `processFolder` to process individual files and folders. The results are saved as JSON files in the output directory.\n\nThe `convertJsonToMarkdown` function converts JSON files containing documentation information into Markdown files. It counts the number of files in the project and creates Markdown files for each code file in the project using the `traverseFileSystem` utility.\n\nThe `createVectorStore` function processes a directory of text files, splits the text into chunks, and creates a vector store using the HNSWLib library and OpenAIEmbeddings. It processes the files in the directory and calls `processFile` for each file, creating a vector store and saving it to the output file path.\n\nHere's an example of how this code might be used in the larger project:\n\n```javascript\nimport autodoc from './autodoc';\n\nconst config = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n root: './src',\n output: './output',\n llms: 'https://llms.example.com',\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'text',\n targetAudience: 'developers',\n linkHosted: 'https://myproject-docs.example.com',\n};\n\nautodoc.index(config);\n```\n\nThis example would process the `MyProject` repository, generate JSON and Markdown documentation, and create vector files for the documentation, all while providing progress updates through spinner text.\n\nIn summary, the code in this folder plays a crucial role in the Autodoc project by processing code repositories, generating documentation in various formats, and creating vector files for the documentation. This helps developers to easily generate and maintain documentation for their projects, making it more accessible and understandable for other developers and users.", - "questions": "" + "summary": "The code in this folder is responsible for processing a given repository and generating documentation in JSON, Markdown, and vector formats. It consists of several functions and utilities that work together to automate the documentation generation process.\n\nThe main function, `index`, takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository. It performs three main tasks:\n\n1. **Process the repository**: It calls the `processRepository` function to traverse the repository, generate summaries and questions for code files and folders using the LLMS (Language Learning Management System), and create JSON files with the results. These JSON files are stored in the `output/docs/json/` directory.\n\n2. **Create Markdown files**: It uses the `convertJsonToMarkdown` function to convert the generated JSON files into Markdown files. These Markdown files are stored in the `output/docs/markdown/` directory.\n\n3. **Create vector files**: It calls the `createVectorStore` function to create vector files from the generated Markdown files. These vector files are stored in the `output/docs/data/` directory.\n\nThroughout the execution of these tasks, the code provides visual feedback on the progress of the tasks using `updateSpinnerText` and `spinnerSuccess` functions.\n\nHere's an example of how this code might be used:\n\n```javascript\nindex({\n name: \"myProject\",\n root: \"./input\",\n output: \"./output\",\n filePrompt: true,\n folderPrompt: true,\n contentType: \"code\",\n targetAudience: \"developers\",\n linkHosted: \"https://github.com/user/myProject\",\n});\n```\n\nThis will process the repository located at `./input`, generate documentation in JSON, Markdown, and vector formats, and save the results in the `./output` directory.\n\nThe `prompts.ts` file contains utility functions that generate prompts for documentation experts. These functions create markdown formatted strings with specific instructions for the documentation expert, ensuring consistent formatting and instructions across different parts of the project.\n\nIn summary, the code in this folder automates the process of generating documentation for a given repository based on the provided configuration options. The generated documentation can be used for various purposes, such as displaying it on a website or analyzing the content for specific insights.", + "questions": "", + "checksum": "376f96417f8cbea6a5ab2463268fe4af" }, { "folderName": "init", - "folderPath": ".autodoc/docs/json/src/cli/commands/init", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/init", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\init", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\init", "files": [ { "fileName": "index.ts", - "filePath": "src/cli/commands/init/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/init/index.ts", - "summary": "This code is responsible for initializing and configuring the `autodoc` project. It provides a function `init` that creates a configuration file `autodoc.config.json` with user inputs and default values. The configuration file is essential for the project to function correctly and adapt to different user requirements.\n\nThe `makeConfigTemplate` function generates a default configuration object with pre-defined values. It takes an optional `config` parameter to override the default values. The returned object contains settings such as repository name, URL, output directory, LLM models, and various prompts for generating documentation.\n\nThe `init` function is an asynchronous function that takes an optional `config` parameter. It first checks if a configuration file already exists in the project directory. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf there is no existing configuration file or the user chooses to overwrite, the function prompts the user for the repository name, URL, and LLM models they have access to. These values are then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the `autodoc.config.json` file in the project directory. A success message is displayed, instructing the user to run `doc index` to get started.\n\nHere's an example of how the `init` function is used:\n\n```javascript\nimport { init } from './autodoc';\n\n(async () => {\n await init();\n})();\n```\n\nThis code imports the `init` function and calls it, initializing the `autodoc` project with the user's inputs and default values.", - "questions": "1. **Question:** What is the purpose of the `makeConfigTemplate` function and what does it return?\n **Answer:** The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc project. It takes an optional `config` parameter of type `AutodocRepoConfig` and returns a new `AutodocRepoConfig` object with default values for each property, using the provided `config` values if available.\n\n2. **Question:** How does the `init` function work and what does it do with the user's input?\n **Answer:** The `init` function is an asynchronous function that initializes the Autodoc configuration by prompting the user for input using the `inquirer` package. It takes an optional `config` parameter of type `AutodocRepoConfig` and uses it as the default values for the prompts. After collecting the user's input, it creates a new configuration object using the `makeConfigTemplate` function and writes it to a file named `autodoc.config.json`.\n\n3. **Question:** What are the different LLM models available in the `llms` prompt and how are they used in the configuration?\n **Answer:** The `llms` prompt provides three choices for the user to select the LLM models they have access to: GPT-3.5 Turbo, GPT-3.5 Turbo and GPT-4 8K (Early Access), and GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The selected LLM models are stored in the `llms` property of the `AutodocRepoConfig` object, which can be used later in the project to determine which models to use for generating documentation." + "filePath": "src\\cli\\commands\\init\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\init\\index.ts", + "summary": "This code is responsible for initializing the configuration of the Autodoc project. It provides a template for the configuration and prompts the user to input necessary information to set up the project. The main functionality is exposed through the `init` function, which is an asynchronous function that takes an optional `AutodocRepoConfig` object as an argument.\n\nThe `makeConfigTemplate` function creates a default configuration object with pre-defined values for various properties. It takes an optional `config` parameter and returns a new `AutodocRepoConfig` object with the provided values or default values if not provided.\n\nThe `init` function first checks if an `autodoc.config.json` file already exists in the project root. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nNext, the user is prompted to enter the name of their repository, the GitHub URL of their repository, and the LLMs they have access to. The LLMs are language models used for generating documentation. The user can choose between GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access).\n\nAfter the user provides the necessary information, a new configuration object is created using the `makeConfigTemplate` function with the user's input. The new configuration is then written to the `autodoc.config.json` file in the project root.\n\nFinally, a success message is displayed, instructing the user to run `doc index` to get started with the Autodoc project.\n\nExample usage:\n\n```javascript\nimport { init } from './path/to/this/file';\n\n// Initialize the configuration with default values\nawait init();\n\n// Initialize the configuration with custom values\nawait init({\n name: 'My Custom Repository',\n repositoryUrl: 'https://github.com/user/repo',\n});\n```", + "questions": "1. **What is the purpose of the `makeConfigTemplate` function?**\n\n The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc project. It takes an optional `config` parameter of type `AutodocRepoConfig` and returns a new configuration object with default values for various properties.\n\n2. **How does the `init` function work and when is it called?**\n\n The `init` function is an asynchronous function that initializes the Autodoc configuration by creating an `autodoc.config.json` file in the specified location. It takes an optional `config` parameter of type `AutodocRepoConfig` and prompts the user for input to set the configuration values. It is called when the user wants to set up the Autodoc configuration for their project.\n\n3. **What is the purpose of the `inquirer.prompt` calls in the `init` function?**\n\n The `inquirer.prompt` calls are used to interactively prompt the user for input to set the configuration values for the Autodoc project. The user is asked for the repository name, repository URL, and the LLMs they have access to. The input is then used to create a new configuration object and write it to the `autodoc.config.json` file.", + "checksum": "b93831ff1f4023ab61c3bea963a8a112" } ], "folders": [], - "summary": "The `index.ts` file in the `init` folder is responsible for initializing and configuring the `autodoc` project. It provides an essential function called `init` that creates a configuration file named `autodoc.config.json` with user inputs and default values. This configuration file is crucial for the project to function correctly and adapt to different user requirements.\n\nThe `makeConfigTemplate` function generates a default configuration object with pre-defined values. It takes an optional `config` parameter to override the default values. The returned object contains settings such as repository name, URL, output directory, LLM models, and various prompts for generating documentation.\n\nThe `init` function is an asynchronous function that takes an optional `config` parameter. It first checks if a configuration file already exists in the project directory. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf there is no existing configuration file or the user chooses to overwrite, the function prompts the user for the repository name, URL, and LLM models they have access to. These values are then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the `autodoc.config.json` file in the project directory. A success message is displayed, instructing the user to run `doc index` to get started.\n\nHere's an example of how the `init` function is used:\n\n```javascript\nimport { init } from './autodoc';\n\n(async () => {\n await init();\n})();\n```\n\nThis code imports the `init` function and calls it, initializing the `autodoc` project with the user's inputs and default values. The `init` function is a crucial part of the project, as it sets up the necessary configuration for the project to work correctly. It interacts with other parts of the project by providing the required settings and values, ensuring that the project can adapt to different user requirements and preferences.", - "questions": "" + "summary": "The `index.ts` file in the `.autodoc\\docs\\json\\src\\cli\\commands\\init` folder is responsible for initializing the configuration of the Autodoc project. It provides a template for the configuration and prompts the user to input necessary information to set up the project. The main functionality is exposed through the `init` function, which is an asynchronous function that takes an optional `AutodocRepoConfig` object as an argument.\n\nThe `makeConfigTemplate` function creates a default configuration object with pre-defined values for various properties. It takes an optional `config` parameter and returns a new `AutodocRepoConfig` object with the provided values or default values if not provided.\n\nThe `init` function first checks if an `autodoc.config.json` file already exists in the project root. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nNext, the user is prompted to enter the name of their repository, the GitHub URL of their repository, and the LLMs they have access to. The LLMs are language models used for generating documentation. The user can choose between GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access).\n\nAfter the user provides the necessary information, a new configuration object is created using the `makeConfigTemplate` function with the user's input. The new configuration is then written to the `autodoc.config.json` file in the project root.\n\nFinally, a success message is displayed, instructing the user to run `doc index` to get started with the Autodoc project.\n\nExample usage:\n\n```javascript\nimport { init } from './path/to/this/file';\n\n// Initialize the configuration with default values\nawait init();\n\n// Initialize the configuration with custom values\nawait init({\n name: 'My Custom Repository',\n repositoryUrl: 'https://github.com/user/repo',\n});\n```\n\nThis code is essential for setting up the Autodoc project, as it creates the necessary configuration file and gathers user input to customize the project. It works in conjunction with other parts of the project, such as the CLI and the documentation generation process, which rely on the configuration file to function correctly.", + "questions": "", + "checksum": "4b8fd2b2abaec4959873fc3396c414d8" }, { "folderName": "query", - "folderPath": ".autodoc/docs/json/src/cli/commands/query", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/query", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\query", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\query", "files": [ { "fileName": "createChatChain.ts", - "filePath": "src/cli/commands/query/createChatChain.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/query/createChatChain.ts", - "summary": "This code defines a function `makeChain` that creates a chatbot for answering questions about a software project. The chatbot is built using the `ChatVectorDBQAChain` class, which combines two separate language models: a question generator and a document chain.\n\nThe question generator is an instance of the `LLMChain` class, which uses the OpenAIChat API to generate standalone questions based on a given conversation history. The `CONDENSE_PROMPT` template is used to format the input for the language model.\n\nThe document chain is created using the `loadQAChain` function, which takes an instance of the OpenAIChat API and a prompt template as input. The `makeQAPrompt` function generates this template, which instructs the language model to provide a conversational answer with hyperlinks to the project's GitHub repository. The answer should be tailored to the target audience and include code examples when appropriate.\n\nThe `makeChain` function takes the following parameters:\n\n- `projectName`: The name of the software project.\n- `repositoryUrl`: The URL of the project's GitHub repository.\n- `contentType`: The type of content the chatbot is trained on (e.g., code, documentation).\n- `chatPrompt`: Additional instructions for answering questions about the content.\n- `targetAudience`: The intended audience for the chatbot's answers (e.g., developers, users).\n- `vectorstore`: An instance of the `HNSWLib` class for storing and searching vectors.\n- `llms`: An array of language models (e.g., GPT-3, GPT-4).\n- `onTokenStream`: An optional callback function to handle streaming tokens.\n\nExample usage:\n\n```javascript\nconst chatbot = makeChain(\n \"autodoc\",\n \"https://github.com/autodoc/autodoc\",\n \"code\",\n \"\",\n \"developer\",\n vectorstore,\n [gpt3, gpt4],\n (token) => console.log(token)\n);\n```\n\nThis creates a chatbot that can answer questions about the \"autodoc\" project, using the provided language models and vector store.", - "questions": "1. **Question:** What is the purpose of the `makeChain` function and what are its input parameters?\n **Answer:** The `makeChain` function is used to create a new `ChatVectorDBQAChain` instance, which is responsible for generating questions and answers based on the given input parameters. The input parameters include `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and an optional `onTokenStream` callback function.\n\n2. **Question:** What are the roles of `CONDENSE_PROMPT` and `QA_PROMPT` in the code?\n **Answer:** `CONDENSE_PROMPT` is a template for generating a standalone question from a given chat history and follow-up input. `QA_PROMPT` is a template for generating a conversational answer with hyperlinks back to GitHub, based on the given context and question. Both templates are used in the `LLMChain` and `loadQAChain` instances, respectively.\n\n3. **Question:** How does the `onTokenStream` callback function work and when is it used?\n **Answer:** The `onTokenStream` callback function is an optional parameter in the `makeChain` function. It is used to handle the streaming of tokens generated by the OpenAIChat instance. If provided, it will be called with each new token generated during the chat process, allowing developers to handle or process the tokens in real-time." + "filePath": "src\\cli\\commands\\query\\createChatChain.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\query\\createChatChain.ts", + "summary": "This code defines a function `makeChain` that creates a chatbot for answering questions about a software project called `projectName`. The chatbot is trained on the content of the project, which is located at `repositoryUrl`. The content type of the project is specified by the `contentType` parameter. The chatbot is designed to provide conversational answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate. The target audience for the chatbot is specified by the `targetAudience` parameter.\n\nThe `makeChain` function takes several parameters:\n\n- `projectName`: The name of the software project.\n- `repositoryUrl`: The URL of the project's repository.\n- `contentType`: The type of content the chatbot is trained on.\n- `chatPrompt`: Additional instructions for answering questions about the content type.\n- `targetAudience`: The intended audience for the chatbot's answers.\n- `vectorstore`: An instance of HNSWLib for efficient nearest neighbor search.\n- `llms`: An array of LLMModels, which are language models used for generating answers.\n- `onTokenStream`: An optional callback function that is called when a new token is generated by the language model.\n\nThe `makeChain` function first creates a question generator using the `LLMChain` class. This generator is responsible for rephrasing follow-up questions to be standalone questions. It uses the `CONDENSE_PROMPT` template, which is defined at the beginning of the code.\n\nNext, the function creates a `QA_PROMPT` template using the `makeQAPrompt` function. This template is used to generate answers to the questions in a conversational manner, with hyperlinks back to GitHub and code examples where appropriate.\n\nFinally, the function creates and returns a new instance of the `ChatVectorDBQAChain` class, which combines the question generator and the document chain to create a chatbot that can answer questions about the software project. The chatbot uses the `vectorstore` for efficient nearest neighbor search and the `llms` language models for generating answers. If the `onTokenStream` callback is provided, it will be called when a new token is generated by the language model.", + "questions": "1. **Question:** What is the purpose of the `makeChain` function and what are its input parameters?\n\n **Answer:** The `makeChain` function is used to create a `ChatVectorDBQAChain` instance, which is responsible for generating questions and answers based on the given input parameters. The input parameters include `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and an optional `onTokenStream` function.\n\n2. **Question:** What are the roles of `CONDENSE_PROMPT` and `QA_PROMPT` in this code?\n\n **Answer:** `CONDENSE_PROMPT` is a template for generating standalone questions from a given chat history and follow-up question. `QA_PROMPT` is a template for generating conversational answers with hyperlinks to GitHub, based on the provided context and question. Both templates are used in the `LLMChain` and `loadQAChain` instances, respectively.\n\n3. **Question:** How does the `onTokenStream` function work and when is it used?\n\n **Answer:** The `onTokenStream` function is an optional callback that can be provided to the `makeChain` function. It is used to handle the streaming of tokens generated by the OpenAIChat instance. If provided, it will be called with each new token generated during the chat process.", + "checksum": "6869048a06de62499933b14c37cddc1d" }, { "fileName": "index.ts", - "filePath": "src/cli/commands/query/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/query/index.ts", - "summary": "This code defines a chatbot interface for the Autodoc project, which allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation.\n\nThe code starts by importing necessary libraries and setting up the `marked` library with a custom terminal renderer for displaying Markdown content. It then defines a `chatHistory` array to store the conversation history between the user and the chatbot.\n\nThe `displayWelcomeMessage` function is used to display a welcome message to the user when they start the chatbot. The `clearScreenAndMoveCursorToTop` function clears the terminal screen and moves the cursor to the top.\n\nThe main function, `query`, takes two arguments: `AutodocRepoConfig` and `AutodocUserConfig`. It initializes the `vectorStore` by loading pre-trained embeddings and creates a `chain` object using the `makeChain` function. This chain object is responsible for generating responses based on the user's input.\n\nThe `getQuestion` function uses the `inquirer` library to prompt the user for a question. The main loop of the chatbot starts by getting the user's question and continues until the user types 'exit'. Inside the loop, the code updates the spinner text to 'Thinking...' and calls the `chain` object with the user's question and chat history. The response is then displayed in Markdown format using the `marked` library.\n\nIf an error occurs during the process, the chatbot displays an error message and prompts the user for another question.\n\nExample usage:\n\n```javascript\nquery(repoConfig, userConfig);\n```\n\nThis chatbot interface can be used in the larger Autodoc project to help users navigate and understand the codebase more efficiently by providing a conversational interface for asking questions and receiving answers.", - "questions": "1. **What is the purpose of the `query` function and what are its input parameters?**\n\n The `query` function is used to interact with the chatbot, taking user input and providing responses based on the given codebase. It takes two input parameters: an `AutodocRepoConfig` object containing information about the repository, and an `AutodocUserConfig` object containing user-specific configuration.\n\n2. **How does the `vectorStore` work and what is its role in the code?**\n\n The `vectorStore` is an instance of HNSWLib loaded with data from the specified output directory and using OpenAIEmbeddings. It is used to store and retrieve vector representations of the codebase, which are then used by the `makeChain` function to generate responses to user questions.\n\n3. **How does the chat history work and what is its purpose?**\n\n The `chatHistory` is an array of string pairs, where each pair represents a user question and the corresponding chatbot response. It is used to store the conversation history between the user and the chatbot, allowing the chatbot to provide context-aware responses based on previous interactions." + "filePath": "src\\cli\\commands\\query\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\query\\index.ts", + "summary": "This code defines a chatbot interface for the Autodoc project, which allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a combination of the `inquirer` library for user input, `marked` and `marked-terminal` for rendering Markdown output, and the `langchain` library for handling natural language processing tasks.\n\nThe `query` function is the main entry point for the chatbot. It takes two arguments: an `AutodocRepoConfig` object containing information about the code repository, and an `AutodocUserConfig` object containing user-specific settings. The function initializes a vector store using the `HNSWLib` and `OpenAIEmbeddings` classes, and creates a chat chain using the `makeChain` function.\n\nThe chatbot interface is displayed using the `displayWelcomeMessage` function, which prints a welcome message to the console. The `getQuestion` function is used to prompt the user for a question using the `inquirer` library. The chatbot then enters a loop, where it processes the user's question, generates a response using the chat chain, and displays the response as Markdown in the terminal.\n\nIf an error occurs during the processing of a question, the chatbot will display an error message and continue to prompt the user for a new question. The loop continues until the user types 'exit', at which point the chatbot terminates.\n\nHere's an example of how the `query` function might be used:\n\n```javascript\nimport { query } from './autodoc';\n\nconst repoConfig = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n output: 'path/to/output',\n contentType: 'code',\n chatPrompt: 'Ask me anything about MyProject',\n targetAudience: 'developers',\n};\n\nconst userConfig = {\n llms: 'path/to/llms',\n};\n\nquery(repoConfig, userConfig);\n```\n\nThis example would initialize the chatbot with the specified repository and user configurations, and start the chatbot interface for the user to ask questions about the \"MyProject\" codebase.", + "questions": "1. **What is the purpose of the `query` function in this code?**\n\n The `query` function is responsible for handling user interactions with the chatbot. It takes in an AutodocRepoConfig object and an AutodocUserConfig object, sets up the necessary data structures, and then enters a loop where it prompts the user for questions, processes them, and displays the results.\n\n2. **How does the code handle rendering Markdown text in the terminal?**\n\n The code uses the `marked` library along with a custom `TerminalRenderer` to render Markdown text in the terminal. The `marked` library is configured with the custom renderer using `marked.setOptions({ renderer: new TerminalRenderer() });`.\n\n3. **What is the purpose of the `chatHistory` variable and how is it used?**\n\n The `chatHistory` variable is an array that stores the history of questions and answers in the chat session. It is used to keep track of the conversation between the user and the chatbot. When a new question is asked, the chat history is passed to the `chain.call()` function, and the new question and its corresponding answer are added to the `chatHistory` array.", + "checksum": "19807a33957666422f31136970c37245" } ], "folders": [], - "summary": "The `query` folder in the Autodoc project contains code for creating a chatbot interface that allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation.\n\nIn `createChatChain.ts`, the `makeChain` function is defined, which creates a chatbot using the `ChatVectorDBQAChain` class. This class combines two separate language models: a question generator and a document chain. The question generator is an instance of the `LLMChain` class, which uses the OpenAIChat API to generate standalone questions based on a given conversation history. The document chain is created using the `loadQAChain` function, which takes an instance of the OpenAIChat API and a prompt template as input.\n\nExample usage of `makeChain`:\n\n```javascript\nconst chatbot = makeChain(\n \"autodoc\",\n \"https://github.com/autodoc/autodoc\",\n \"code\",\n \"\",\n \"developer\",\n vectorstore,\n [gpt3, gpt4],\n (token) => console.log(token)\n);\n```\n\nIn `index.ts`, the main chatbot interface is defined. It starts by importing necessary libraries and setting up the `marked` library with a custom terminal renderer for displaying Markdown content. The main function, `query`, takes two arguments: `AutodocRepoConfig` and `AutodocUserConfig`. It initializes the `vectorStore` by loading pre-trained embeddings and creates a `chain` object using the `makeChain` function. This chain object is responsible for generating responses based on the user's input.\n\nThe main loop of the chatbot starts by getting the user's question and continues until the user types 'exit'. Inside the loop, the code updates the spinner text to 'Thinking...' and calls the `chain` object with the user's question and chat history. The response is then displayed in Markdown format using the `marked` library.\n\nExample usage of the chatbot interface:\n\n```javascript\nquery(repoConfig, userConfig);\n```\n\nThis chatbot interface can be used in the larger Autodoc project to help users navigate and understand the codebase more efficiently by providing a conversational interface for asking questions and receiving answers.", - "questions": "" + "summary": "The `query` folder in the Autodoc project contains code for creating a chatbot that can answer questions about a specific software project in a conversational manner. The chatbot is trained on the content of the project and provides answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate.\n\nThe main entry point for the chatbot is the `query` function in `index.ts`. It takes two arguments: an `AutodocRepoConfig` object containing information about the code repository, and an `AutodocUserConfig` object containing user-specific settings. The function initializes a vector store and creates a chat chain using the `makeChain` function from `createChatChain.ts`.\n\nHere's an example of how the `query` function might be used:\n\n```javascript\nimport { query } from './autodoc';\n\nconst repoConfig = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n output: 'path/to/output',\n contentType: 'code',\n chatPrompt: 'Ask me anything about MyProject',\n targetAudience: 'developers',\n};\n\nconst userConfig = {\n llms: 'path/to/llms',\n};\n\nquery(repoConfig, userConfig);\n```\n\nThis example initializes the chatbot with the specified repository and user configurations and starts the chatbot interface for the user to ask questions about the \"MyProject\" codebase.\n\nThe `createChatChain.ts` file defines the `makeChain` function, which creates a chatbot for answering questions about a software project. The chatbot is designed to provide conversational answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate. The target audience for the chatbot is specified by the `targetAudience` parameter.\n\nThe `makeChain` function takes several parameters, such as `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and `onTokenStream`. It first creates a question generator using the `LLMChain` class, then creates a `QA_PROMPT` template using the `makeQAPrompt` function, and finally creates and returns a new instance of the `ChatVectorDBQAChain` class, which combines the question generator and the document chain to create a chatbot that can answer questions about the software project.\n\nIn summary, the code in the `query` folder is responsible for creating a chatbot that can answer questions about a specific software project in a conversational manner. The chatbot uses a combination of natural language processing techniques and efficient nearest neighbor search to generate accurate and relevant answers for the user.", + "questions": "", + "checksum": "9e0d0f111bf588e2df66862dce9db288" }, { "folderName": "user", - "folderPath": ".autodoc/docs/json/src/cli/commands/user", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/user", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\commands\\user", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\commands\\user", "files": [ { "fileName": "index.ts", - "filePath": "src/cli/commands/user/index.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/commands/user/index.ts", - "summary": "This code is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3.5 Turbo, GPT-4 8K, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the specified LLMs or default to GPT-3.5 Turbo if none are provided. This function is used to generate the initial configuration object for the user.\n\nThe `user` function is an asynchronous function that handles the user configuration process. It first checks if a user configuration file already exists. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user decides to continue or if no configuration file exists, the function proceeds to create the necessary directories for the configuration file. It then prompts the user to select the LLMs they have access to using the `inquirer` library. The user can choose from three options:\n\n1. GPT-3.5 Turbo\n2. GPT-3.5 Turbo, GPT-4 8K (Early Access)\n3. GPT-3.5 Turbo, GPT-4 8K (Early Access), GPT-4 32K (Early Access)\n\nAfter the user makes their selection, the new configuration object is created using the `makeConfigTemplate` function with the selected LLMs. The configuration object is then saved to the user configuration file in JSON format.\n\nFinally, the user is informed that the configuration has been saved and they can start querying by running the `doc q` command.", - "questions": "1. **Question:** What is the purpose of the `makeConfigTemplate` function and what does it return?\n **Answer:** The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc user. It takes an optional `config` parameter of type `AutodocUserConfig` and returns a new configuration object with the `llms` property set to the provided value or a default value of `[LLMModels.GPT3]`.\n\n2. **Question:** How does the `user` function handle existing user configuration files?\n **Answer:** The `user` function checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the function prompts the user with a confirmation message to overwrite the existing configuration. If the user chooses not to overwrite, the process exits; otherwise, the function proceeds to create a new configuration.\n\n3. **Question:** What are the available choices for the LLMs in the `user` function, and how are they used to create the new configuration?\n **Answer:** The available choices for LLMs are GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The user can select one of these options, and the corresponding LLM models will be set as the value of the `llms` property in the new configuration object." + "filePath": "src\\cli\\commands\\user\\index.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\commands\\user\\index.ts", + "summary": "This code is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3, GPT-4, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the provided `config` parameter or with GPT-3 as the default LLM. This function is used to generate a new configuration object when needed.\n\nThe main function, `user`, is an asynchronous function that takes an optional `config` parameter. It first checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user configuration file does not exist, the code attempts to create the necessary directories for the file. If there's an error during this process, it logs the error and exits with a non-zero status code.\n\nNext, the user is prompted to select which LLMs they have access to. The available options are GPT-3.5 Turbo, GPT-3.5 Turbo with GPT-4 8K (Early Access), and GPT-3.5 Turbo with GPT-4 8K and GPT-4 32K (Early Access). The user's selection is then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the user configuration file in JSON format. A success message is displayed to the user, indicating that the configuration has been saved and they can start querying using the `doc q` command.\n\nExample usage:\n\n```javascript\nimport { user } from './path/to/this/file';\n\n// Create a new user configuration with default settings\nawait user();\n\n// Update the user configuration with a custom config object\nawait user({ llms: [LLMModels.GPT3, LLMModels.GPT4] });\n```", + "questions": "1. **What is the purpose of the `makeConfigTemplate` function?**\n\n The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc user. It takes an optional `config` parameter and returns an object with a `llms` property, which is an array of LLM models.\n\n2. **How does the `user` function handle existing user configuration files?**\n\n The `user` function checks if a user configuration file already exists using `fsSync.existsSync`. If it does, the user is prompted with a confirmation message to overwrite the existing configuration. If the user chooses not to overwrite, the process exits with a status code of 0.\n\n3. **What are the available choices for LLM models in the `user` function?**\n\n The available choices for LLM models are GPT-3.5 Turbo, GPT-3.5 Turbo and GPT-4 8K (Early Access), and GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The user can select one of these options, and the selected value is stored in the `llms` property of the new configuration object.", + "checksum": "76bc1e6d5d61e24907832c4cac443225" } ], "folders": [], - "summary": "The `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3.5 Turbo, GPT-4 8K, and GPT-4 32K.\n\nThe `makeConfigTemplate` function is used to create a default configuration object with the specified LLMs or default to GPT-3.5 Turbo if none are provided. This function is used to generate the initial configuration object for the user.\n\n```typescript\nfunction makeConfigTemplate(llms: string[]): ConfigTemplate {\n // ...\n}\n```\n\nThe `user` function is an asynchronous function that handles the user configuration process. It first checks if a user configuration file already exists. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\n```typescript\nasync function user(): Promise {\n // ...\n}\n```\n\nIf the user decides to continue or if no configuration file exists, the function proceeds to create the necessary directories for the configuration file. It then prompts the user to select the LLMs they have access to using the `inquirer` library. The user can choose from three options:\n\n1. GPT-3.5 Turbo\n2. GPT-3.5 Turbo, GPT-4 8K (Early Access)\n3. GPT-3.5 Turbo, GPT-4 8K (Early Access), GPT-4 32K (Early Access)\n\nAfter the user makes their selection, the new configuration object is created using the `makeConfigTemplate` function with the selected LLMs. The configuration object is then saved to the user configuration file in JSON format.\n\n```typescript\nconst configTemplate = makeConfigTemplate(selectedLLMs);\nawait fs.promises.writeFile(configPath, JSON.stringify(configTemplate, null, 2));\n```\n\nFinally, the user is informed that the configuration has been saved and they can start querying by running the `doc q` command.\n\nThis code is essential for setting up the user's environment and preferences for the Autodoc project. It ensures that the user has the correct configuration file in place, which is necessary for the proper functioning of the project. The user configuration file is used by other parts of the project to determine which LLMs the user has access to and can query.\n\nFor example, when a user runs the `doc q` command, the project will read the user configuration file to determine which LLMs are available for querying. This ensures that the user only queries the LLMs they have access to, preventing any unauthorized access or usage.\n\nIn summary, the `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project, ensuring that the user has the correct configuration file in place, and allowing the user to select the LLMs they have access to. This is essential for the proper functioning of the project and for maintaining the user's preferences and access to different LLMs.", - "questions": "" + "summary": "The `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project. It allows users to create, update, and save their configuration file, which stores information about their access to different Language Learning Models (LLMs) such as GPT-3, GPT-4, and GPT-4 32K.\n\nThe `makeConfigTemplate` function creates a default configuration object with either the provided `config` parameter or GPT-3 as the default LLM. This function is useful for generating a new configuration object when needed.\n\nThe main function, `user`, is an asynchronous function that takes an optional `config` parameter. It first checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits.\n\nIf the user configuration file does not exist, the code attempts to create the necessary directories for the file. If there's an error during this process, it logs the error and exits with a non-zero status code.\n\nNext, the user is prompted to select which LLMs they have access to. The available options are GPT-3.5 Turbo, GPT-3.5 Turbo with GPT-4 8K (Early Access), and GPT-3.5 Turbo with GPT-4 8K and GPT-4 32K (Early Access). The user's selection is then used to create a new configuration object using the `makeConfigTemplate` function.\n\nFinally, the new configuration object is written to the user configuration file in JSON format. A success message is displayed to the user, indicating that the configuration has been saved and they can start querying using the `doc q` command.\n\nThis code is essential for the Autodoc project as it allows users to manage their access to different LLMs and store their preferences in a configuration file. This configuration file can then be used by other parts of the project to determine which LLMs the user has access to and tailor the querying process accordingly.\n\nExample usage:\n\n```javascript\nimport { user } from './path/to/this/file';\n\n// Create a new user configuration with default settings\nawait user();\n\n// Update the user configuration with a custom config object\nawait user({ llms: [LLMModels.GPT3, LLMModels.GPT4] });\n```\n\nIn summary, the `index.ts` file in the `user` folder is a crucial part of the Autodoc project, allowing users to manage their LLM access and preferences. This configuration is then used by other parts of the project to provide a tailored experience based on the user's access to different LLMs.", + "questions": "", + "checksum": "4b8fd2b2abaec4959873fc3396c414d8" } ], - "summary": "The code in the `src/cli/commands` folder is responsible for handling various command-line tasks in the Autodoc project. It contains several subfolders, each dedicated to a specific command or functionality, such as estimating costs, processing repositories, initializing the project, querying the chatbot, and managing user configurations.\n\nFor instance, the `estimate` subfolder contains a function that allows users to estimate the cost of indexing a given repository before actually processing it. This function takes an `AutodocRepoConfig` object as input and performs a dry run of the `processRepository` function. It then calculates the total estimated cost and displays it to the user. This helps users make informed decisions about whether to proceed with the indexing process or not.\n\n```javascript\nimport { estimate } from './autodoc/estimate';\n\nconst config = {\n // ...configuration options...\n};\n\nestimate(config);\n```\n\nThe `index` subfolder contains code for processing a given code repository, generating documentation in JSON and Markdown formats, and creating vector files for the documentation. It provides several functions and utilities to achieve these tasks, such as traversing the file system, calling language models, and converting JSON files to Markdown.\n\n```javascript\nimport autodoc from './autodoc';\n\nconst config = {\n // ...configuration options...\n};\n\nautodoc.index(config);\n```\n\nThe `init` subfolder is responsible for initializing and configuring the `autodoc` project. It provides an essential function called `init` that creates a configuration file named `autodoc.config.json` with user inputs and default values.\n\n```javascript\nimport { init } from './autodoc';\n\n(async () => {\n await init();\n})();\n```\n\nThe `query` subfolder contains code for creating a chatbot interface that allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation.\n\n```javascript\nquery(repoConfig, userConfig);\n```\n\nThe `user` subfolder is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs).\n\n```typescript\nasync function user(): Promise {\n // ...\n}\n```\n\nIn summary, the code in the `src/cli/commands` folder plays a crucial role in the Autodoc project by providing various command-line functionalities, such as estimating costs, processing repositories, initializing the project, querying the chatbot, and managing user configurations. These functionalities help developers to easily generate and maintain documentation for their projects, making it more accessible and understandable for other developers and users.", - "questions": "" + "summary": "The code in the `.autodoc\\docs\\json\\src\\cli\\commands` folder is responsible for various tasks related to the Autodoc project, such as initializing the configuration, processing repositories, generating documentation, and creating a chatbot for answering questions about a specific software project. The folder contains several subfolders, each with a specific purpose.\n\n### estimate\n\nThe `estimate` function provides an estimated cost of processing a given repository. It takes an `AutodocRepoConfig` object as input and performs a dry run of the repository processing to calculate the estimated cost. Example usage:\n\n```javascript\nimport { estimate } from './path/to/this/file';\n\nconst config = {\n name: 'my-repo',\n repositoryUrl: 'https://github.com/user/my-repo.git',\n root: './',\n output: './output',\n llms: ['en'],\n ignore: ['.git', 'node_modules'],\n filePrompt: true,\n folderPrompt: true,\n chatPrompt: true,\n contentType: 'code',\n targetAudience: 'developers',\n linkHosted: true,\n};\n\nestimate(config);\n```\n\n### index\n\nThe code in this folder processes a given repository and generates documentation in JSON, Markdown, and vector formats. It takes an `AutodocRepoConfig` object as input and performs three main tasks: processing the repository, creating Markdown files, and creating vector files. Example usage:\n\n```javascript\nindex({\n name: \"myProject\",\n root: \"./input\",\n output: \"./output\",\n filePrompt: true,\n folderPrompt: true,\n contentType: \"code\",\n targetAudience: \"developers\",\n linkHosted: \"https://github.com/user/myProject\",\n});\n```\n\n### init\n\nThe `init` function initializes the configuration of the Autodoc project. It prompts the user to input necessary information to set up the project and creates the `autodoc.config.json` file in the project root. Example usage:\n\n```javascript\nimport { init } from './path/to/this/file';\n\n// Initialize the configuration with default values\nawait init();\n\n// Initialize the configuration with custom values\nawait init({\n name: 'My Custom Repository',\n repositoryUrl: 'https://github.com/user/repo',\n});\n```\n\n### query\n\nThe `query` folder contains code for creating a chatbot that can answer questions about a specific software project. The main entry point is the `query` function, which takes an `AutodocRepoConfig` object and an `AutodocUserConfig` object as input. Example usage:\n\n```javascript\nimport { query } from './autodoc';\n\nconst repoConfig = {\n name: 'MyProject',\n repositoryUrl: 'https://github.com/user/myproject',\n output: 'path/to/output',\n contentType: 'code',\n chatPrompt: 'Ask me anything about MyProject',\n targetAudience: 'developers',\n};\n\nconst userConfig = {\n llms: 'path/to/llms',\n};\n\nquery(repoConfig, userConfig);\n```\n\n### user\n\nThe `user` folder manages the user configuration for the Autodoc project. It allows users to create, update, and save their configuration file, which stores information about their access to different Language Learning Models (LLMs). Example usage:\n\n```javascript\nimport { user } from './path/to/this/file';\n\n// Create a new user configuration with default settings\nawait user();\n\n// Update the user configuration with a custom config object\nawait user({ llms: [LLMModels.GPT3, LLMModels.GPT4] });\n```\n\nIn summary, the code in this folder is essential for various tasks related to the Autodoc project, such as initializing the configuration, processing repositories, generating documentation, and creating a chatbot for answering questions about a specific software project.", + "questions": "", + "checksum": "d11f941351fb51140313ada9b52bbf1a" }, { "folderName": "utils", - "folderPath": ".autodoc/docs/json/src/cli/utils", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/utils", + "folderPath": ".autodoc\\docs\\json\\src\\cli\\utils", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\cli\\utils", "files": [ { "fileName": "APIRateLimit.ts", - "filePath": "src/cli/utils/APIRateLimit.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/APIRateLimit.ts", - "summary": "The `APIRateLimit` class in this code snippet is designed to manage and limit the number of concurrent API calls made by the application. This is useful in situations where the API being called has a rate limit or when the application needs to control the number of simultaneous requests to avoid overloading the server.\n\nThe class has a constructor that takes an optional `maxConcurrentCalls` parameter, which defaults to 50. This parameter determines the maximum number of API calls that can be made concurrently.\n\nThe main method of this class is `callApi(apiFunction: () => Promise): Promise`. This method takes a function `apiFunction` that returns a promise and wraps it in a rate-limited execution. The method returns a promise that resolves with the result of the API call or rejects with an error if the call fails.\n\nWhen `callApi` is called, it adds the `executeCall` function to the `queue`. The `executeCall` function is responsible for executing the API call, resolving or rejecting the promise, and managing the `inProgress` counter. After adding the `executeCall` function to the queue, the code checks if there are available slots for concurrent calls by comparing `inProgress` with `maxConcurrentCalls`. If there are available slots, it calls the `dequeueAndExecute` method.\n\nThe `dequeueAndExecute` method is responsible for executing the queued API calls while ensuring that the number of concurrent calls does not exceed the `maxConcurrentCalls` limit. It dequeues the next API call from the queue and executes it if there are available slots for concurrent calls.\n\nHere's an example of how this class can be used in the larger project:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\n\nasync function fetchData(id) {\n // Simulate an API call\n return new Promise((resolve) => setTimeout(() => resolve(`Data for ${id}`), 1000));\n}\n\nasync function getData(id) {\n return apiRateLimiter.callApi(() => fetchData(id));\n}\n\n// Usage\ngetData(1).then(console.log); // Fetches data for ID 1, rate-limited\n```\n\nIn this example, the `APIRateLimit` class is used to limit the number of concurrent calls to the `fetchData` function, which simulates an API call.", - "questions": "1. **What is the purpose of the `APIRateLimit` class?**\n\n The `APIRateLimit` class is designed to manage and limit the number of concurrent API calls to a specified maximum, preventing the application from overwhelming the API with too many requests at once.\n\n2. **How does the `callApi` method work and what is its return type?**\n\n The `callApi` method takes an `apiFunction` as an argument, which is a function that returns a Promise. It adds the API call to a queue and manages the execution of queued calls based on the available slots for concurrent calls. The method returns a Promise of type `T`, where `T` is the expected return type of the `apiFunction`.\n\n3. **How does the `dequeueAndExecute` method work?**\n\n The `dequeueAndExecute` method is responsible for executing the queued API calls. It checks if there are any calls in the queue and if there are available slots for concurrent calls. If both conditions are met, it dequeues the next call from the queue and executes it. This method is called whenever a new API call is added to the queue or when an in-progress call is completed." + "filePath": "src\\cli\\utils\\APIRateLimit.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\APIRateLimit.ts", + "summary": "The `APIRateLimit` class in this code snippet is designed to manage and limit the number of concurrent API calls made by the application. This is useful in situations where the API being called has a rate limit or when the application needs to prevent overwhelming the server with too many requests at once.\n\nThe class constructor takes an optional parameter `maxConcurrentCalls`, which defaults to 50, to set the maximum number of concurrent API calls allowed. It maintains a queue of API calls and keeps track of the number of calls in progress.\n\nThe main method of this class is `callApi(apiFunction: () => Promise): Promise`. It takes a function `apiFunction` that returns a promise and wraps it in a new promise. The purpose of this wrapping is to control the execution of the API calls and ensure that they do not exceed the specified rate limit.\n\nWhen `callApi` is called, the provided `apiFunction` is added to the queue and the `dequeueAndExecute` method is triggered if there are available slots for concurrent calls. The `dequeueAndExecute` method checks if there are any API calls in the queue and if the number of in-progress calls is below the maximum limit. If both conditions are met, it dequeues the next API call and executes it.\n\nThe `executeCall` function inside `callApi` is responsible for actually calling the API function, resolving or rejecting the promise based on the result, and updating the number of in-progress calls. Once an API call is completed, the `dequeueAndExecute` method is called again to process any remaining calls in the queue.\n\nHere's an example of how this class can be used in the larger project:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\n\nasync function fetchSomeData(id) {\n // Call the API using the rate limiter\n const result = await apiRateLimiter.callApi(() => fetch(`https://api.example.com/data/${id}`));\n return result;\n}\n```\n\nIn this example, the `APIRateLimit` class is used to limit the number of concurrent calls to the `fetch` function, ensuring that no more than 10 calls are made at once.", + "questions": "1. **What is the purpose of the `APIRateLimit` class?**\n\n The `APIRateLimit` class is designed to manage and limit the number of concurrent API calls to a specified maximum, preventing the application from overwhelming the API with too many requests at once.\n\n2. **How does the `callApi` method work and what is its return type?**\n\n The `callApi` method takes an `apiFunction` as an argument, which is a function that returns a Promise. It adds the API call to a queue and executes it when there are available slots for concurrent calls. The method returns a Promise of type `T`, where `T` is the expected return type of the `apiFunction`.\n\n3. **How can the maximum number of concurrent calls be configured?**\n\n The maximum number of concurrent calls can be configured by passing a value to the `maxConcurrentCalls` parameter in the constructor of the `APIRateLimit` class. If no value is provided, the default value is set to 50.", + "checksum": "8862552c9cfd8b6db454d45e565081ef" }, { "fileName": "FileUtil.ts", - "filePath": "src/cli/utils/FileUtil.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/FileUtil.ts", - "summary": "This code provides utility functions for handling file and folder paths in the autodoc project. The main purpose of these functions is to generate file names and GitHub URLs for files and folders.\n\n1. `getFileName(input: string, delimiter = '.', extension = '.md'): string`: This function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns a new file name with the given extension. If the delimiter is not found in the input string, the function appends the extension to the input string. If the delimiter is found, the function replaces the part after the last delimiter with the extension. For example:\n\n ```javascript\n getFileName(\"example.txt\"); // returns \"example.md\"\n getFileName(\"example\"); // returns \"example.md\"\n ```\n\n2. `githubFileUrl(githubRoot: string, inputRoot: string, filePath: string, linkHosted: boolean): string`: This function generates a GitHub URL for a file. It takes the GitHub root URL, the input root path, the file path, and a boolean flag `linkHosted`. If `linkHosted` is true, the function returns a URL pointing to the hosted version of the file. If `linkHosted` is false, the function returns a URL pointing to the file in the GitHub repository. For example:\n\n ```javascript\n githubFileUrl(\"https://github.com/user/repo\", \"/input\", \"/input/example.md\", true); // returns \"https://github.com/user/repo/example.md\"\n githubFileUrl(\"https://github.com/user/repo\", \"/input\", \"/input/example.md\", false); // returns \"https://github.com/user/repo/blob/master/example.md\"\n ```\n\n3. `githubFolderUrl(githubRoot: string, inputRoot: string, folderPath: string, linkHosted: boolean): string`: This function is similar to `githubFileUrl`, but it generates a GitHub URL for a folder instead of a file. If `linkHosted` is true, the function returns a URL pointing to the hosted version of the folder. If `linkHosted` is false, the function returns a URL pointing to the folder in the GitHub repository. For example:\n\n ```javascript\n githubFolderUrl(\"https://github.com/user/repo\", \"/input\", \"/input/folder\", true); // returns \"https://github.com/user/repo/folder\"\n githubFolderUrl(\"https://github.com/user/repo\", \"/input\", \"/input/folder\", false); // returns \"https://github.com/user/repo/tree/master/folder\"\n ```\n\nThese utility functions can be used in the autodoc project to generate file names and URLs for documentation files and folders, making it easier to manage and navigate the documentation structure.", - "questions": "1. **What does the `getFileName` function do?**\n\n The `getFileName` function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns the input string with the specified extension, replacing the part after the last occurrence of the delimiter if it exists.\n\n2. **What is the purpose of the `githubFileUrl` and `githubFolderUrl` functions?**\n\n Both `githubFileUrl` and `githubFolderUrl` functions are used to generate URLs for files and folders, respectively, in a GitHub repository. They take a `githubRoot`, `inputRoot`, a `filePath` or `folderPath`, and a `linkHosted` boolean flag. If `linkHosted` is true, the generated URL will point to the hosted version of the file or folder; otherwise, it will point to the file or folder in the GitHub repository.\n\n3. **Why is the `inputRoot.length - 1` used in the `substring` method for both `githubFileUrl` and `githubFolderUrl` functions?**\n\n The `inputRoot.length - 1` is used to remove the `inputRoot` part from the `filePath` or `folderPath` when generating the final URL. This ensures that the generated URL only contains the relevant path relative to the GitHub repository root." + "filePath": "src\\cli\\utils\\FileUtil.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\FileUtil.ts", + "summary": "This code provides utility functions for handling file and folder paths in the autodoc project. The main purpose of these functions is to generate file names and GitHub URLs for documentation files.\n\n1. `getFileName(input, delimiter, extension)`: This function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns a new string with the given extension. If the delimiter is found in the input string, the function removes the part of the string after the last occurrence of the delimiter and appends the extension. If the delimiter is not found, the function simply appends the extension to the input string. This function can be used to generate file names for documentation files with the desired extension.\n\n Example usage:\n\n ```\n getFileName('example.txt'); // returns 'example.md'\n getFileName('example', '_', '.html'); // returns 'example.html'\n ```\n\n2. `githubFileUrl(githubRoot, inputRoot, filePath, linkHosted)`: This function generates a GitHub URL for a file. It takes the GitHub repository root URL, the input root folder path, the file path, and a boolean flag indicating whether the URL should be for the hosted version of the file or the source code. It returns a string with the generated URL.\n\n Example usage:\n\n ```\n githubFileUrl('https://github.com/user/repo', '/input', '/input/example.md', true);\n // returns 'https://github.com/user/repo/example.md'\n ```\n\n3. `githubFolderUrl(githubRoot, inputRoot, folderPath, linkHosted)`: This function is similar to `githubFileUrl`, but it generates a GitHub URL for a folder instead of a file. It takes the same arguments as `githubFileUrl` and returns a string with the generated URL.\n\n Example usage:\n\n ```\n githubFolderUrl('https://github.com/user/repo', '/input', '/input/folder', true);\n // returns 'https://github.com/user/repo/folder'\n ```\n\nThese utility functions can be used throughout the autodoc project to generate file names and GitHub URLs for documentation files and folders, ensuring consistent naming and URL generation across the project.", + "questions": "1. **What is the purpose of the `getFileName` function?**\n\n The `getFileName` function takes an input string, an optional delimiter, and an optional extension, and returns a new string with the given extension. If the delimiter is not found in the input string, the extension is simply appended to the input string. If the delimiter is found, the input string is sliced up to the last delimiter index and the extension is appended.\n\n2. **What are the differences between the `githubFileUrl` and `githubFolderUrl` functions?**\n\n Both functions take the same parameters: `githubRoot`, `inputRoot`, a path (either `filePath` or `folderPath`), and a `linkHosted` boolean. The main difference is in the returned URL: `githubFileUrl` returns a URL pointing to a file in the GitHub repository, while `githubFolderUrl` returns a URL pointing to a folder in the GitHub repository. The URL structure differs slightly, with `/blob/master/` for files and `/tree/master/` for folders.\n\n3. **What is the purpose of the `linkHosted` parameter in the `githubFileUrl` and `githubFolderUrl` functions?**\n\n The `linkHosted` parameter is a boolean that determines whether the returned URL should point to the hosted version of the file or folder on GitHub Pages (if `true`) or to the file or folder within the GitHub repository itself (if `false`). Depending on the value of `linkHosted`, the functions will return different URL structures.", + "checksum": "d1f26fc674b4a9b4a2053642771871c8" }, { "fileName": "LLMUtil.ts", - "filePath": "src/cli/utils/LLMUtil.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/LLMUtil.ts", - "summary": "This code defines and manages different language models (LLMs) and their associated costs for a project. It imports the `OpenAIChat` class from the `langchain/llms` module and the `LLMModelDetails` and `LLMModels` types from the `../../types.js` file.\n\nThe `models` object contains three LLMs: GPT3, GPT4, and GPT432k. Each model has a set of properties, such as `name`, `inputCostPer1KTokens`, `outputCostPer1KTokens`, `maxLength`, and an instance of `OpenAIChat` with specific configurations. The `inputTokens`, `outputTokens`, `succeeded`, `failed`, and `total` properties are initialized to 0.\n\n```javascript\n{\n name: LLMModels.GPT3,\n inputCostPer1KTokens: 0.002,\n outputCostPer1KTokens: 0.002,\n maxLength: 3050,\n llm: new OpenAIChat({ ... }),\n inputTokens: 0,\n outputTokens: 0,\n succeeded: 0,\n failed: 0,\n total: 0,\n}\n```\n\nThe `printModelDetails` function takes an array of `LLMModelDetails` and prints a summary table to the console. It calculates the total cost for each model based on the number of input and output tokens and their respective costs per 1,000 tokens. It also calculates the total file count, succeeded, failed, tokens, and cost across all models.\n\nThe `totalIndexCostEstimate` function calculates the total cost for all models in the input array. It uses the same cost calculation as in `printModelDetails` but returns the total cost as a number.\n\nThese functions can be used in the larger project to manage and analyze the usage and costs of different language models. For example, the `printModelDetails` function can provide a summary of the project's LLM usage, while the `totalIndexCostEstimate` function can help estimate the overall cost of using these models.", - "questions": "1. **Question**: What is the purpose of the `models` object and what are the different models available?\n **Answer**: The `models` object is a record that maps the available LLMModels (GPT3, GPT4, and GPT432k) to their respective details, such as name, input and output costs, maxLength, and an instance of OpenAIChat with the corresponding model.\n\n2. **Question**: How does the `printModelDetails` function work and what information does it display?\n **Answer**: The `printModelDetails` function takes an array of LLMModelDetails and generates an output object containing the model name, file count, succeeded, failed, tokens, and cost. It then calculates the totals for each property and displays the information in a console table.\n\n3. **Question**: What is the purpose of the `totalIndexCostEstimate` function and how does it calculate the total cost?\n **Answer**: The `totalIndexCostEstimate` function calculates the total cost of indexing the given models by iterating through the models array and summing up the input and output costs per 1K tokens for each model." + "filePath": "src\\cli\\utils\\LLMUtil.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\LLMUtil.ts", + "summary": "This code defines and manages different language models (LLMs) and their associated costs for a project that utilizes OpenAI's GPT models. It imports the `OpenAIChat` class from the `langchain/llms` module and the `LLMModelDetails` and `LLMModels` types from the `../../types.js` file.\n\nThe `models` object contains three LLMs: GPT3, GPT4, and GPT432k. Each model has its own properties, such as `name`, `inputCostPer1KTokens`, `outputCostPer1KTokens`, `maxLength`, and an instance of the `OpenAIChat` class with the respective model name and API key. Additionally, each model has counters for input tokens, output tokens, succeeded, failed, and total files processed.\n\nThe `printModelDetails` function takes an array of `LLMModelDetails` and prints a summary table to the console. It calculates the total cost for each model based on the input and output tokens and their respective costs per 1,000 tokens. It also calculates the total file count, succeeded, failed, tokens, and cost across all models.\n\nThe `totalIndexCostEstimate` function calculates the total cost of indexing all models in the input array. It uses the same cost calculation as in `printModelDetails` but returns the total cost as a number.\n\nThese functions can be used in the larger project to manage and analyze the usage and costs of different LLMs. For example, the `printModelDetails` function can be called to display a summary of the models' usage and costs:\n\n```javascript\nimport { models, printModelDetails } from './path/to/this/file';\n\n// Process files with models...\n// Update models' properties...\n\nprintModelDetails(Object.values(models));\n```\n\nAnd the `totalIndexCostEstimate` function can be used to estimate the total cost of indexing all models:\n\n```javascript\nimport { models, totalIndexCostEstimate } from './path/to/this/file';\n\n// Process files with models...\n// Update models' properties...\n\nconst totalCost = totalIndexCostEstimate(Object.values(models));\nconsole.log(`Total cost: ${totalCost}`);\n```", + "questions": "1. **Question:** What is the purpose of the `models` object and how are the different GPT models being used?\n **Answer:** The `models` object is a record that maps different GPT models (GPT3, GPT4, and GPT432k) to their respective details, such as cost per tokens, maximum length, and an instance of `OpenAIChat` with the corresponding model configuration.\n\n2. **Question:** How does the `printModelDetails` function work and what information does it display?\n **Answer:** The `printModelDetails` function takes an array of `LLMModelDetails` as input, processes the information for each model, and then prints a summary table to the console. The table includes the model name, file count, succeeded and failed counts, total tokens, and cost.\n\n3. **Question:** What is the purpose of the `totalIndexCostEstimate` function and how is it calculating the total cost?\n **Answer:** The `totalIndexCostEstimate` function calculates the total cost of processing the given models by iterating through the input `models` array and summing up the costs based on the input and output tokens and their respective costs per 1K tokens.", + "checksum": "f4464cf197f4af827ac0eac950d568fc" }, { - "fileName": "WaitUtil.ts", - "filePath": "src/cli/utils/WaitUtil.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/WaitUtil.ts", - "summary": "The code in this file provides two utility functions, `wait` and `forTrue`, which are designed to help manage asynchronous operations in the larger project. Both functions return a `Promise`, which is a JavaScript object that represents the eventual completion (or failure) of an asynchronous operation and its resulting value.\n\n### wait function\n\nThe `wait` function takes two arguments: `timeoutMs`, which is the number of milliseconds to wait before resolving the promise, and `value`, which is an optional value to be returned when the promise resolves. The function creates a new `Promise` and uses `setTimeout` to resolve it with the given `value` after the specified `timeoutMs` has passed.\n\nExample usage:\n\n```javascript\n// Wait for 2 seconds and then log \"Hello, world!\"\nwait(2000, \"Hello, world!\").then(console.log);\n```\n\n### forTrue function\n\nThe `forTrue` function takes a single argument, `fn`, which is a function that returns a boolean value. The purpose of this function is to repeatedly check if the given function `fn` returns `true`. If it does, the promise resolves with `true`. If the function does not return `true` after 200 checks, the promise is rejected.\n\nThe function uses `setInterval` to repeatedly call the given function `fn` every 50 milliseconds. If `fn` returns `true`, the interval is cleared, and the promise is resolved. If the function has been called 200 times without returning `true`, the promise is rejected.\n\nExample usage:\n\n```javascript\n// Check if a certain element is visible on the page\nconst isElementVisible = () => document.querySelector(\"#my-element\").offsetParent !== null;\n\n// Wait for the element to become visible, then log \"Element is visible!\"\nforTrue(isElementVisible).then(() => console.log(\"Element is visible!\"));\n```\n\nIn summary, these utility functions help manage asynchronous operations by providing a way to wait for a certain amount of time or for a specific condition to be met. They can be used in various parts of the larger project to handle timing and conditional logic in an asynchronous manner.", - "questions": "1. **What is the purpose of the `wait` function?**\n\n The `wait` function is an asynchronous utility function that resolves a promise after a specified timeout in milliseconds. It can be used to introduce a delay in the execution of asynchronous code.\n\n2. **How does the `forTrue` function work and what is its use case?**\n\n The `forTrue` function takes a function `fn` as an argument, which returns a boolean value. It repeatedly checks the result of `fn` every 50 milliseconds until it returns `true` or the maximum number of checks (200) is reached. This function can be used to wait for a specific condition to be met before proceeding with the execution of asynchronous code.\n\n3. **Is there any error handling or customization for the `forTrue` function, such as customizing the interval or maximum number of checks?**\n\n Currently, there is no error handling or customization options for the `forTrue` function. The interval is hardcoded to 50 milliseconds, and the maximum number of checks is hardcoded to 200. To add customization, additional parameters could be added to the function signature and used in the implementation." + "fileName": "traverseFileSystem.ts", + "filePath": "src\\cli\\utils\\traverseFileSystem.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\traverseFileSystem.ts", + "summary": "The `traverseFileSystem` function in this code is an asynchronous function that recursively traverses a given file system, processing files and folders based on the provided parameters. It is designed to be used in the larger project for generating documentation or performing other tasks that require processing files and folders in a directory structure.\n\nThe function takes an object of type `TraverseFileSystemParams` as its input, which contains various properties to control the traversal and processing behavior. These properties include:\n\n- `inputPath`: The root path to start the traversal from.\n- `projectName`: The name of the project being processed.\n- `processFile`: An optional callback function to process a file.\n- `processFolder`: An optional callback function to process a folder.\n- `ignore`: An array of patterns to ignore during traversal.\n- `filePrompt`, `folderPrompt`: Optional prompts for user interaction.\n- `contentType`, `targetAudience`, `linkHosted`: Additional metadata for processing.\n\nThe function first checks if the provided `inputPath` exists using `fs.access`. If the path does not exist, it logs an error message and returns. It then defines a helper function `shouldIgnore` that checks if a given file or folder should be ignored based on the `ignore` patterns.\n\nThe main logic of the function is implemented in the `dfs` (depth-first search) function, which is called recursively to traverse the file system. It reads the contents of the current directory using `fs.readdir`, filters out ignored items, and processes the remaining items.\n\nFor each item, if it is a directory, the `dfs` function is called recursively, and the `processFolder` callback is invoked if provided. If it is a file and its content is text (checked using `isText`), the `processFile` callback is invoked if provided.\n\nThe traversal is performed using `Promise.all` to process items concurrently, improving performance. If an error occurs during traversal, it is logged and rethrown.\n\nHere's an example of how this function might be used in the larger project:\n\n```javascript\nawait traverseFileSystem({\n inputPath: './src',\n projectName: 'myProject',\n processFile: (params) => {\n // Process file logic here\n },\n processFolder: (params) => {\n // Process folder logic here\n },\n ignore: ['node_modules/**', '.git/**'],\n});\n```", + "questions": "1. **What is the purpose of the `traverseFileSystem` function?**\n\n The `traverseFileSystem` function is an asynchronous function that traverses a given file system, processes folders and files based on the provided parameters, and ignores files and folders based on the given ignore patterns.\n\n2. **How does the `shouldIgnore` function work?**\n\n The `shouldIgnore` function takes a file name as input and returns a boolean value indicating whether the file should be ignored or not. It checks if the file name matches any of the ignore patterns provided in the `ignore` parameter using the `minimatch` library.\n\n3. **What is the role of the `dfs` function inside `traverseFileSystem`?**\n\n The `dfs` function is an asynchronous function that performs a depth-first search on the file system starting from the given `currentPath`. It processes folders and files based on the provided parameters and recursively calls itself for each subdirectory found.", + "checksum": "b9e957c10ee6c009864c90aa2fa93763" }, { - "fileName": "traverseFileSystem.ts", - "filePath": "src/cli/utils/traverseFileSystem.ts", - "url": "https://github.com/context-labs/autodoc/src/cli/utils/traverseFileSystem.ts", - "summary": "The `traverseFileSystem` function in this code is an asynchronous function that recursively traverses a given file system, processes folders and files, and filters out ignored files based on provided patterns. It is designed to be used in the larger project for processing and generating documentation for a given project.\n\nThe function takes an object of type `TraverseFileSystemParams` as its input, which contains the following properties:\n\n- `inputPath`: The root folder path to start traversing.\n- `projectName`: The name of the project being documented.\n- `processFile`: An optional callback function to process files.\n- `processFolder`: An optional callback function to process folders.\n- `ignore`: An array of patterns to ignore files and folders.\n- `filePrompt`: An optional prompt for processing files.\n- `folderPrompt`: An optional prompt for processing folders.\n- `contentType`: The type of content being processed.\n- `targetAudience`: The target audience for the documentation.\n- `linkHosted`: A flag indicating if the documentation should be linked to a hosted version.\n\nThe function first checks if the provided `inputPath` exists. If not, it logs an error message and returns. It then defines a helper function `shouldIgnore` that checks if a given file or folder should be ignored based on the `ignore` patterns.\n\nThe main logic of the function is implemented in the `dfs` (depth-first search) function, which recursively traverses the file system. It reads the contents of the current folder, filters out ignored files and folders, and processes them accordingly. If an entry is a directory, it calls `dfs` recursively and then calls the `processFolder` callback if provided. If an entry is a file and is a text file, it calls the `processFile` callback if provided.\n\nHere's an example of how this function might be used in the larger project:\n\n```javascript\nimport { traverseFileSystem } from './autodoc';\n\nconst params = {\n inputPath: './myProject',\n projectName: 'My Project',\n ignore: ['node_modules/**', '.git/**'],\n processFile: async (fileInfo) => {\n // Process the file, e.g., generate documentation\n },\n processFolder: async (folderInfo) => {\n // Process the folder, e.g., create a folder in the output directory\n },\n};\n\ntraverseFileSystem(params);\n```\n\nThis example would traverse the `myProject` folder, ignoring any files and folders within `node_modules` and `.git`, and process the remaining files and folders using the provided callback functions.", - "questions": "1. **What is the purpose of the `traverseFileSystem` function?**\n\n The `traverseFileSystem` function is an asynchronous function that traverses a given file system, processes files and folders based on the provided parameters, and ignores files and folders that match the specified ignore patterns.\n\n2. **How does the `shouldIgnore` function work?**\n\n The `shouldIgnore` function takes a file or folder name as input and returns a boolean value indicating whether the file or folder should be ignored based on the provided ignore patterns. It uses the `minimatch` library to check if the file or folder name matches any of the ignore patterns.\n\n3. **What is the role of the `dfs` function inside `traverseFileSystem`?**\n\n The `dfs` function is an asynchronous function that performs a depth-first search on the file system starting from the given `currentPath`. It processes folders and files based on the provided parameters and recursively calls itself for each subdirectory." + "fileName": "WaitUtil.ts", + "filePath": "src\\cli\\utils\\WaitUtil.ts", + "url": "https://github.com/context-labs/autodoc/src\\cli\\utils\\WaitUtil.ts", + "summary": "The code in this file provides two utility functions, `wait` and `forTrue`, which are designed to help manage asynchronous operations in the larger project. Both functions return a `Promise`, making them suitable for use with `async/await` syntax.\n\n### wait\n\nThe `wait` function takes two arguments: `timeoutMs`, a number representing the desired waiting time in milliseconds, and an optional `value` that defaults to `null`. It returns a `Promise` that resolves with the provided `value` after the specified `timeoutMs` has elapsed. This function can be used to introduce a delay in the execution of asynchronous code.\n\nExample usage:\n\n```javascript\nasync function delayedEcho() {\n console.log(\"Start\");\n await wait(1000, \"Hello\");\n console.log(\"End\");\n}\n\ndelayedEcho(); // Output: Start -> (1 second delay) -> End\n```\n\n### forTrue\n\nThe `forTrue` function takes a single argument, `fn`, which is a function that returns a boolean value. It returns a `Promise` that resolves with `true` when the provided function `fn` returns `true`. The function `fn` is checked every 50 milliseconds, up to a maximum of 200 times (i.e., 10 seconds). If `fn` does not return `true` within this time, the `Promise` is rejected.\n\nThis function can be used to wait for a specific condition to be met before continuing the execution of asynchronous code.\n\nExample usage:\n\n```javascript\nlet condition = false;\n\nsetTimeout(() => {\n condition = true;\n}, 3000);\n\nasync function waitForCondition() {\n console.log(\"Waiting for condition...\");\n await forTrue(() => condition);\n console.log(\"Condition met!\");\n}\n\nwaitForCondition(); // Output: Waiting for condition... -> (3 second delay) -> Condition met!\n```\n\nIn summary, this file provides two utility functions that help manage asynchronous operations by introducing delays and waiting for specific conditions to be met. These functions can be used in the larger project to control the flow of asynchronous code execution.", + "questions": "1. **What is the purpose of the `wait` function?**\n\n The `wait` function is an asynchronous utility function that resolves a promise after a specified timeout in milliseconds, optionally returning a value when the promise is resolved.\n\n2. **How does the `forTrue` function work?**\n\n The `forTrue` function takes a function `fn` as an argument, which should return a boolean value. It checks the result of `fn` every 50 milliseconds and resolves the promise when `fn` returns `true`. If `fn` does not return `true` after 200 attempts, the promise is rejected.\n\n3. **What is the use case for the `forTrue` function?**\n\n The `forTrue` function can be used to wait for a certain condition to be met before proceeding with the execution of the code. This can be useful in situations where you need to wait for an asynchronous operation to complete or a specific state to be reached before continuing.", + "checksum": "bf4acebb6c2736274af75a8c8441c9d2" } ], "folders": [], - "summary": "The code in the `.autodoc/docs/json/src/cli/utils` folder provides utility functions and classes that help manage various aspects of the autodoc project, such as rate-limiting API calls, handling file and folder paths, managing language models, and traversing file systems.\n\n`APIRateLimit.ts` contains the `APIRateLimit` class, which is designed to manage and limit the number of concurrent API calls made by the application. This is useful when the API being called has a rate limit or when the application needs to control the number of simultaneous requests to avoid overloading the server. For example:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\nasync function getData(id) {\n return apiRateLimiter.callApi(() => fetchData(id));\n}\ngetData(1).then(console.log); // Fetches data for ID 1, rate-limited\n```\n\n`FileUtil.ts` provides utility functions for handling file and folder paths, such as generating file names and GitHub URLs for files and folders. These functions can be used to manage and navigate the documentation structure. For example:\n\n```javascript\ngetFileName(\"example.txt\"); // returns \"example.md\"\ngithubFileUrl(\"https://github.com/user/repo\", \"/input\", \"/input/example.md\", true); // returns \"https://github.com/user/repo/example.md\"\n```\n\n`LLMUtil.ts` defines and manages different language models (LLMs) and their associated costs for a project. It provides functions like `printModelDetails` and `totalIndexCostEstimate` to manage and analyze the usage and costs of different language models. For example, the `printModelDetails` function can provide a summary of the project's LLM usage, while the `totalIndexCostEstimate` function can help estimate the overall cost of using these models.\n\n`WaitUtil.ts` provides two utility functions, `wait` and `forTrue`, which help manage asynchronous operations in the larger project. They can be used in various parts of the project to handle timing and conditional logic in an asynchronous manner. For example:\n\n```javascript\nwait(2000, \"Hello, world!\").then(console.log); // Waits for 2 seconds and then logs \"Hello, world!\"\nforTrue(isElementVisible).then(() => console.log(\"Element is visible!\")); // Waits for an element to become visible, then logs \"Element is visible!\"\n```\n\n`traverseFileSystem.ts` contains the `traverseFileSystem` function, which recursively traverses a given file system, processes folders and files, and filters out ignored files based on provided patterns. It is designed to be used for processing and generating documentation for a given project. For example:\n\n```javascript\nconst params = {\n inputPath: './myProject',\n projectName: 'My Project',\n ignore: ['node_modules/**', '.git/**'],\n processFile: async (fileInfo) => {\n // Process the file, e.g., generate documentation\n },\n processFolder: async (folderInfo) => {\n // Process the folder, e.g., create a folder in the output directory\n },\n};\ntraverseFileSystem(params);\n```\n\nIn summary, the code in this folder provides various utility functions and classes that help manage different aspects of the autodoc project, making it easier to handle tasks such as rate-limiting, file and folder management, language model management, asynchronous operations, and file system traversal.", - "questions": "" + "summary": "The `.autodoc\\docs\\json\\src\\cli\\utils` folder contains utility functions and classes that assist in managing API rate limits, handling file and folder paths, managing language models, traversing file systems, and controlling asynchronous operations. These utilities can be used throughout the autodoc project to ensure consistent behavior and improve code organization.\n\n`APIRateLimit.ts` provides the `APIRateLimit` class, which manages and limits the number of concurrent API calls made by the application. This is useful when working with rate-limited APIs or preventing server overload. Example usage:\n\n```javascript\nconst apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls\nasync function fetchSomeData(id) {\n const result = await apiRateLimiter.callApi(() => fetch(`https://api.example.com/data/${id}`));\n return result;\n}\n```\n\n`FileUtil.ts` offers utility functions for generating file names and GitHub URLs for documentation files. These functions ensure consistent naming and URL generation across the project. Example usage:\n\n```javascript\ngetFileName('example.txt'); // returns 'example.md'\ngithubFileUrl('https://github.com/user/repo', '/input', '/input/example.md', true); // returns 'https://github.com/user/repo/example.md'\n```\n\n`LLMUtil.ts` defines and manages different language models (LLMs) and their associated costs for a project utilizing OpenAI's GPT models. Functions like `printModelDetails` and `totalIndexCostEstimate` can be used to manage and analyze the usage and costs of different LLMs. Example usage:\n\n```javascript\nimport { models, printModelDetails } from './path/to/this/file';\nprintModelDetails(Object.values(models));\nconst totalCost = totalIndexCostEstimate(Object.values(models));\nconsole.log(`Total cost: ${totalCost}`);\n```\n\n`traverseFileSystem.ts` contains the `traverseFileSystem` function, which recursively traverses a given file system, processing files and folders based on provided parameters. This is useful for generating documentation or performing tasks that require processing files and folders in a directory structure. Example usage:\n\n```javascript\nawait traverseFileSystem({\n inputPath: './src',\n projectName: 'myProject',\n processFile: (params) => { /* Process file logic */ },\n processFolder: (params) => { /* Process folder logic */ },\n ignore: ['node_modules/**', '.git/**'],\n});\n```\n\n`WaitUtil.ts` provides two utility functions, `wait` and `forTrue`, which help manage asynchronous operations by introducing delays and waiting for specific conditions to be met. These functions can be used to control the flow of asynchronous code execution. Example usage:\n\n```javascript\nasync function delayedEcho() {\n console.log(\"Start\");\n await wait(1000, \"Hello\");\n console.log(\"End\");\n}\n\nasync function waitForCondition() {\n console.log(\"Waiting for condition...\");\n await forTrue(() => condition);\n console.log(\"Condition met!\");\n}\n```\n\nIn summary, the utilities in this folder enhance the autodoc project by providing consistent behavior, improving code organization, and managing various aspects of the project, such as API rate limits, file and folder paths, language models, file system traversal, and asynchronous operations.", + "questions": "", + "checksum": "a4b7088863601cd326edbec7726eefe7" } ], - "summary": "The `spinner.ts` file in the `.autodoc/docs/json/src/cli` folder provides a utility for managing a command-line spinner using the `ora` library. The spinner is a visual indicator that displays a series of characters in a loop, giving the user feedback that a process is running in the background. The code exports several functions to control the spinner's behavior, such as updating the text, stopping the spinner, and displaying success, error, or informational messages.\n\nThe `spinner` object is created as a singleton to ensure that there is only one instance of the spinner at any given time. This prevents multiple spinners from being displayed simultaneously, which could cause confusion for the user. The spinner is configured to use the 'dots' style.\n\nThe `updateSpinnerText` function is used to update the spinner's text. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the given message. For example:\n\n```javascript\nupdateSpinnerText('Loading data...');\n```\n\nThe `stopSpinner` function stops the spinner if it is currently spinning:\n\n```javascript\nstopSpinner();\n```\n\nThe `spinnerError`, `spinnerSuccess`, and `spinnerInfo` functions are used to display error, success, and informational messages, respectively. These functions first check if the spinner is spinning and then call the appropriate `ora` method to display the message with the corresponding status symbol (e.g., a red cross for errors, a green checkmark for success, etc.):\n\n```javascript\nspinnerError('An error occurred');\nspinnerSuccess('Operation completed successfully');\nspinnerInfo('Please wait...');\n```\n\nIn the larger project, this utility can be used to provide a consistent and user-friendly interface for displaying progress and status messages during long-running tasks or processes.", - "questions": "" + "summary": "The code in the `spinner.ts` file, located in the `.autodoc\\docs\\json\\src\\cli` folder, is responsible for managing a spinner, a visual element that indicates a background process is running. The spinner is created using the `ora` library, which provides a simple and customizable way to create spinners for command-line interfaces.\n\nThe module exports several functions to interact with the spinner:\n\n1. `updateSpinnerText(message: string)`: Updates the spinner's text with the provided message. If the spinner is already spinning, it simply updates the text; otherwise, it starts the spinner with the new message.\n\n Example usage:\n ```javascript\n updateSpinnerText('Loading data...');\n ```\n\n2. `stopSpinner()`: Stops the spinner if it is currently spinning.\n\n Example usage:\n ```javascript\n stopSpinner();\n ```\n\n3. `spinnerError(message?: string)`: Stops the spinner and marks it as failed with an optional error message. It only takes effect if the spinner is currently spinning.\n\n Example usage:\n ```javascript\n spinnerError('Failed to load data');\n ```\n\n4. `spinnerSuccess(message?: string)`: Stops the spinner and marks it as successful with an optional success message. It only takes effect if the spinner is currently spinning.\n\n Example usage:\n ```javascript\n spinnerSuccess('Data loaded successfully');\n ```\n\n5. `spinnerInfo(message: string)`: Displays an informational message without affecting the spinner's state.\n\n Example usage:\n ```javascript\n spinnerInfo('Connecting to server...');\n ```\n\nIn the larger project, this module can be used to provide visual feedback to users when a background process is running, such as loading data, connecting to a server, or performing a complex calculation. By using the exported functions, developers can easily update the spinner's text, stop it, or change its state to indicate success, failure, or display informational messages.", + "questions": "", + "checksum": "e9d728bc3244f1081af08994f5fb1cd0" }, { "folderName": "langchain", - "folderPath": ".autodoc/docs/json/src/langchain", - "url": "https://github.com/context-labs/autodoc/.autodoc/docs/json/src/langchain", + "folderPath": ".autodoc\\docs\\json\\src\\langchain", + "url": "https://github.com/context-labs/autodoc/.autodoc\\docs\\json\\src\\langchain", "files": [ { "fileName": "hnswlib.ts", - "filePath": "src/langchain/hnswlib.ts", - "url": "https://github.com/context-labs/autodoc/src/langchain/hnswlib.ts", - "summary": "The `HNSWLib` class in this code is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. It extends the `SaveableVectorStore` class and provides methods for adding documents, searching for similar documents, and saving/loading the index.\n\nThe constructor takes an `Embeddings` object and an `HNSWLibArgs` object as arguments. The `Embeddings` object is used to convert text documents into numerical vectors, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing document metadata.\n\nThe `addDocuments` method takes an array of `Document` objects, converts their text content into numerical vectors using the `Embeddings` object, and adds the vectors to the HNSW index. The `addVectors` method is responsible for initializing the index, resizing it if necessary, and adding the vectors and their corresponding metadata to the `InMemoryDocstore`.\n\nThe `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents in the index along with their similarity scores. It checks if the query vector has the correct dimensions and if `k` is within the valid range before performing the search.\n\nThe `save` and `load` methods allow the HNSW index and its associated metadata to be saved to and loaded from a specified directory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of text strings or `Document` objects, respectively.\n\nExample usage:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst hnswLib = await HNSWLib.fromTexts(texts, metadatas, embeddings);\n\nconst queryVector = await embeddings.embedText(\"example query\");\nconst similarDocuments = await hnswLib.similaritySearchVectorWithScore(queryVector, 5);\n```\n\nIn the larger project, this class can be used to efficiently store and search for similar documents based on their embeddings, which can be useful for tasks such as document clustering, nearest neighbor search, and recommendation systems.", - "questions": "1. **Question:** What is the purpose of the `HNSWLib` class and how does it relate to the `SaveableVectorStore` class?\n **Answer:** The `HNSWLib` class is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. It extends the `SaveableVectorStore` class, which provides a base class for vector stores that can be saved and loaded from disk.\n\n2. **Question:** How does the `addDocuments` method work and what is its purpose?\n **Answer:** The `addDocuments` method takes an array of `Document` objects, extracts their `pageContent`, and embeds them into vectors using the `embedDocuments` method from the `embeddings` object. It then adds these vectors and the corresponding documents to the HNSW index and the `docstore` respectively.\n\n3. **Question:** How does the `similaritySearchVectorWithScore` method work and what does it return?\n **Answer:** The `similaritySearchVectorWithScore` method takes a query vector and a number `k` as input. It checks if the query vector has the same length as the number of dimensions and if `k` is not greater than the number of elements in the index. It then performs a k-nearest neighbors search on the HNSW index using the query vector and returns an array of `[Document, number]` tuples, where each tuple contains a document from the `docstore` and its corresponding distance score to the query vector." + "filePath": "src\\langchain\\hnswlib.ts", + "url": "https://github.com/context-labs/autodoc/src\\langchain\\hnswlib.ts", + "summary": "The `HNSWLib` class in this code is a specialized vector store that uses the Hierarchical Navigable Small World (HNSW) algorithm for efficient similarity search. It is built on top of the `hnswlib-node` library and extends the `SaveableVectorStore` class. The main purpose of this class is to store and search for documents based on their embeddings, which are high-dimensional vectors representing the documents' content.\n\nThe constructor of the `HNSWLib` class takes an `Embeddings` object and an `HNSWLibArgs` object as arguments. The `Embeddings` object is used to convert documents into their corresponding vector representations, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing the documents.\n\nThe `addDocuments` method takes an array of `Document` objects, converts them into embeddings using the `Embeddings` object, and adds them to the HNSW index. The `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents along with their similarity scores.\n\nThe `save` and `load` methods allow for persisting the HNSW index, document store, and configuration options to disk and loading them back into memory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of texts or documents, respectively.\n\nHere's an example of how to use the `HNSWLib` class:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst args = { space: 'cosine' };\nconst hnswLib = new HNSWLib(embeddings, args);\n\n// Add documents to the index\nawait hnswLib.addDocuments(documents);\n\n// Perform a similarity search\nconst queryVector = /* ... */;\nconst k = 10;\nconst results = await hnswLib.similaritySearchVectorWithScore(queryVector, k);\n```\n\nIn the larger project, the `HNSWLib` class can be used to efficiently store and search for documents based on their content similarity, which can be useful for tasks such as document clustering, recommendation systems, or information retrieval.", + "questions": "1. **Question**: What is the purpose of the `HNSWLib` class and how does it relate to the `SaveableVectorStore` class?\n **Answer**: The `HNSWLib` class is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. It extends the `SaveableVectorStore` class, which provides a base class for vector stores that can be saved and loaded from disk.\n\n2. **Question**: How does the `addDocuments` method work and what is its purpose?\n **Answer**: The `addDocuments` method takes an array of `Document` objects, extracts their `pageContent`, and embeds them using the provided `Embeddings` instance. It then adds the resulting vectors and documents to the HNSW index and the `InMemoryDocstore`, respectively.\n\n3. **Question**: How does the `similaritySearchVectorWithScore` method work and what does it return?\n **Answer**: The `similaritySearchVectorWithScore` method takes a query vector and a number `k` as input, and searches for the `k` most similar vectors in the HNSW index. It returns an array of tuples, where each tuple contains a `Document` object and its corresponding similarity score to the query vector.", + "checksum": "4725f6bfddda88355b55a980a1eae582" } ], "folders": [], - "summary": "The `hnswlib.ts` file in the `.autodoc/docs/json/src/langchain` folder contains the `HNSWLib` class, which is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. This class is designed to efficiently store and search for similar documents based on their embeddings, making it useful for tasks such as document clustering, nearest neighbor search, and recommendation systems.\n\nThe `HNSWLib` class extends the `SaveableVectorStore` class and provides methods for adding documents, searching for similar documents, and saving/loading the index. It takes an `Embeddings` object and an `HNSWLibArgs` object as arguments in its constructor. The `Embeddings` object is responsible for converting text documents into numerical vectors, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing document metadata.\n\nThe `addDocuments` method accepts an array of `Document` objects, converts their text content into numerical vectors using the `Embeddings` object, and adds the vectors to the HNSW index. The `addVectors` method initializes the index, resizes it if necessary, and adds the vectors and their corresponding metadata to the `InMemoryDocstore`.\n\nThe `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents in the index along with their similarity scores. It checks if the query vector has the correct dimensions and if `k` is within the valid range before performing the search.\n\nThe `save` and `load` methods allow the HNSW index and its associated metadata to be saved to and loaded from a specified directory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of text strings or `Document` objects, respectively.\n\nHere's an example of how this code might be used:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst hnswLib = await HNSWLib.fromTexts(texts, metadatas, embeddings);\n\nconst queryVector = await embeddings.embedText(\"example query\");\nconst similarDocuments = await hnswLib.similaritySearchVectorWithScore(queryVector, 5);\n```\n\nIn the larger project, the `HNSWLib` class can be integrated with other components to build efficient and scalable systems for document similarity search, clustering, and recommendations based on text embeddings.", - "questions": "" + "summary": "The `hnswlib.ts` file in the `.autodoc\\docs\\json\\src\\langchain` folder contains the `HNSWLib` class, which is a specialized vector store utilizing the Hierarchical Navigable Small World (HNSW) algorithm for efficient similarity search. This class is built on top of the `hnswlib-node` library and extends the `SaveableVectorStore` class. Its primary purpose is to store and search for documents based on their embeddings, which are high-dimensional vectors representing the documents' content.\n\nThe `HNSWLib` class constructor takes an `Embeddings` object and an `HNSWLibArgs` object as arguments. The `Embeddings` object is responsible for converting documents into their corresponding vector representations, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing the documents.\n\nThe `addDocuments` method accepts an array of `Document` objects, converts them into embeddings using the `Embeddings` object, and adds them to the HNSW index. The `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents along with their similarity scores.\n\nThe `save` and `load` methods enable persisting the HNSW index, document store, and configuration options to disk and loading them back into memory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of texts or documents, respectively.\n\nIn the larger project, the `HNSWLib` class can be employed to efficiently store and search for documents based on their content similarity, which can be beneficial for tasks such as document clustering, recommendation systems, or information retrieval.\n\nHere's an example of how to use the `HNSWLib` class:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst args = { space: 'cosine' };\nconst hnswLib = new HNSWLib(embeddings, args);\n\n// Add documents to the index\nawait hnswLib.addDocuments(documents);\n\n// Perform a similarity search\nconst queryVector = /* ... */;\nconst k = 10;\nconst results = await hnswLib.similaritySearchVectorWithScore(queryVector, k);\n```\n\nThis code snippet demonstrates how to create an `HNSWLib` instance, add documents to the index, and perform a similarity search. The results can then be used for various purposes, such as finding related documents or generating recommendations based on content similarity.", + "questions": "", + "checksum": "ccbe47bddb9d048f35d29fb2d8c04d7f" } ], - "summary": "The `.autodoc/docs/json/src` folder contains the core components of the Autodoc project, which aims to automatically generate documentation for a given code repository using OpenAI's language models (LLMs). The main files in this folder are `const.ts`, `index.ts`, and `types.ts`.\n\n`const.ts` manages the user configuration file for the Autodoc project. It defines the location and name of the user configuration file, ensuring that it is stored in a user-specific directory and follows a standard naming convention. This allows the Autodoc project to easily manage user-specific settings and preferences.\n\n`index.ts` is a CLI (Command Line Interface) tool for the Autodoc project, which simplifies the process of generating documentation for a codebase. It provides an easy-to-use interface for managing configurations and running the Autodoc project's core functionalities. The main commands supported are `init`, `estimate`, `index`, `user`, and `q`. For example:\n\n```bash\nautodoc init\nautodoc estimate\nautodoc index\nautodoc user\nautodoc q\n```\n\n`types.ts` defines the types and interfaces for the Autodoc project, providing the foundation for processing code repositories and generating documentation using OpenAI's language models. It includes types such as `AutodocUserConfig`, `AutodocRepoConfig`, `FileSummary`, `FolderSummary`, and more.\n\nThe `cli` subfolder contains the `spinner.ts` file, which provides a utility for managing a command-line spinner using the `ora` library. This utility can be used to provide a consistent and user-friendly interface for displaying progress and status messages during long-running tasks or processes. For example:\n\n```javascript\nupdateSpinnerText('Loading data...');\nstopSpinner();\nspinnerError('An error occurred');\nspinnerSuccess('Operation completed successfully');\nspinnerInfo('Please wait...');\n```\n\nThe `langchain` subfolder contains the `hnswlib.ts` file, which implements a vector store using the Hierarchical Navigable Small World (HNSW) algorithm. This class is designed to efficiently store and search for similar documents based on their embeddings, making it useful for tasks such as document clustering, nearest neighbor search, and recommendation systems. For example:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst hnswLib = await HNSWLib.fromTexts(texts, metadatas, embeddings);\n\nconst queryVector = await embeddings.embedText(\"example query\");\nconst similarDocuments = await hnswLib.similaritySearchVectorWithScore(queryVector, 5);\n```\n\nIn summary, the code in this folder provides the core components and utilities for the Autodoc project, enabling the automatic generation of documentation for code repositories using OpenAI's language models. The CLI tool simplifies the process, while the types and interfaces lay the foundation for processing and generating documentation. The additional utilities, such as the spinner and HNSWLib, enhance the user experience and provide efficient search capabilities.", - "questions": "" + "summary": "The `.autodoc\\docs\\json\\src` folder contains the core components of the autodoc project, which is designed to automatically generate documentation for a given code repository using OpenAI's language models (LLMs). The folder consists of three main files: `const.ts`, `index.ts`, and `types.ts`, as well as two subfolders: `cli` and `langchain`.\n\n`const.ts` defines the name and file path of the user configuration file for the autodoc project. This file stores user-specific settings in JSON format. Other parts of the project can easily access and use these constants to read or write user-specific settings. For example:\n\n```javascript\nimport { userConfigFilePath } from './path/to/this/file';\n\n// Read user configuration from the file\nconst userConfig = JSON.parse(fs.readFileSync(userConfigFilePath, 'utf-8'));\n\n// Apply user settings\napplyUserSettings(userConfig);\n```\n\n`index.ts` serves as the main entry point for the Autodoc CLI tool, providing a set of commands for developers to generate and manage documentation for their codebase. The available commands include `init`, `estimate`, `index`, `user`, and `q`. The CLI tool uses the `commander` library for command handling and `inquirer` for interactive prompts.\n\n`types.ts` defines the types and interfaces for the autodoc project, such as `AutodocUserConfig`, `AutodocRepoConfig`, `FileSummary`, `FolderSummary`, and more. These types are used to configure and run the autodoc tool, allowing users to generate documentation for their code repositories using OpenAI's LLMs.\n\nThe `cli` subfolder contains the `spinner.ts` file, which manages a spinner for visual feedback during background processes. It exports functions like `updateSpinnerText`, `stopSpinner`, `spinnerError`, `spinnerSuccess`, and `spinnerInfo` for easy interaction with the spinner.\n\nThe `langchain` subfolder contains the `hnswlib.ts` file, which provides the `HNSWLib` class for efficient similarity search using the Hierarchical Navigable Small World (HNSW) algorithm. This class is used to store and search for documents based on their embeddings, which are high-dimensional vectors representing the documents' content. Example usage:\n\n```javascript\nconst embeddings = new Embeddings(/* ... */);\nconst args = { space: 'cosine' };\nconst hnswLib = new HNSWLib(embeddings, args);\n\n// Add documents to the index\nawait hnswLib.addDocuments(documents);\n\n// Perform a similarity search\nconst queryVector = /* ... */;\nconst k = 10;\nconst results = await hnswLib.similaritySearchVectorWithScore(queryVector, k);\n```\n\nIn summary, the code in this folder is responsible for the core functionality of the autodoc project, including user configuration management, CLI tool commands, type definitions, spinner management, and efficient similarity search using the HNSW algorithm.", + "questions": "", + "checksum": "de4c7ea3f98620e42875dbf7fb0df9a9" } \ No newline at end of file diff --git a/.autodoc/docs/json/src/types.json b/.autodoc/docs/json/src/types.json index cadfbb6..73151b8 100644 --- a/.autodoc/docs/json/src/types.json +++ b/.autodoc/docs/json/src/types.json @@ -1,7 +1,8 @@ { "fileName": "types.ts", - "filePath": "src/types.ts", - "url": "https://github.com/context-labs/autodoc/src/types.ts", - "summary": "This code defines the types and interfaces for the `autodoc` project, which aims to automatically generate documentation for a given code repository. The project uses OpenAI's language models (LLMs) to process and generate summaries, questions, and other relevant information for files and folders within the repository.\n\nThe code starts by importing `OpenAIChat` from the `langchain/llms` package. It then defines several types and interfaces that are used throughout the project:\n\n- `AutodocUserConfig`: Represents the user configuration for the autodoc project, including the LLM models to be used.\n- `AutodocRepoConfig`: Represents the configuration for a specific repository, including its name, URL, root directory, output directory, LLM models, and other settings.\n- `FileSummary` and `FolderSummary`: Represent the summaries and questions generated for files and folders, respectively.\n- `ProcessFileParams`, `ProcessFolderParams`, and `TraverseFileSystemParams`: Define the parameters for processing files, folders, and traversing the file system, respectively.\n- `ProcessFile` and `ProcessFolder`: Define the function types for processing files and folders, respectively.\n- `LLMModels`: Enumerates the available LLM models, such as GPT-3.5-turbo, GPT-4, and GPT-4-32k.\n- `LLMModelDetails`: Represents the details of an LLM model, including its name, cost per 1K tokens, maximum length, and other statistics.\n\nFor example, when using this code in the larger project, you might define a `ProcessFile` function that takes a `ProcessFileParams` object as input and generates a summary and questions for the file using the specified LLM model. Similarly, you could define a `ProcessFolder` function that processes all files and subfolders within a folder, generating summaries and questions for each.\n\nThe `TraverseFileSystemParams` type allows you to configure how the file system is traversed, including specifying which files and folders to ignore, and what prompts to use for generating summaries and questions.\n\nOverall, this code provides the foundation for the `autodoc` project by defining the types and interfaces needed to process code repositories and generate documentation using OpenAI's language models.", - "questions": "1. **Question:** What is the purpose of the `LLMModels` enum and how is it used in the code?\n **Answer:** The `LLMModels` enum defines the available language models for the autodoc project. It is used in the `AutodocUserConfig` and `AutodocRepoConfig` types to specify which language models should be used for processing files and folders.\n\n2. **Question:** What are the `ProcessFile` and `ProcessFolder` types and how are they used in the code?\n **Answer:** `ProcessFile` and `ProcessFolder` are types for functions that process a file or a folder, respectively. They are used as optional parameters in the `TraverseFileSystemParams` type, allowing developers to provide custom processing functions when traversing the file system.\n\n3. **Question:** What is the purpose of the `TraverseFileSystemParams` type and how is it used in the code?\n **Answer:** The `TraverseFileSystemParams` type defines the parameters required for traversing the file system. It is used to pass configuration options, such as input path, project name, custom processing functions, and other settings, to a function that will traverse the file system and process files and folders accordingly." + "filePath": "src\\types.ts", + "url": "https://github.com/context-labs/autodoc/src\\types.ts", + "summary": "This code defines the types and interfaces for the `autodoc` project, which aims to automatically generate documentation for a given code repository. The project uses OpenAI's language models (LLMs) to process and generate summaries, questions, and other relevant information for files and folders in the repository.\n\nThe `AutodocUserConfig` and `AutodocRepoConfig` types define the configuration options for the user and repository, respectively. These include settings such as the LLM models to use, repository URL, output directory, and content type.\n\n`FileSummary` and `FolderSummary` types represent the generated summaries for files and folders, including their paths, URLs, and checksums. The `ProcessFileParams` and `ProcessFolderParams` types define the parameters required for processing files and folders, such as the file or folder name, path, and content type.\n\n`ProcessFile` and `ProcessFolder` are function types that take the respective parameters and return a promise. These functions are responsible for processing the files and folders, generating summaries, and updating the documentation.\n\n`TraverseFileSystemParams` type defines the parameters for traversing the file system, including the input path, project name, and optional `processFile` and `processFolder` functions. It also includes settings for ignoring certain files or folders and content type preferences.\n\nThe `LLMModels` enum lists the available language models, such as GPT-3.5 Turbo, GPT-4, and GPT-4 32k. The `LLMModelDetails` type provides information about each model, including the cost per 1K tokens, maximum length, and success/failure statistics.\n\nIn the larger project, these types and interfaces would be used to configure and run the `autodoc` tool, allowing users to automatically generate documentation for their code repositories using OpenAI's language models. For example, a user could provide an `AutodocRepoConfig` object to configure the tool, and then use the `TraverseFileSystem` function to process the repository and generate the documentation.", + "questions": "1. **What is the purpose of the `AutodocUserConfig` and `AutodocRepoConfig` types?**\n\n The `AutodocUserConfig` type is used to define the user configuration for the autodoc project, which includes an array of LLMModels. The `AutodocRepoConfig` type is used to define the repository configuration for the autodoc project, which includes various properties such as name, repository URL, root, output, LLMModels, and more.\n\n2. **What are the different LLMModels available in the `LLMModels` enum?**\n\n The `LLMModels` enum lists the available language models for the autodoc project. Currently, there are three models: GPT3 (gpt-3.5-turbo), GPT4 (gpt-4), and GPT432k (gpt-4-32k).\n\n3. **What is the purpose of the `ProcessFile` and `ProcessFolder` types?**\n\n The `ProcessFile` type is a function type that takes a `ProcessFileParams` object as input and returns a Promise. It is used to process a single file in the autodoc project. The `ProcessFolder` type is a function type that takes a `ProcessFolderParams` object as input and returns a Promise. It is used to process a folder in the autodoc project.", + "checksum": "796822d4da09cce719cb86b540d2fb66" } \ No newline at end of file diff --git a/.autodoc/docs/json/tsconfig.json b/.autodoc/docs/json/tsconfig.json index b9a229a..7266ad8 100644 --- a/.autodoc/docs/json/tsconfig.json +++ b/.autodoc/docs/json/tsconfig.json @@ -2,6 +2,7 @@ "fileName": "tsconfig.json", "filePath": "tsconfig.json", "url": "https://github.com/context-labs/autodoc/tsconfig.json", - "summary": "This code is a configuration file for the TypeScript compiler in a project. The purpose of this configuration is to define various options and settings that the TypeScript compiler should use when transpiling TypeScript code into JavaScript. This is important for ensuring that the compiled output is consistent and compatible with the intended runtime environment.\n\nHere's a brief explanation of the key options set in this configuration:\n\n- `\"rootDir\": \"src\"`: Specifies the root directory containing the TypeScript source files. This tells the compiler where to look for the input files.\n- `\"outDir\": \"dist\"`: Specifies the output directory for the compiled JavaScript files. This is where the transpiled code will be saved.\n- `\"strict\": true`: Enables strict type checking, which enforces stronger type safety and helps catch potential issues during development.\n- `\"target\": \"es2020\"`: Sets the target ECMAScript version for the compiled output. In this case, the output will be compatible with ECMAScript 2020 (ES11) features.\n- `\"module\": \"ES2020\"`: Specifies the module system to use in the compiled output. This setting is aligned with the target ECMAScript version.\n- `\"sourceMap\": true`: Generates source map files alongside the compiled output. This helps with debugging by mapping the compiled code back to the original TypeScript source.\n- `\"esModuleInterop\": true` and `\"allowSyntheticDefaultImports\": true`: These options enable better compatibility with different module systems and allow for more flexible import statements.\n- `\"moduleResolution\": \"node\"`: Sets the module resolution strategy to Node.js-style, which is the most common approach for resolving module imports in JavaScript projects.\n- `\"declaration\": true`: Generates TypeScript declaration files (`.d.ts`) alongside the compiled output. These files provide type information for the compiled code, which can be useful for other TypeScript projects that depend on this one.\n- `\"skipLibCheck\": true`: Skips type checking of declaration files, which can speed up the compilation process.\n\nIn the larger project, this configuration file ensures that the TypeScript compiler produces consistent and compatible JavaScript output, making it easier to integrate the compiled code with other parts of the project or with external dependencies.", - "questions": "1. **What is the purpose of the `rootDir` and `outDir` options in the configuration?**\n\n The `rootDir` option specifies the root folder of the source files, while the `outDir` option specifies the output directory for the compiled files.\n\n2. **What does the `strict` option do in the configuration?**\n\n The `strict` option enables a set of strict type-checking options in the TypeScript compiler, ensuring a higher level of type safety in the code.\n\n3. **What is the significance of the `target` and `module` options in the configuration?**\n\n The `target` option sets the ECMAScript target version for the compiled JavaScript output, while the `module` option specifies the module system to be used in the generated code. In this case, both are set to \"es2020\", indicating that the output will be ECMAScript 2020 compliant." + "summary": "The code provided is a configuration file for the TypeScript compiler in a project. It specifies various options that control how the TypeScript compiler should process the source code and generate the output JavaScript files. This configuration file is typically named `tsconfig.json` and is placed at the root of a TypeScript project.\n\nThe `compilerOptions` object contains several key-value pairs that define the behavior of the TypeScript compiler:\n\n- `rootDir`: Specifies the root directory of the source files. In this case, it is set to \"src\", meaning that the source files are located in the \"src\" folder.\n- `outDir`: Specifies the output directory for the compiled JavaScript files. In this case, it is set to \"dist\", meaning that the compiled files will be placed in the \"dist\" folder.\n- `strict`: Enables strict type checking, which helps catch potential issues in the code.\n- `target`: Specifies the ECMAScript target version for the output JavaScript files. In this case, it is set to \"es2020\", meaning that the output files will be compatible with ECMAScript 2020 features.\n- `module`: Specifies the module system to be used. In this case, it is set to \"ES2020\", meaning that the output files will use the ECMAScript 2020 module system.\n- `sourceMap`: Generates source map files, which help in debugging the compiled code by mapping it back to the original TypeScript source files.\n- `esModuleInterop`: Enables compatibility with ECMAScript modules for importing CommonJS modules.\n- `moduleResolution`: Specifies the module resolution strategy. In this case, it is set to \"node\", meaning that the Node.js module resolution algorithm will be used.\n- `allowSyntheticDefaultImports`: Allows default imports from modules with no default export.\n- `declaration`: Generates TypeScript declaration files (`.d.ts`) alongside the compiled JavaScript files, which can be useful for other projects that depend on this one.\n- `skipLibCheck`: Skips type checking of declaration files, which can speed up the compilation process.\n\nOverall, this configuration file helps ensure that the TypeScript compiler processes the source code according to the specified options, resulting in compiled JavaScript files that are compatible with the desired ECMAScript version and module system, while also providing useful features like source maps and strict type checking.", + "questions": "1. **What is the purpose of the `rootDir` and `outDir` options in the configuration?**\n\n The `rootDir` option specifies the root directory of the input files, while the `outDir` option specifies the output directory for the compiled files.\n\n2. **What does the `strict` option do in the configuration?**\n\n The `strict` option enables a wide range of type checking behavior that results in stronger guarantees of program correctness.\n\n3. **What is the significance of the `target` and `module` options in the configuration?**\n\n The `target` option specifies the ECMAScript target version for the output code, and the `module` option specifies the module system used in the output code. In this case, both are set to \"es2020\", which means the output code will be compatible with ECMAScript 2020 features and module system.", + "checksum": "e52c7d90cf341455e41e46229333e66d" } \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/cli/commands/estimate/index.md b/.autodoc/docs/markdown/src/cli/commands/estimate/index.md index fc04cc4..f213482 100644 --- a/.autodoc/docs/markdown/src/cli/commands/estimate/index.md +++ b/.autodoc/docs/markdown/src/cli/commands/estimate/index.md @@ -1,29 +1,25 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/estimate/index.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\cli\commands\estimate\index.ts) -The `estimate` function in this code file is responsible for providing an estimated cost of indexing a given repository using the AutodocRepoConfig configuration. This function is particularly useful for users who want to get an idea of the cost involved in processing their repository before actually running the process. +The `estimate` function in this code is responsible for providing an estimated cost of processing a given repository using the Autodoc project. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository name, URL, root directory, output directory, and other settings related to the processing of the repository. -The function takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository name, URL, root directory, output directory, and other settings related to the processing of the repository. +The function starts by constructing the path to the JSON output directory, which will be used to store the intermediate results of the processing. It then updates the spinner text to indicate that the cost estimation is in progress. -The main steps involved in the function are: +Next, the `processRepository` function is called with the provided configuration options and a `true` flag to indicate that this is a dry run. This means that the repository will not actually be processed, but the function will return the details of what would happen if it were processed. This is used to calculate the estimated cost of processing the repository. -1. Set the output path for the JSON files generated during the process. -2. Update the spinner text to display "Estimating cost...". -3. Perform a dry run of the `processRepository` function with the given configuration options. The dry run does not actually process the repository but instead returns the details of the models that would be processed. -4. Stop the spinner once the dry run is complete. -5. Print the details of the models obtained from the dry run using the `printModelDetails` utility function. -6. Calculate the total estimated cost using the `totalIndexCostEstimate` utility function. -7. Display the estimated cost in a user-friendly format using the `chalk` library. +Once the dry run is complete, the spinner is updated to show success, and the results are printed using the `printModelDetails` function. The total estimated cost is then calculated using the `totalIndexCostEstimate` function, which takes the values of the `runDetails` object as input. + +Finally, the estimated cost is displayed in the console using the `chalk.redBright` function to format the text in a red color. The message also includes a disclaimer that the actual cost may vary and recommends setting a limit in the user's OpenAI account to prevent unexpected charges. Here's an example of how the `estimate` function might be used in the larger project: ```javascript -import { estimate } from './autodoc/estimate'; +import { estimate } from './path/to/this/file'; const config = { name: 'my-repo', repositoryUrl: 'https://github.com/user/my-repo.git', root: './', - output: './output/', + output: './output', llms: ['en'], ignore: ['.git', 'node_modules'], filePrompt: true, @@ -37,16 +33,16 @@ const config = { estimate(config); ``` -This example demonstrates how a user can call the `estimate` function with a specific configuration to get an estimated cost for processing their repository. +This example would estimate the cost of processing the "my-repo" repository with the specified configuration options. ## Questions: - 1. **What is the purpose of the `estimate` function and what parameters does it accept?** + 1. **What is the purpose of the `estimate` function?** - The `estimate` function is used to estimate the cost of processing a repository for indexing. It accepts an `AutodocRepoConfig` object as a parameter, which contains various configuration options such as repository URL, output path, and other settings. + The `estimate` function is used to perform a dry run of the `processRepository` command to get an estimated price for indexing the given repository. It then prints the model details and the total estimated cost. -2. **How does the `estimate` function calculate the cost estimate?** +2. **What are the parameters passed to the `processRepository` function?** - The `estimate` function performs a dry run of the `processRepository` command to get the estimated price for indexing the repository. It then uses the `totalIndexCostEstimate` function to calculate the total cost based on the returned run details. + The `processRepository` function is called with an object containing the following properties: `name`, `repositoryUrl`, `root`, `output`, `llms`, `ignore`, `filePrompt`, `folderPrompt`, `chatPrompt`, `contentType`, `targetAudience`, and `linkHosted`. Additionally, a second argument `true` is passed to indicate that it's a dry run. -3. **What is the purpose of the `printModelDetails` function and how is it used in the `estimate` function?** +3. **How is the total estimated cost calculated and displayed?** - The `printModelDetails` function is used to display the details of the models used in the estimation process. In the `estimate` function, it is called with the values of the `runDetails` object to print the model details before displaying the total cost estimate. \ No newline at end of file + The total estimated cost is calculated using the `totalIndexCostEstimate` function, which takes an array of values from the `runDetails` object. The cost is then displayed using `console.log` with `chalk.redBright` for formatting, showing the cost with two decimal places and a note that the actual cost may vary. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/cli/commands/estimate/summary.md b/.autodoc/docs/markdown/src/cli/commands/estimate/summary.md index 37f1ecb..f64823d 100644 --- a/.autodoc/docs/markdown/src/cli/commands/estimate/summary.md +++ b/.autodoc/docs/markdown/src/cli/commands/estimate/summary.md @@ -1,27 +1,23 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/estimate) +[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\docs\json\src\cli\commands\estimate) -The `estimate` function in `index.ts` is a crucial part of the Autodoc project, as it allows users to estimate the cost of indexing a given repository before actually processing it. This function takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository. +The `estimate` function in `index.ts` is a crucial part of the Autodoc project, as it provides an estimated cost of processing a given repository. It takes an `AutodocRepoConfig` object as input, containing various configuration options such as repository name, URL, root directory, output directory, and other settings related to the processing of the repository. -The main steps involved in the `estimate` function are: +The function begins by constructing the path to the JSON output directory, which stores intermediate results of the processing. It then updates the spinner text to indicate that cost estimation is in progress. The `processRepository` function is called with the provided configuration options and a `true` flag, signifying a dry run. This dry run returns the details of what would happen if the repository were processed, which is used to calculate the estimated cost. -1. Setting the output path for the JSON files generated during the process. -2. Updating the spinner text to display "Estimating cost...". -3. Performing a dry run of the `processRepository` function with the given configuration options. The dry run does not actually process the repository but instead returns the details of the models that would be processed. -4. Stopping the spinner once the dry run is complete. -5. Printing the details of the models obtained from the dry run using the `printModelDetails` utility function. -6. Calculating the total estimated cost using the `totalIndexCostEstimate` utility function. -7. Displaying the estimated cost in a user-friendly format using the `chalk` library. +Upon completion of the dry run, the spinner is updated to show success, and the results are printed using the `printModelDetails` function. The total estimated cost is calculated using the `totalIndexCostEstimate` function, which takes the values of the `runDetails` object as input. + +Finally, the estimated cost is displayed in the console using the `chalk.redBright` function to format the text in red. The message also includes a disclaimer that the actual cost may vary and recommends setting a limit in the user's OpenAI account to prevent unexpected charges. Here's an example of how the `estimate` function might be used in the larger project: ```javascript -import { estimate } from './autodoc/estimate'; +import { estimate } from './path/to/this/file'; const config = { name: 'my-repo', repositoryUrl: 'https://github.com/user/my-repo.git', root: './', - output: './output/', + output: './output', llms: ['en'], ignore: ['.git', 'node_modules'], filePrompt: true, @@ -35,6 +31,4 @@ const config = { estimate(config); ``` -This example demonstrates how a user can call the `estimate` function with a specific configuration to get an estimated cost for processing their repository. The function is designed to work seamlessly with other parts of the Autodoc project, such as the `processRepository` function, which is responsible for the actual processing of the repository. - -By providing an estimated cost upfront, the `estimate` function helps users make informed decisions about whether to proceed with the indexing process or not. This can be particularly useful for users with large repositories or those who are working within a budget. Overall, the `estimate` function is an essential tool for users looking to leverage the power of Autodoc while managing their costs effectively. +This example would estimate the cost of processing the "my-repo" repository with the specified configuration options. diff --git a/.autodoc/docs/markdown/src/cli/commands/index/convertJsonToMarkdown.md b/.autodoc/docs/markdown/src/cli/commands/index/convertJsonToMarkdown.md index 82f31c5..cfba5ac 100644 --- a/.autodoc/docs/markdown/src/cli/commands/index/convertJsonToMarkdown.md +++ b/.autodoc/docs/markdown/src/cli/commands/index/convertJsonToMarkdown.md @@ -1,61 +1,37 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/index/convertJsonToMarkdown.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\cli\commands\index\convertJsonToMarkdown.ts) -The `convertJsonToMarkdown` function in this code is responsible for converting JSON files containing documentation information into Markdown files. This is done in two main steps: counting the number of files in the project and creating Markdown files for each code file in the project. +The `convertJsonToMarkdown` function in this code is responsible for converting JSON files containing documentation information into Markdown files. This function is part of the larger Autodoc project, which aims to automate the process of generating documentation for code repositories. -First, the function uses the `traverseFileSystem` utility to count the number of files in the project. It takes an `AutodocRepoConfig` object as input, which contains information about the project, such as its name, root directory, output directory, and other configuration options. The `traverseFileSystem` utility is called with a `processFile` function that increments the `files` counter for each file encountered. +The function takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the project name, input and output directories, and other settings related to the documentation generation process. -```javascript -await traverseFileSystem({ - inputPath: inputRoot, - projectName, - processFile: () => { - files++; - return Promise.resolve(); - }, - ignore: [], - filePrompt, - folderPrompt, - contentType, - targetAudience, - linkHosted, -}); -``` +The code first counts the number of files in the project by traversing the file system using the `traverseFileSystem` utility function. This is done to provide a progress update to the user via the `updateSpinnerText` function. -Next, the function defines another `processFile` function that reads the content of each JSON file, converts it to a Markdown format, and writes the output to a new Markdown file in the specified output directory. It first checks if the content exists, and if not, it returns early. It then creates the output directory if it doesn't exist, and parses the JSON content into either a `FolderSummary` or a `FileSummary` object, depending on the file name. +Next, the `processFile` function is defined, which is responsible for reading the content of each JSON file, parsing it, and converting it into a Markdown format. The function checks if the file has a summary, and if so, it generates the Markdown content with a link to the code on GitHub, the summary, and any questions if present. The output Markdown file is then saved in the specified output directory. -The function then constructs the Markdown content by including a link to the code on GitHub, the summary, and any questions if they exist. Finally, it writes the Markdown content to the output file with the `.md` extension. +Finally, the `traverseFileSystem` function is called again, this time with the `processFile` function as an argument. This allows the code to process each JSON file in the project and convert it into a Markdown file. Once the process is complete, a success message is displayed to the user using the `spinnerSuccess` function. -```javascript -const outputPath = getFileName(markdownFilePath, '.', '.md'); -await fs.writeFile(outputPath, markdown, 'utf-8'); -``` - -The `convertJsonToMarkdown` function is then called again with the new `processFile` function to create the Markdown files for each code file in the project. +Example usage: ```javascript -await traverseFileSystem({ - inputPath: inputRoot, - projectName, - processFile, - ignore: [], - filePrompt, - folderPrompt, - contentType, - targetAudience, - linkHosted, +convertJsonToMarkdown({ + name: "myProject", + root: "./input", + output: "./output", + filePrompt: true, + folderPrompt: true, + contentType: "code", + targetAudience: "developers", + linkHosted: "https://github.com/user/myProject", }); ``` -In summary, this code is responsible for converting JSON files containing documentation information into Markdown files, which can be used in the larger Autodoc project to generate documentation for code repositories. +This will convert all JSON files in the `./input` directory into Markdown files and save them in the `./output` directory. ## Questions: - 1. **What is the purpose of the `convertJsonToMarkdown` function?** - - The `convertJsonToMarkdown` function is responsible for converting JSON files containing summaries and questions about code files in a project into Markdown files. It traverses the file system, reads the JSON files, and creates corresponding Markdown files with the provided information. - -2. **How does the `traverseFileSystem` function work and what are its parameters?** - - The `traverseFileSystem` function is a utility function that recursively traverses the file system starting from a given input path. It takes an object as a parameter with properties such as `inputPath`, `projectName`, `processFile`, `ignore`, `filePrompt`, `folderPrompt`, `contentType`, `targetAudience`, and `linkHosted`. The function processes each file using the provided `processFile` callback and can be configured to ignore certain files or folders. + 1. **Question:** What is the purpose of the `convertJsonToMarkdown` function and what are the expected inputs? + **Answer:** The `convertJsonToMarkdown` function is used to convert JSON files to Markdown files for each code file in the project. It takes an `AutodocRepoConfig` object as input, which contains various properties like projectName, root, output, filePrompt, folderPrompt, contentType, targetAudience, and linkHosted. -3. **What is the purpose of the `processFile` function inside `convertJsonToMarkdown`?** +2. **Question:** How does the `traverseFileSystem` function work and what is its role in this code? + **Answer:** The `traverseFileSystem` function is a utility function that recursively traverses the file system, starting from the inputPath, and processes each file using the provided `processFile` function. In this code, it is used twice: first to count the number of files in the project, and then to create Markdown files for each code file in the project. - The `processFile` function is a callback function that is passed to the `traverseFileSystem` function. It is responsible for reading the content of a JSON file, parsing it, and creating a corresponding Markdown file with the summary and questions. It also handles creating the output directory if it doesn't exist and writing the Markdown content to the output file. \ No newline at end of file +3. **Question:** How are the output directories and Markdown files created, and what is the structure of the generated Markdown content? + **Answer:** The output directories are created using the `fs.mkdir` function with the `recursive: true` option. The Markdown files are created using the `fs.writeFile` function. The structure of the generated Markdown content includes a link to view the code on GitHub, the summary, and optionally, a list of questions if they exist. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/cli/commands/index/createVectorStore.md b/.autodoc/docs/markdown/src/cli/commands/index/createVectorStore.md index 31495cc..2f8217c 100644 --- a/.autodoc/docs/markdown/src/cli/commands/index/createVectorStore.md +++ b/.autodoc/docs/markdown/src/cli/commands/index/createVectorStore.md @@ -1,14 +1,14 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/index/createVectorStore.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\cli\commands\index\createVectorStore.ts) -The code in this file is responsible for processing a directory of text files, splitting the text into chunks, and creating a vector store using the HNSWLib library and OpenAIEmbeddings. +The code in this file is responsible for processing a directory of text files, splitting the text into chunks, and creating a vector store using the HNSWLib library and OpenAIEmbeddings. This vector store can be used for efficient similarity search and retrieval of documents in the larger project. -The `processFile` function takes a file path as input and returns a Promise that resolves to a Document object. It reads the file contents and creates a Document object with the file contents as `pageContent` and the file path as metadata. +The `processFile` function reads a file's content and creates a `Document` object with the content and metadata (source file path). It returns a Promise that resolves to the created Document. -The `processDirectory` function takes a directory path as input and returns a Promise that resolves to an array of Document objects. It reads the files in the directory and calls `processFile` for each file. If a file is a directory, it calls `processDirectory` recursively. The function accumulates all the Document objects in an array and returns it. +The `processDirectory` function is a recursive function that processes a directory and its subdirectories. It reads the files in the directory, and for each file, it checks if it's a directory or a regular file. If it's a directory, the function calls itself with the new directory path. If it's a file, it calls the `processFile` function to create a Document object. The function returns an array of Document objects. -The `RepoLoader` class extends the `BaseDocumentLoader` class and has a constructor that takes a file path as input. It has a `load` method that calls the `processDirectory` function with the file path and returns the resulting array of Document objects. +The `RepoLoader` class extends the `BaseDocumentLoader` class and has a constructor that takes a file path as an argument. It has a `load` method that calls the `processDirectory` function with the given file path and returns the array of Document objects. -The `createVectorStore` function is an async function that takes an AutodocRepoConfig object as input, which contains the root directory and output file path. It creates a RepoLoader instance with the root directory, loads the raw documents, and splits them into chunks using the `RecursiveCharacterTextSplitter` class. It then creates a vector store using the HNSWLib library and OpenAIEmbeddings, and saves the vector store to the output file path. +The `createVectorStore` function is an async function that takes an `AutodocRepoConfig` object as an argument, which contains the root directory and output file path. It creates a `RepoLoader` instance with the root directory and loads the documents using the `load` method. It then creates a `RecursiveCharacterTextSplitter` instance with a specified chunk size and chunk overlap and splits the documents into chunks. Finally, it creates a vector store using the HNSWLib library and OpenAIEmbeddings with the processed documents and saves the vector store to the output file path. Example usage: @@ -22,14 +22,12 @@ createVectorStore(config).then(() => { console.log('Vector store created successfully'); }); ``` - -This code snippet would process all the text files in the `./data/documents` directory, split the text into chunks, create a vector store using the HNSWLib library and OpenAIEmbeddings, and save the vector store to the `./data/vector_store` file. ## Questions: - 1. **Question:** What is the purpose of the `processFile` function and how does it handle errors? - **Answer:** The `processFile` function reads the content of a file and creates a `Document` object with the file contents and metadata. If there is an error while reading the file, it rejects the promise with the error. + 1. **Question:** What is the purpose of the `processFile` function and what does it return? + **Answer:** The `processFile` function is an asynchronous function that reads the content of a file given its file path, creates a `Document` object with the file contents and metadata (source file path), and returns a Promise that resolves to the created `Document` object. -2. **Question:** How does the `processDirectory` function handle nested directories and files? - **Answer:** The `processDirectory` function iterates through the files in a directory. If it encounters a subdirectory, it calls itself recursively to process the subdirectory. If it encounters a file, it processes the file using the `processFile` function and adds the resulting `Document` object to the `docs` array. +2. **Question:** How does the `processDirectory` function work and what does it return? + **Answer:** The `processDirectory` function is an asynchronous function that takes a directory path as input, reads all the files and subdirectories within it, and processes them recursively. It returns a Promise that resolves to an array of `Document` objects created from the files in the directory and its subdirectories. -3. **Question:** What is the purpose of the `createVectorStore` function and how does it use the `RepoLoader` class? - **Answer:** The `createVectorStore` function is responsible for creating a vector store from a given repository. It uses the `RepoLoader` class to load all the documents from the repository, splits the text into chunks using the `RecursiveCharacterTextSplitter`, and then creates a vector store using the `HNSWLib.fromDocuments` method with the `OpenAIEmbeddings`. Finally, it saves the vector store to the specified output path. \ No newline at end of file +3. **Question:** What is the purpose of the `createVectorStore` function and how does it work? + **Answer:** The `createVectorStore` function is an asynchronous function that takes an `AutodocRepoConfig` object as input, which contains the root directory path and output file path. The function loads all the documents from the root directory using the `RepoLoader`, splits the text into chunks using the `RecursiveCharacterTextSplitter`, creates a vector store from the documents using the `HNSWLib` and `OpenAIEmbeddings`, and saves the vector store to the specified output file. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/cli/commands/index/index.md b/.autodoc/docs/markdown/src/cli/commands/index/index.md index 1950469..67191ab 100644 --- a/.autodoc/docs/markdown/src/cli/commands/index/index.md +++ b/.autodoc/docs/markdown/src/cli/commands/index/index.md @@ -1,50 +1,45 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/index/index.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\cli\commands\index\index.ts) -The code in this file is responsible for processing a given repository and generating documentation in JSON and Markdown formats, as well as creating vector files for the documentation. It exports a single function `index` that takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository. +The code in this file is responsible for processing a given repository and generating documentation in JSON, Markdown, and vector formats. It exports a single function `index` that takes an `AutodocRepoConfig` object as its argument, which contains various configuration options for processing the repository. -The `index` function performs the following steps: +The `index` function performs three main tasks: -1. Define the paths for JSON, Markdown, and data output directories within the `output` folder. +1. **Process the repository**: It traverses the repository, calls the LLMS (Language Learning Management System) for each file, and creates JSON files with the results. This is done using the `processRepository` function, which takes the same configuration options as the `index` function. The JSON files are stored in the `output/docs/json/` directory. -2. Process the repository by traversing its files, calling the LLMS (Language Learning Management System) for each file, and creating JSON files with the results. This is done using the `processRepository` function, which takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step. + ```javascript + updateSpinnerText('Processing repository...'); + await processRepository({ /* configuration options */ }); + spinnerSuccess(); + ``` -3. Convert the generated JSON files into Markdown format using the `convertJsonToMarkdown` function. This function also takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step, and a success message is displayed upon completion. +2. **Create Markdown files**: It converts the generated JSON files into Markdown files using the `convertJsonToMarkdown` function. This function also takes the same configuration options as the `index` function. The Markdown files are stored in the `output/docs/markdown/` directory. -4. Create vector files for the generated Markdown documentation using the `createVectorStore` function. This function also takes the same configuration options as the `index` function. The spinner text is updated to show the progress of this step, and a success message is displayed upon completion. + ```javascript + updateSpinnerText('Creating markdown files...'); + await convertJsonToMarkdown({ /* configuration options */ }); + spinnerSuccess(); + ``` -Here's an example of how this code might be used in the larger project: +3. **Create vector files**: It creates vector files from the generated Markdown files using the `createVectorStore` function. This function also takes the same configuration options as the `index` function. The vector files are stored in the `output/docs/data/` directory. -```javascript -import autodoc from './autodoc'; + ```javascript + updateSpinnerText('Create vector files...'); + await createVectorStore({ /* configuration options */ }); + spinnerSuccess(); + ``` -const config = { - name: 'MyProject', - repositoryUrl: 'https://github.com/user/myproject', - root: './src', - output: './output', - llms: 'https://llms.example.com', - ignore: ['.git', 'node_modules'], - filePrompt: true, - folderPrompt: true, - chatPrompt: true, - contentType: 'text', - targetAudience: 'developers', - linkHosted: 'https://myproject-docs.example.com', -}; +Throughout the execution of these tasks, the code uses `updateSpinnerText` and `spinnerSuccess` functions to provide visual feedback on the progress of the tasks. -autodoc.index(config); -``` - -This example would process the `MyProject` repository, generate JSON and Markdown documentation, and create vector files for the documentation, all while providing progress updates through spinner text. +In the larger project, this code would be used to automatically generate documentation for a given repository based on the provided configuration options. The generated documentation can then be used for various purposes, such as displaying it on a website or analyzing the content for specific insights. ## Questions: - 1. **What is the purpose of the `index` function in this code?** + 1. **What does the `index` function do in this code?** - The `index` function is the main entry point for the autodoc project. It processes a given repository, converts the JSON files to markdown, and creates vector files based on the provided configuration options. + The `index` function is the main entry point for the autodoc project. It takes an `AutodocRepoConfig` object as input and performs three main tasks: processing the repository and creating JSON files, converting JSON files to markdown files, and creating vector files. -2. **What are the different steps involved in processing the repository?** +2. **What is the purpose of the `processRepository`, `convertJsonToMarkdown`, and `createVectorStore` functions?** - The processing of the repository involves three main steps: (1) traversing the repository and calling LLMS for each file to create JSON files with the results, (2) converting the JSON files to markdown files, and (3) creating vector files from the markdown files. + The `processRepository` function traverses the repository, calls LLMS for each file, and creates JSON files with the results. The `convertJsonToMarkdown` function creates markdown files from the generated JSON files. The `createVectorStore` function creates vector files from the markdown files. -3. **What is the role of the `AutodocRepoConfig` type?** +3. **What are the different types of prompts (`filePrompt`, `folderPrompt`, `chatPrompt`) used for in this code?** - The `AutodocRepoConfig` type is used to define the shape of the configuration object that is passed to the `index` function. It specifies the properties and their types that are required for the function to process the repository, convert JSON to markdown, and create vector files. \ No newline at end of file + These prompts are likely used to interact with the user during the processing of the repository. The `filePrompt` might be used to ask the user for input regarding specific files, the `folderPrompt` for input regarding folders, and the `chatPrompt` for general input or feedback during the processing. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/cli/commands/index/processRepository.md b/.autodoc/docs/markdown/src/cli/commands/index/processRepository.md index 20ef43d..bb943ec 100644 --- a/.autodoc/docs/markdown/src/cli/commands/index/processRepository.md +++ b/.autodoc/docs/markdown/src/cli/commands/index/processRepository.md @@ -1,44 +1,20 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/index/processRepository.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\cli\commands\index\processRepository.ts) -The `processRepository` function in this code is responsible for processing a given code repository and generating summaries and questions for each file and folder within the repository. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the repository URL, input and output paths, language models to use, and other settings. +The `processRepository` function in this code is responsible for generating summaries and questions for code files and folders in a given repository. It takes an `AutodocRepoConfig` object as input, which contains information about the project, repository URL, input and output paths, language models, and other configurations. An optional `dryRun` parameter can be provided to skip actual API calls and file writing. -The function starts by initializing an `APIRateLimit` instance to limit the number of API calls made to the language models. It then defines several helper functions, such as `callLLM` for making API calls, `isModel` for checking if a given model is valid, `processFile` for processing individual files, and `processFolder` for processing folders. +The function starts by initializing the encoding and rate limit for API calls. It then defines two main helper functions: `processFile` and `processFolder`. The `processFile` function is responsible for processing individual code files. It reads the file content, calculates a checksum, and checks if reindexing is needed. If reindexing is required, it creates prompts for summaries and questions, selects the appropriate language model based on the input length, and calls the language model API to generate the summaries and questions. The results are then saved to a JSON file in the output directory. -The `processFile` function reads the content of a file, generates prompts for summaries and questions using the `createCodeFileSummary` and `createCodeQuestions` functions, and selects the best language model to use based on the token length of the prompts. It then calls the language model API to generate the summaries and questions, and saves the results as JSON files in the output directory. +The `processFolder` function is responsible for processing folders. It reads the folder content, calculates a checksum, and checks if reindexing is needed. If reindexing is required, it reads the summaries and questions of all files and subfolders in the folder, calls the language model API to generate a summary for the folder, and saves the result to a `summary.json` file in the folder. -The `processFolder` function reads the contents of a folder, filters out ignored files, and processes each file and subfolder within the folder. It then generates a summary prompt using the `folderSummaryPrompt` function and calls the language model API to generate a summary for the folder. The folder summary, along with the summaries and questions of its files and subfolders, is saved as a JSON file in the output directory. +The main function then counts the number of files and folders in the project and processes them using the `traverseFileSystem` utility function. It processes all files first, followed by all folders. Finally, it returns the language model usage statistics. -The main part of the `processRepository` function first counts the number of files and folders in the input directory using the `filesAndFolders` function. It then processes each file and folder using the `traverseFileSystem` function, which calls the `processFile` and `processFolder` functions for each file and folder encountered. Finally, the function returns the language models used during processing. - -Example usage of the `processRepository` function: - -```javascript -const autodocConfig = { - name: 'myProject', - repositoryUrl: 'https://github.com/user/myProject', - root: 'src', - output: 'output', - llms: [LLMModels.GPT3, LLMModels.GPT4], - ignore: ['.git', 'node_modules'], - filePrompt: 'Explain this code file', - folderPrompt: 'Summarize this folder', - contentType: 'code', - targetAudience: 'developers', - linkHosted: true, -}; - -processRepository(autodocConfig).then((models) => { - console.log('Processing complete'); -}); -``` - -This code would process the `src` directory of the `myProject` repository, generating summaries and questions for each file and folder, and saving the results in the `output` directory. +The `calculateChecksum` function calculates the checksum of a list of file contents, while the `reindexCheck` function checks if reindexing is needed by comparing the new and old checksums of a file or folder. ## Questions: - 1. **Question:** What is the purpose of the `processRepository` function and what are its input parameters? - **Answer:** The `processRepository` function is responsible for processing a code repository by generating summaries and questions for each file and folder in the project. It takes an `AutodocRepoConfig` object as input, which contains various configuration options such as the project name, repository URL, input and output paths, language models, and other settings. Additionally, it accepts an optional `dryRun` parameter, which, if set to true, will not save the generated summaries and questions to disk. + 1. **Question:** What is the purpose of the `processRepository` function and what are its inputs and outputs? + **Answer:** The `processRepository` function processes a given code repository, generating summaries and questions for each file and folder within the repository. It takes an `AutodocRepoConfig` object and an optional `dryRun` boolean as inputs. The function returns a `Promise` that resolves to an object containing the models used during processing. -2. **Question:** How does the code determine the best language model to use for generating summaries and questions? - **Answer:** The code checks the maximum token length of each available language model (GPT3, GPT4, and GPT432k) and compares it with the token length of the prompts (summary and questions). It selects the first model that can handle the maximum token length and is included in the `llms` array provided in the configuration. +2. **Question:** How does the `calculateChecksum` function work and what is its purpose? + **Answer:** The `calculateChecksum` function takes an array of file contents as input and calculates a checksum for each file using the MD5 hashing algorithm. It then concatenates all the checksums and calculates a final checksum using MD5 again. The purpose of this function is to generate a unique identifier for the contents of the files, which can be used to determine if the files have changed and need to be reprocessed. -3. **Question:** How does the code handle traversing the file system and processing files and folders? - **Answer:** The code uses the `traverseFileSystem` utility function to traverse the file system. It takes an object with various configuration options, including the input path, project name, and callbacks for processing files and folders. The `processFile` and `processFolder` functions are passed as callbacks to handle the processing of files and folders, respectively. \ No newline at end of file +3. **Question:** How does the `reindexCheck` function work and when is it used? + **Answer:** The `reindexCheck` function checks if a summary.json file exists in the given file or folder path and compares the stored checksum with the new checksum to determine if the file or folder needs to be reindexed. It is used in the `processFile` and `processFolder` functions to decide whether to regenerate summaries and questions for a file or folder based on changes in their contents. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/cli/commands/index/prompts.md b/.autodoc/docs/markdown/src/cli/commands/index/prompts.md index c365548..4ff2751 100644 --- a/.autodoc/docs/markdown/src/cli/commands/index/prompts.md +++ b/.autodoc/docs/markdown/src/cli/commands/index/prompts.md @@ -1,32 +1,38 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/index/prompts.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\cli\commands\index\prompts.ts) -The code in this file provides three functions that generate prompts for documentation experts to create summaries and answer questions about code files and folders in a project. These functions are likely used in the larger autodoc project to automate the process of generating documentation for code files and folders. +This code defines three utility functions that generate prompts for documentation experts working on a project. These functions are used to create documentation for code files and folders within a project. The generated prompts are in markdown format and include specific instructions for the documentation expert. -1. `createCodeFileSummary`: This function takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `filePrompt`. It returns a formatted string prompt for a documentation expert to write a summary of the code file. The prompt includes the file path, project name, content type, and a custom file prompt. For example: +1. `createCodeFileSummary`: This function generates a prompt for creating a summary of a code file. It takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `filePrompt`. The function returns a markdown formatted string that includes the file's content and a custom prompt for the documentation expert. +Example usage: ```javascript -createCodeFileSummary('src/example.js', 'autodoc', 'console.log("Hello, World!");', 'JavaScript', 'Write a detailed technical explanation of what this code does.'); +const prompt = createCodeFileSummary('path/to/file.js', 'MyProject', 'const x = 10;', 'JavaScript', 'Write a detailed technical explanation of this code.'); ``` -2. `createCodeQuestions`: This function takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `targetAudience`. It returns a formatted string prompt for a documentation expert to generate three questions and answers that a target audience might have about the code file. The prompt includes the file path, project name, content type, and target audience. For example: +2. `createCodeQuestions`: This function generates a prompt for creating a list of questions and answers about a code file. It takes five parameters: `filePath`, `projectName`, `fileContents`, `contentType`, and `targetAudience`. The function returns a markdown formatted string that includes the file's content and a custom prompt for the documentation expert to provide questions and answers. +Example usage: ```javascript -createCodeQuestions('src/example.js', 'autodoc', 'console.log("Hello, World!");', 'JavaScript', 'beginner'); +const prompt = createCodeQuestions('path/to/file.js', 'MyProject', 'const x = 10;', 'JavaScript', 'beginner'); ``` -3. `folderSummaryPrompt`: This function takes six parameters: `folderPath`, `projectName`, `files`, `folders`, `contentType`, and `folderPrompt`. It returns a formatted string prompt for a documentation expert to write a summary of the folder and its contents. The prompt includes the folder path, project name, content type, a list of files and their summaries, a list of subfolders and their summaries, and a custom folder prompt. For example: +3. `folderSummaryPrompt`: This function generates a prompt for creating a summary of a folder containing code files and subfolders. It takes six parameters: `folderPath`, `projectName`, `files`, `folders`, `contentType`, and `folderPrompt`. The `files` parameter is an array of `FileSummary` objects, and the `folders` parameter is an array of `FolderSummary` objects. The function returns a markdown formatted string that includes a list of files and folders with their summaries and a custom prompt for the documentation expert. +Example usage: ```javascript -folderSummaryPrompt('src/', 'autodoc', [{fileName: 'example.js', summary: 'A simple example file'}], [{folderName: 'utils', summary: 'Utility functions'}], 'JavaScript', 'Write a detailed technical explanation of the folder structure and contents.'); +const prompt = folderSummaryPrompt('path/to/folder', 'MyProject', fileSummaries, folderSummaries, 'JavaScript', 'Write a detailed technical explanation of this folder structure.'); ``` -These functions can be used in the autodoc project to generate prompts for documentation experts, helping to streamline the process of creating documentation for code files and folders. +These functions can be used in the larger project to generate documentation tasks for experts, ensuring consistent formatting and instructions across different parts of the project. ## Questions: - 1. **Question:** What is the purpose of the `createCodeFileSummary` function? - **Answer:** The `createCodeFileSummary` function generates a string template for a code file summary prompt, which includes the file path, project name, file contents, content type, and a file prompt. + 1. **What is the purpose of the `createCodeFileSummary` function?** -2. **Question:** How does the `createCodeQuestions` function differ from the `createCodeFileSummary` function? - **Answer:** The `createCodeQuestions` function generates a string template for a code documentation prompt that asks for 3 questions and their answers, while the `createCodeFileSummary` function generates a string template for a code file summary prompt. + The `createCodeFileSummary` function generates a string template for a code file summary prompt, which includes the file path, project name, file contents, content type, and a file prompt. -3. **Question:** What is the purpose of the `folderSummaryPrompt` function and what parameters does it take? - **Answer:** The `folderSummaryPrompt` function generates a string template for a folder summary prompt, which includes the folder path, project name, files, folders, content type, and a folder prompt. It takes parameters such as folderPath, projectName, files, folders, contentType, and folderPrompt. \ No newline at end of file +2. **How does the `createCodeQuestions` function differ from the `createCodeFileSummary` function?** + + The `createCodeQuestions` function generates a string template for a code documentation prompt that asks for 3 questions and their answers, while the `createCodeFileSummary` function generates a string template for a code file summary prompt. + +3. **What is the role of the `folderSummaryPrompt` function?** + + The `folderSummaryPrompt` function generates a string template for a folder summary prompt, which includes the folder path, project name, lists of files and folders with their summaries, content type, and a folder prompt. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/cli/commands/index/summary.md b/.autodoc/docs/markdown/src/cli/commands/index/summary.md index 1efb44a..27f3ef4 100644 --- a/.autodoc/docs/markdown/src/cli/commands/index/summary.md +++ b/.autodoc/docs/markdown/src/cli/commands/index/summary.md @@ -1,36 +1,34 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/index) +[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\docs\json\src\cli\commands\index) -The code in this folder is responsible for processing a given code repository, generating documentation in JSON and Markdown formats, and creating vector files for the documentation. It provides several functions and utilities to achieve these tasks, such as traversing the file system, calling language models, and converting JSON files to Markdown. +The code in this folder is responsible for processing a given repository and generating documentation in JSON, Markdown, and vector formats. It consists of several functions and utilities that work together to automate the documentation generation process. -For example, the `processRepository` function processes a code repository and generates summaries and questions for each file and folder within the repository. It uses helper functions like `callLLM` to make API calls to language models and `processFile` and `processFolder` to process individual files and folders. The results are saved as JSON files in the output directory. +The main function, `index`, takes an `AutodocRepoConfig` object as input, which contains various configuration options for processing the repository. It performs three main tasks: -The `convertJsonToMarkdown` function converts JSON files containing documentation information into Markdown files. It counts the number of files in the project and creates Markdown files for each code file in the project using the `traverseFileSystem` utility. +1. **Process the repository**: It calls the `processRepository` function to traverse the repository, generate summaries and questions for code files and folders using the LLMS (Language Learning Management System), and create JSON files with the results. These JSON files are stored in the `output/docs/json/` directory. -The `createVectorStore` function processes a directory of text files, splits the text into chunks, and creates a vector store using the HNSWLib library and OpenAIEmbeddings. It processes the files in the directory and calls `processFile` for each file, creating a vector store and saving it to the output file path. +2. **Create Markdown files**: It uses the `convertJsonToMarkdown` function to convert the generated JSON files into Markdown files. These Markdown files are stored in the `output/docs/markdown/` directory. -Here's an example of how this code might be used in the larger project: +3. **Create vector files**: It calls the `createVectorStore` function to create vector files from the generated Markdown files. These vector files are stored in the `output/docs/data/` directory. + +Throughout the execution of these tasks, the code provides visual feedback on the progress of the tasks using `updateSpinnerText` and `spinnerSuccess` functions. + +Here's an example of how this code might be used: ```javascript -import autodoc from './autodoc'; - -const config = { - name: 'MyProject', - repositoryUrl: 'https://github.com/user/myproject', - root: './src', - output: './output', - llms: 'https://llms.example.com', - ignore: ['.git', 'node_modules'], +index({ + name: "myProject", + root: "./input", + output: "./output", filePrompt: true, folderPrompt: true, - chatPrompt: true, - contentType: 'text', - targetAudience: 'developers', - linkHosted: 'https://myproject-docs.example.com', -}; - -autodoc.index(config); + contentType: "code", + targetAudience: "developers", + linkHosted: "https://github.com/user/myProject", +}); ``` -This example would process the `MyProject` repository, generate JSON and Markdown documentation, and create vector files for the documentation, all while providing progress updates through spinner text. +This will process the repository located at `./input`, generate documentation in JSON, Markdown, and vector formats, and save the results in the `./output` directory. + +The `prompts.ts` file contains utility functions that generate prompts for documentation experts. These functions create markdown formatted strings with specific instructions for the documentation expert, ensuring consistent formatting and instructions across different parts of the project. -In summary, the code in this folder plays a crucial role in the Autodoc project by processing code repositories, generating documentation in various formats, and creating vector files for the documentation. This helps developers to easily generate and maintain documentation for their projects, making it more accessible and understandable for other developers and users. +In summary, the code in this folder automates the process of generating documentation for a given repository based on the provided configuration options. The generated documentation can be used for various purposes, such as displaying it on a website or analyzing the content for specific insights. diff --git a/.autodoc/docs/markdown/src/cli/commands/init/index.md b/.autodoc/docs/markdown/src/cli/commands/init/index.md index d7abbef..f3384c0 100644 --- a/.autodoc/docs/markdown/src/cli/commands/init/index.md +++ b/.autodoc/docs/markdown/src/cli/commands/init/index.md @@ -1,32 +1,40 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/init/index.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\cli\commands\init\index.ts) -This code is responsible for initializing and configuring the `autodoc` project. It provides a function `init` that creates a configuration file `autodoc.config.json` with user inputs and default values. The configuration file is essential for the project to function correctly and adapt to different user requirements. +This code is responsible for initializing the configuration of the Autodoc project. It provides a template for the configuration and prompts the user to input necessary information to set up the project. The main functionality is exposed through the `init` function, which is an asynchronous function that takes an optional `AutodocRepoConfig` object as an argument. -The `makeConfigTemplate` function generates a default configuration object with pre-defined values. It takes an optional `config` parameter to override the default values. The returned object contains settings such as repository name, URL, output directory, LLM models, and various prompts for generating documentation. +The `makeConfigTemplate` function creates a default configuration object with pre-defined values for various properties. It takes an optional `config` parameter and returns a new `AutodocRepoConfig` object with the provided values or default values if not provided. -The `init` function is an asynchronous function that takes an optional `config` parameter. It first checks if a configuration file already exists in the project directory. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits. +The `init` function first checks if an `autodoc.config.json` file already exists in the project root. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits. -If there is no existing configuration file or the user chooses to overwrite, the function prompts the user for the repository name, URL, and LLM models they have access to. These values are then used to create a new configuration object using the `makeConfigTemplate` function. +Next, the user is prompted to enter the name of their repository, the GitHub URL of their repository, and the LLMs they have access to. The LLMs are language models used for generating documentation. The user can choose between GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). -Finally, the new configuration object is written to the `autodoc.config.json` file in the project directory. A success message is displayed, instructing the user to run `doc index` to get started. +After the user provides the necessary information, a new configuration object is created using the `makeConfigTemplate` function with the user's input. The new configuration is then written to the `autodoc.config.json` file in the project root. -Here's an example of how the `init` function is used: +Finally, a success message is displayed, instructing the user to run `doc index` to get started with the Autodoc project. + +Example usage: ```javascript -import { init } from './autodoc'; +import { init } from './path/to/this/file'; -(async () => { - await init(); -})(); -``` +// Initialize the configuration with default values +await init(); -This code imports the `init` function and calls it, initializing the `autodoc` project with the user's inputs and default values. +// Initialize the configuration with custom values +await init({ + name: 'My Custom Repository', + repositoryUrl: 'https://github.com/user/repo', +}); +``` ## Questions: - 1. **Question:** What is the purpose of the `makeConfigTemplate` function and what does it return? - **Answer:** The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc project. It takes an optional `config` parameter of type `AutodocRepoConfig` and returns a new `AutodocRepoConfig` object with default values for each property, using the provided `config` values if available. + 1. **What is the purpose of the `makeConfigTemplate` function?** + + The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc project. It takes an optional `config` parameter of type `AutodocRepoConfig` and returns a new configuration object with default values for various properties. + +2. **How does the `init` function work and when is it called?** + + The `init` function is an asynchronous function that initializes the Autodoc configuration by creating an `autodoc.config.json` file in the specified location. It takes an optional `config` parameter of type `AutodocRepoConfig` and prompts the user for input to set the configuration values. It is called when the user wants to set up the Autodoc configuration for their project. -2. **Question:** How does the `init` function work and what does it do with the user's input? - **Answer:** The `init` function is an asynchronous function that initializes the Autodoc configuration by prompting the user for input using the `inquirer` package. It takes an optional `config` parameter of type `AutodocRepoConfig` and uses it as the default values for the prompts. After collecting the user's input, it creates a new configuration object using the `makeConfigTemplate` function and writes it to a file named `autodoc.config.json`. +3. **What is the purpose of the `inquirer.prompt` calls in the `init` function?** -3. **Question:** What are the different LLM models available in the `llms` prompt and how are they used in the configuration? - **Answer:** The `llms` prompt provides three choices for the user to select the LLM models they have access to: GPT-3.5 Turbo, GPT-3.5 Turbo and GPT-4 8K (Early Access), and GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The selected LLM models are stored in the `llms` property of the `AutodocRepoConfig` object, which can be used later in the project to determine which models to use for generating documentation. \ No newline at end of file + The `inquirer.prompt` calls are used to interactively prompt the user for input to set the configuration values for the Autodoc project. The user is asked for the repository name, repository URL, and the LLMs they have access to. The input is then used to create a new configuration object and write it to the `autodoc.config.json` file. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/cli/commands/init/summary.md b/.autodoc/docs/markdown/src/cli/commands/init/summary.md index 85bc3fd..9c96c0c 100644 --- a/.autodoc/docs/markdown/src/cli/commands/init/summary.md +++ b/.autodoc/docs/markdown/src/cli/commands/init/summary.md @@ -1,23 +1,30 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/init) +[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\docs\json\src\cli\commands\init) -The `index.ts` file in the `init` folder is responsible for initializing and configuring the `autodoc` project. It provides an essential function called `init` that creates a configuration file named `autodoc.config.json` with user inputs and default values. This configuration file is crucial for the project to function correctly and adapt to different user requirements. +The `index.ts` file in the `.autodoc\docs\json\src\cli\commands\init` folder is responsible for initializing the configuration of the Autodoc project. It provides a template for the configuration and prompts the user to input necessary information to set up the project. The main functionality is exposed through the `init` function, which is an asynchronous function that takes an optional `AutodocRepoConfig` object as an argument. -The `makeConfigTemplate` function generates a default configuration object with pre-defined values. It takes an optional `config` parameter to override the default values. The returned object contains settings such as repository name, URL, output directory, LLM models, and various prompts for generating documentation. +The `makeConfigTemplate` function creates a default configuration object with pre-defined values for various properties. It takes an optional `config` parameter and returns a new `AutodocRepoConfig` object with the provided values or default values if not provided. -The `init` function is an asynchronous function that takes an optional `config` parameter. It first checks if a configuration file already exists in the project directory. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits. +The `init` function first checks if an `autodoc.config.json` file already exists in the project root. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits. -If there is no existing configuration file or the user chooses to overwrite, the function prompts the user for the repository name, URL, and LLM models they have access to. These values are then used to create a new configuration object using the `makeConfigTemplate` function. +Next, the user is prompted to enter the name of their repository, the GitHub URL of their repository, and the LLMs they have access to. The LLMs are language models used for generating documentation. The user can choose between GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). -Finally, the new configuration object is written to the `autodoc.config.json` file in the project directory. A success message is displayed, instructing the user to run `doc index` to get started. +After the user provides the necessary information, a new configuration object is created using the `makeConfigTemplate` function with the user's input. The new configuration is then written to the `autodoc.config.json` file in the project root. -Here's an example of how the `init` function is used: +Finally, a success message is displayed, instructing the user to run `doc index` to get started with the Autodoc project. + +Example usage: ```javascript -import { init } from './autodoc'; +import { init } from './path/to/this/file'; + +// Initialize the configuration with default values +await init(); -(async () => { - await init(); -})(); +// Initialize the configuration with custom values +await init({ + name: 'My Custom Repository', + repositoryUrl: 'https://github.com/user/repo', +}); ``` -This code imports the `init` function and calls it, initializing the `autodoc` project with the user's inputs and default values. The `init` function is a crucial part of the project, as it sets up the necessary configuration for the project to work correctly. It interacts with other parts of the project by providing the required settings and values, ensuring that the project can adapt to different user requirements and preferences. +This code is essential for setting up the Autodoc project, as it creates the necessary configuration file and gathers user input to customize the project. It works in conjunction with other parts of the project, such as the CLI and the documentation generation process, which rely on the configuration file to function correctly. diff --git a/.autodoc/docs/markdown/src/cli/commands/query/createChatChain.md b/.autodoc/docs/markdown/src/cli/commands/query/createChatChain.md index b41b9b6..adea20d 100644 --- a/.autodoc/docs/markdown/src/cli/commands/query/createChatChain.md +++ b/.autodoc/docs/markdown/src/cli/commands/query/createChatChain.md @@ -1,44 +1,32 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/query/createChatChain.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\cli\commands\query\createChatChain.ts) -This code defines a function `makeChain` that creates a chatbot for answering questions about a software project. The chatbot is built using the `ChatVectorDBQAChain` class, which combines two separate language models: a question generator and a document chain. +This code defines a function `makeChain` that creates a chatbot for answering questions about a software project called `projectName`. The chatbot is trained on the content of the project, which is located at `repositoryUrl`. The content type of the project is specified by the `contentType` parameter. The chatbot is designed to provide conversational answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate. The target audience for the chatbot is specified by the `targetAudience` parameter. -The question generator is an instance of the `LLMChain` class, which uses the OpenAIChat API to generate standalone questions based on a given conversation history. The `CONDENSE_PROMPT` template is used to format the input for the language model. +The `makeChain` function takes several parameters: -The document chain is created using the `loadQAChain` function, which takes an instance of the OpenAIChat API and a prompt template as input. The `makeQAPrompt` function generates this template, which instructs the language model to provide a conversational answer with hyperlinks to the project's GitHub repository. The answer should be tailored to the target audience and include code examples when appropriate. +- `projectName`: The name of the software project. +- `repositoryUrl`: The URL of the project's repository. +- `contentType`: The type of content the chatbot is trained on. +- `chatPrompt`: Additional instructions for answering questions about the content type. +- `targetAudience`: The intended audience for the chatbot's answers. +- `vectorstore`: An instance of HNSWLib for efficient nearest neighbor search. +- `llms`: An array of LLMModels, which are language models used for generating answers. +- `onTokenStream`: An optional callback function that is called when a new token is generated by the language model. -The `makeChain` function takes the following parameters: +The `makeChain` function first creates a question generator using the `LLMChain` class. This generator is responsible for rephrasing follow-up questions to be standalone questions. It uses the `CONDENSE_PROMPT` template, which is defined at the beginning of the code. -- `projectName`: The name of the software project. -- `repositoryUrl`: The URL of the project's GitHub repository. -- `contentType`: The type of content the chatbot is trained on (e.g., code, documentation). -- `chatPrompt`: Additional instructions for answering questions about the content. -- `targetAudience`: The intended audience for the chatbot's answers (e.g., developers, users). -- `vectorstore`: An instance of the `HNSWLib` class for storing and searching vectors. -- `llms`: An array of language models (e.g., GPT-3, GPT-4). -- `onTokenStream`: An optional callback function to handle streaming tokens. - -Example usage: - -```javascript -const chatbot = makeChain( - "autodoc", - "https://github.com/autodoc/autodoc", - "code", - "", - "developer", - vectorstore, - [gpt3, gpt4], - (token) => console.log(token) -); -``` - -This creates a chatbot that can answer questions about the "autodoc" project, using the provided language models and vector store. +Next, the function creates a `QA_PROMPT` template using the `makeQAPrompt` function. This template is used to generate answers to the questions in a conversational manner, with hyperlinks back to GitHub and code examples where appropriate. + +Finally, the function creates and returns a new instance of the `ChatVectorDBQAChain` class, which combines the question generator and the document chain to create a chatbot that can answer questions about the software project. The chatbot uses the `vectorstore` for efficient nearest neighbor search and the `llms` language models for generating answers. If the `onTokenStream` callback is provided, it will be called when a new token is generated by the language model. ## Questions: 1. **Question:** What is the purpose of the `makeChain` function and what are its input parameters? - **Answer:** The `makeChain` function is used to create a new `ChatVectorDBQAChain` instance, which is responsible for generating questions and answers based on the given input parameters. The input parameters include `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and an optional `onTokenStream` callback function. -2. **Question:** What are the roles of `CONDENSE_PROMPT` and `QA_PROMPT` in the code? - **Answer:** `CONDENSE_PROMPT` is a template for generating a standalone question from a given chat history and follow-up input. `QA_PROMPT` is a template for generating a conversational answer with hyperlinks back to GitHub, based on the given context and question. Both templates are used in the `LLMChain` and `loadQAChain` instances, respectively. + **Answer:** The `makeChain` function is used to create a `ChatVectorDBQAChain` instance, which is responsible for generating questions and answers based on the given input parameters. The input parameters include `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and an optional `onTokenStream` function. + +2. **Question:** What are the roles of `CONDENSE_PROMPT` and `QA_PROMPT` in this code? + + **Answer:** `CONDENSE_PROMPT` is a template for generating standalone questions from a given chat history and follow-up question. `QA_PROMPT` is a template for generating conversational answers with hyperlinks to GitHub, based on the provided context and question. Both templates are used in the `LLMChain` and `loadQAChain` instances, respectively. + +3. **Question:** How does the `onTokenStream` function work and when is it used? -3. **Question:** How does the `onTokenStream` callback function work and when is it used? - **Answer:** The `onTokenStream` callback function is an optional parameter in the `makeChain` function. It is used to handle the streaming of tokens generated by the OpenAIChat instance. If provided, it will be called with each new token generated during the chat process, allowing developers to handle or process the tokens in real-time. \ No newline at end of file + **Answer:** The `onTokenStream` function is an optional callback that can be provided to the `makeChain` function. It is used to handle the streaming of tokens generated by the OpenAIChat instance. If provided, it will be called with each new token generated during the chat process. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/cli/commands/query/index.md b/.autodoc/docs/markdown/src/cli/commands/query/index.md index 338396c..2288bca 100644 --- a/.autodoc/docs/markdown/src/cli/commands/query/index.md +++ b/.autodoc/docs/markdown/src/cli/commands/query/index.md @@ -1,33 +1,44 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/query/index.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\cli\commands\query\index.ts) -This code defines a chatbot interface for the Autodoc project, which allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation. +This code defines a chatbot interface for the Autodoc project, which allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a combination of the `inquirer` library for user input, `marked` and `marked-terminal` for rendering Markdown output, and the `langchain` library for handling natural language processing tasks. -The code starts by importing necessary libraries and setting up the `marked` library with a custom terminal renderer for displaying Markdown content. It then defines a `chatHistory` array to store the conversation history between the user and the chatbot. +The `query` function is the main entry point for the chatbot. It takes two arguments: an `AutodocRepoConfig` object containing information about the code repository, and an `AutodocUserConfig` object containing user-specific settings. The function initializes a vector store using the `HNSWLib` and `OpenAIEmbeddings` classes, and creates a chat chain using the `makeChain` function. -The `displayWelcomeMessage` function is used to display a welcome message to the user when they start the chatbot. The `clearScreenAndMoveCursorToTop` function clears the terminal screen and moves the cursor to the top. +The chatbot interface is displayed using the `displayWelcomeMessage` function, which prints a welcome message to the console. The `getQuestion` function is used to prompt the user for a question using the `inquirer` library. The chatbot then enters a loop, where it processes the user's question, generates a response using the chat chain, and displays the response as Markdown in the terminal. -The main function, `query`, takes two arguments: `AutodocRepoConfig` and `AutodocUserConfig`. It initializes the `vectorStore` by loading pre-trained embeddings and creates a `chain` object using the `makeChain` function. This chain object is responsible for generating responses based on the user's input. +If an error occurs during the processing of a question, the chatbot will display an error message and continue to prompt the user for a new question. The loop continues until the user types 'exit', at which point the chatbot terminates. -The `getQuestion` function uses the `inquirer` library to prompt the user for a question. The main loop of the chatbot starts by getting the user's question and continues until the user types 'exit'. Inside the loop, the code updates the spinner text to 'Thinking...' and calls the `chain` object with the user's question and chat history. The response is then displayed in Markdown format using the `marked` library. +Here's an example of how the `query` function might be used: -If an error occurs during the process, the chatbot displays an error message and prompts the user for another question. +```javascript +import { query } from './autodoc'; -Example usage: +const repoConfig = { + name: 'MyProject', + repositoryUrl: 'https://github.com/user/myproject', + output: 'path/to/output', + contentType: 'code', + chatPrompt: 'Ask me anything about MyProject', + targetAudience: 'developers', +}; + +const userConfig = { + llms: 'path/to/llms', +}; -```javascript query(repoConfig, userConfig); ``` -This chatbot interface can be used in the larger Autodoc project to help users navigate and understand the codebase more efficiently by providing a conversational interface for asking questions and receiving answers. +This example would initialize the chatbot with the specified repository and user configurations, and start the chatbot interface for the user to ask questions about the "MyProject" codebase. ## Questions: - 1. **What is the purpose of the `query` function and what are its input parameters?** + 1. **What is the purpose of the `query` function in this code?** - The `query` function is used to interact with the chatbot, taking user input and providing responses based on the given codebase. It takes two input parameters: an `AutodocRepoConfig` object containing information about the repository, and an `AutodocUserConfig` object containing user-specific configuration. + The `query` function is responsible for handling user interactions with the chatbot. It takes in an AutodocRepoConfig object and an AutodocUserConfig object, sets up the necessary data structures, and then enters a loop where it prompts the user for questions, processes them, and displays the results. -2. **How does the `vectorStore` work and what is its role in the code?** +2. **How does the code handle rendering Markdown text in the terminal?** - The `vectorStore` is an instance of HNSWLib loaded with data from the specified output directory and using OpenAIEmbeddings. It is used to store and retrieve vector representations of the codebase, which are then used by the `makeChain` function to generate responses to user questions. + The code uses the `marked` library along with a custom `TerminalRenderer` to render Markdown text in the terminal. The `marked` library is configured with the custom renderer using `marked.setOptions({ renderer: new TerminalRenderer() });`. -3. **How does the chat history work and what is its purpose?** +3. **What is the purpose of the `chatHistory` variable and how is it used?** - The `chatHistory` is an array of string pairs, where each pair represents a user question and the corresponding chatbot response. It is used to store the conversation history between the user and the chatbot, allowing the chatbot to provide context-aware responses based on previous interactions. \ No newline at end of file + The `chatHistory` variable is an array that stores the history of questions and answers in the chat session. It is used to keep track of the conversation between the user and the chatbot. When a new question is asked, the chat history is passed to the `chain.call()` function, and the new question and its corresponding answer are added to the `chatHistory` array. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/cli/commands/query/summary.md b/.autodoc/docs/markdown/src/cli/commands/query/summary.md index 709ec84..d184d42 100644 --- a/.autodoc/docs/markdown/src/cli/commands/query/summary.md +++ b/.autodoc/docs/markdown/src/cli/commands/query/summary.md @@ -1,32 +1,34 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/query) +[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\docs\json\src\cli\commands\query) -The `query` folder in the Autodoc project contains code for creating a chatbot interface that allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation. +The `query` folder in the Autodoc project contains code for creating a chatbot that can answer questions about a specific software project in a conversational manner. The chatbot is trained on the content of the project and provides answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate. -In `createChatChain.ts`, the `makeChain` function is defined, which creates a chatbot using the `ChatVectorDBQAChain` class. This class combines two separate language models: a question generator and a document chain. The question generator is an instance of the `LLMChain` class, which uses the OpenAIChat API to generate standalone questions based on a given conversation history. The document chain is created using the `loadQAChain` function, which takes an instance of the OpenAIChat API and a prompt template as input. +The main entry point for the chatbot is the `query` function in `index.ts`. It takes two arguments: an `AutodocRepoConfig` object containing information about the code repository, and an `AutodocUserConfig` object containing user-specific settings. The function initializes a vector store and creates a chat chain using the `makeChain` function from `createChatChain.ts`. -Example usage of `makeChain`: +Here's an example of how the `query` function might be used: ```javascript -const chatbot = makeChain( - "autodoc", - "https://github.com/autodoc/autodoc", - "code", - "", - "developer", - vectorstore, - [gpt3, gpt4], - (token) => console.log(token) -); -``` - -In `index.ts`, the main chatbot interface is defined. It starts by importing necessary libraries and setting up the `marked` library with a custom terminal renderer for displaying Markdown content. The main function, `query`, takes two arguments: `AutodocRepoConfig` and `AutodocUserConfig`. It initializes the `vectorStore` by loading pre-trained embeddings and creates a `chain` object using the `makeChain` function. This chain object is responsible for generating responses based on the user's input. +import { query } from './autodoc'; -The main loop of the chatbot starts by getting the user's question and continues until the user types 'exit'. Inside the loop, the code updates the spinner text to 'Thinking...' and calls the `chain` object with the user's question and chat history. The response is then displayed in Markdown format using the `marked` library. +const repoConfig = { + name: 'MyProject', + repositoryUrl: 'https://github.com/user/myproject', + output: 'path/to/output', + contentType: 'code', + chatPrompt: 'Ask me anything about MyProject', + targetAudience: 'developers', +}; -Example usage of the chatbot interface: +const userConfig = { + llms: 'path/to/llms', +}; -```javascript query(repoConfig, userConfig); ``` -This chatbot interface can be used in the larger Autodoc project to help users navigate and understand the codebase more efficiently by providing a conversational interface for asking questions and receiving answers. +This example initializes the chatbot with the specified repository and user configurations and starts the chatbot interface for the user to ask questions about the "MyProject" codebase. + +The `createChatChain.ts` file defines the `makeChain` function, which creates a chatbot for answering questions about a software project. The chatbot is designed to provide conversational answers with hyperlinks back to GitHub, including code examples and links to the examples where appropriate. The target audience for the chatbot is specified by the `targetAudience` parameter. + +The `makeChain` function takes several parameters, such as `projectName`, `repositoryUrl`, `contentType`, `chatPrompt`, `targetAudience`, `vectorstore`, `llms`, and `onTokenStream`. It first creates a question generator using the `LLMChain` class, then creates a `QA_PROMPT` template using the `makeQAPrompt` function, and finally creates and returns a new instance of the `ChatVectorDBQAChain` class, which combines the question generator and the document chain to create a chatbot that can answer questions about the software project. + +In summary, the code in the `query` folder is responsible for creating a chatbot that can answer questions about a specific software project in a conversational manner. The chatbot uses a combination of natural language processing techniques and efficient nearest neighbor search to generate accurate and relevant answers for the user. diff --git a/.autodoc/docs/markdown/src/cli/commands/summary.md b/.autodoc/docs/markdown/src/cli/commands/summary.md index 429b49f..4aab10a 100644 --- a/.autodoc/docs/markdown/src/cli/commands/summary.md +++ b/.autodoc/docs/markdown/src/cli/commands/summary.md @@ -1,53 +1,101 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands) +[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\docs\json\src\cli\commands) -The code in the `src/cli/commands` folder is responsible for handling various command-line tasks in the Autodoc project. It contains several subfolders, each dedicated to a specific command or functionality, such as estimating costs, processing repositories, initializing the project, querying the chatbot, and managing user configurations. +The code in the `.autodoc\docs\json\src\cli\commands` folder is responsible for various tasks related to the Autodoc project, such as initializing the configuration, processing repositories, generating documentation, and creating a chatbot for answering questions about a specific software project. The folder contains several subfolders, each with a specific purpose. -For instance, the `estimate` subfolder contains a function that allows users to estimate the cost of indexing a given repository before actually processing it. This function takes an `AutodocRepoConfig` object as input and performs a dry run of the `processRepository` function. It then calculates the total estimated cost and displays it to the user. This helps users make informed decisions about whether to proceed with the indexing process or not. +### estimate + +The `estimate` function provides an estimated cost of processing a given repository. It takes an `AutodocRepoConfig` object as input and performs a dry run of the repository processing to calculate the estimated cost. Example usage: ```javascript -import { estimate } from './autodoc/estimate'; +import { estimate } from './path/to/this/file'; const config = { - // ...configuration options... + name: 'my-repo', + repositoryUrl: 'https://github.com/user/my-repo.git', + root: './', + output: './output', + llms: ['en'], + ignore: ['.git', 'node_modules'], + filePrompt: true, + folderPrompt: true, + chatPrompt: true, + contentType: 'code', + targetAudience: 'developers', + linkHosted: true, }; estimate(config); ``` -The `index` subfolder contains code for processing a given code repository, generating documentation in JSON and Markdown formats, and creating vector files for the documentation. It provides several functions and utilities to achieve these tasks, such as traversing the file system, calling language models, and converting JSON files to Markdown. - -```javascript -import autodoc from './autodoc'; +### index -const config = { - // ...configuration options... -}; +The code in this folder processes a given repository and generates documentation in JSON, Markdown, and vector formats. It takes an `AutodocRepoConfig` object as input and performs three main tasks: processing the repository, creating Markdown files, and creating vector files. Example usage: -autodoc.index(config); +```javascript +index({ + name: "myProject", + root: "./input", + output: "./output", + filePrompt: true, + folderPrompt: true, + contentType: "code", + targetAudience: "developers", + linkHosted: "https://github.com/user/myProject", +}); ``` -The `init` subfolder is responsible for initializing and configuring the `autodoc` project. It provides an essential function called `init` that creates a configuration file named `autodoc.config.json` with user inputs and default values. +### init + +The `init` function initializes the configuration of the Autodoc project. It prompts the user to input necessary information to set up the project and creates the `autodoc.config.json` file in the project root. Example usage: ```javascript -import { init } from './autodoc'; +import { init } from './path/to/this/file'; -(async () => { - await init(); -})(); +// Initialize the configuration with default values +await init(); + +// Initialize the configuration with custom values +await init({ + name: 'My Custom Repository', + repositoryUrl: 'https://github.com/user/repo', +}); ``` -The `query` subfolder contains code for creating a chatbot interface that allows users to ask questions related to a specific codebase and receive answers in a conversational manner. The chatbot uses a language model to generate responses based on the user's input and the codebase documentation. +### query + +The `query` folder contains code for creating a chatbot that can answer questions about a specific software project. The main entry point is the `query` function, which takes an `AutodocRepoConfig` object and an `AutodocUserConfig` object as input. Example usage: ```javascript +import { query } from './autodoc'; + +const repoConfig = { + name: 'MyProject', + repositoryUrl: 'https://github.com/user/myproject', + output: 'path/to/output', + contentType: 'code', + chatPrompt: 'Ask me anything about MyProject', + targetAudience: 'developers', +}; + +const userConfig = { + llms: 'path/to/llms', +}; + query(repoConfig, userConfig); ``` -The `user` subfolder is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs). +### user + +The `user` folder manages the user configuration for the Autodoc project. It allows users to create, update, and save their configuration file, which stores information about their access to different Language Learning Models (LLMs). Example usage: + +```javascript +import { user } from './path/to/this/file'; + +// Create a new user configuration with default settings +await user(); -```typescript -async function user(): Promise { - // ... -} +// Update the user configuration with a custom config object +await user({ llms: [LLMModels.GPT3, LLMModels.GPT4] }); ``` -In summary, the code in the `src/cli/commands` folder plays a crucial role in the Autodoc project by providing various command-line functionalities, such as estimating costs, processing repositories, initializing the project, querying the chatbot, and managing user configurations. These functionalities help developers to easily generate and maintain documentation for their projects, making it more accessible and understandable for other developers and users. +In summary, the code in this folder is essential for various tasks related to the Autodoc project, such as initializing the configuration, processing repositories, generating documentation, and creating a chatbot for answering questions about a specific software project. diff --git a/.autodoc/docs/markdown/src/cli/commands/user/index.md b/.autodoc/docs/markdown/src/cli/commands/user/index.md index 6fcef0b..682866a 100644 --- a/.autodoc/docs/markdown/src/cli/commands/user/index.md +++ b/.autodoc/docs/markdown/src/cli/commands/user/index.md @@ -1,26 +1,37 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/commands/user/index.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\cli\commands\user\index.ts) -This code is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3.5 Turbo, GPT-4 8K, and GPT-4 32K. +This code is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3, GPT-4, and GPT-4 32K. -The `makeConfigTemplate` function is used to create a default configuration object with the specified LLMs or default to GPT-3.5 Turbo if none are provided. This function is used to generate the initial configuration object for the user. +The `makeConfigTemplate` function is used to create a default configuration object with the provided `config` parameter or with GPT-3 as the default LLM. This function is used to generate a new configuration object when needed. -The `user` function is an asynchronous function that handles the user configuration process. It first checks if a user configuration file already exists. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits. +The main function, `user`, is an asynchronous function that takes an optional `config` parameter. It first checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits. -If the user decides to continue or if no configuration file exists, the function proceeds to create the necessary directories for the configuration file. It then prompts the user to select the LLMs they have access to using the `inquirer` library. The user can choose from three options: +If the user configuration file does not exist, the code attempts to create the necessary directories for the file. If there's an error during this process, it logs the error and exits with a non-zero status code. -1. GPT-3.5 Turbo -2. GPT-3.5 Turbo, GPT-4 8K (Early Access) -3. GPT-3.5 Turbo, GPT-4 8K (Early Access), GPT-4 32K (Early Access) +Next, the user is prompted to select which LLMs they have access to. The available options are GPT-3.5 Turbo, GPT-3.5 Turbo with GPT-4 8K (Early Access), and GPT-3.5 Turbo with GPT-4 8K and GPT-4 32K (Early Access). The user's selection is then used to create a new configuration object using the `makeConfigTemplate` function. -After the user makes their selection, the new configuration object is created using the `makeConfigTemplate` function with the selected LLMs. The configuration object is then saved to the user configuration file in JSON format. +Finally, the new configuration object is written to the user configuration file in JSON format. A success message is displayed to the user, indicating that the configuration has been saved and they can start querying using the `doc q` command. -Finally, the user is informed that the configuration has been saved and they can start querying by running the `doc q` command. +Example usage: + +```javascript +import { user } from './path/to/this/file'; + +// Create a new user configuration with default settings +await user(); + +// Update the user configuration with a custom config object +await user({ llms: [LLMModels.GPT3, LLMModels.GPT4] }); +``` ## Questions: - 1. **Question:** What is the purpose of the `makeConfigTemplate` function and what does it return? - **Answer:** The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc user. It takes an optional `config` parameter of type `AutodocUserConfig` and returns a new configuration object with the `llms` property set to the provided value or a default value of `[LLMModels.GPT3]`. + 1. **What is the purpose of the `makeConfigTemplate` function?** + + The `makeConfigTemplate` function is used to create a default configuration object for the Autodoc user. It takes an optional `config` parameter and returns an object with a `llms` property, which is an array of LLM models. + +2. **How does the `user` function handle existing user configuration files?** + + The `user` function checks if a user configuration file already exists using `fsSync.existsSync`. If it does, the user is prompted with a confirmation message to overwrite the existing configuration. If the user chooses not to overwrite, the process exits with a status code of 0. -2. **Question:** How does the `user` function handle existing user configuration files? - **Answer:** The `user` function checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the function prompts the user with a confirmation message to overwrite the existing configuration. If the user chooses not to overwrite, the process exits; otherwise, the function proceeds to create a new configuration. +3. **What are the available choices for LLM models in the `user` function?** -3. **Question:** What are the available choices for the LLMs in the `user` function, and how are they used to create the new configuration? - **Answer:** The available choices for LLMs are GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The user can select one of these options, and the corresponding LLM models will be set as the value of the `llms` property in the new configuration object. \ No newline at end of file + The available choices for LLM models are GPT-3.5 Turbo, GPT-3.5 Turbo and GPT-4 8K (Early Access), and GPT-3.5 Turbo, GPT-4 8K (Early Access), and GPT-4 32K (Early Access). The user can select one of these options, and the selected value is stored in the `llms` property of the new configuration object. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/cli/commands/user/summary.md b/.autodoc/docs/markdown/src/cli/commands/user/summary.md index 349d5a6..8e74a3c 100644 --- a/.autodoc/docs/markdown/src/cli/commands/user/summary.md +++ b/.autodoc/docs/markdown/src/cli/commands/user/summary.md @@ -1,40 +1,29 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/commands/user) +[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\docs\json\src\cli\commands\user) -The `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project. It provides a way to create, update, and save the user configuration file, which stores information about the user's access to different Language Learning Models (LLMs) such as GPT-3.5 Turbo, GPT-4 8K, and GPT-4 32K. +The `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project. It allows users to create, update, and save their configuration file, which stores information about their access to different Language Learning Models (LLMs) such as GPT-3, GPT-4, and GPT-4 32K. -The `makeConfigTemplate` function is used to create a default configuration object with the specified LLMs or default to GPT-3.5 Turbo if none are provided. This function is used to generate the initial configuration object for the user. +The `makeConfigTemplate` function creates a default configuration object with either the provided `config` parameter or GPT-3 as the default LLM. This function is useful for generating a new configuration object when needed. -```typescript -function makeConfigTemplate(llms: string[]): ConfigTemplate { - // ... -} -``` +The main function, `user`, is an asynchronous function that takes an optional `config` parameter. It first checks if a user configuration file already exists at the `userConfigFilePath`. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits. -The `user` function is an asynchronous function that handles the user configuration process. It first checks if a user configuration file already exists. If it does, the user is prompted to confirm whether they want to overwrite the existing configuration. If the user chooses not to overwrite, the process exits. +If the user configuration file does not exist, the code attempts to create the necessary directories for the file. If there's an error during this process, it logs the error and exits with a non-zero status code. -```typescript -async function user(): Promise { - // ... -} -``` +Next, the user is prompted to select which LLMs they have access to. The available options are GPT-3.5 Turbo, GPT-3.5 Turbo with GPT-4 8K (Early Access), and GPT-3.5 Turbo with GPT-4 8K and GPT-4 32K (Early Access). The user's selection is then used to create a new configuration object using the `makeConfigTemplate` function. -If the user decides to continue or if no configuration file exists, the function proceeds to create the necessary directories for the configuration file. It then prompts the user to select the LLMs they have access to using the `inquirer` library. The user can choose from three options: +Finally, the new configuration object is written to the user configuration file in JSON format. A success message is displayed to the user, indicating that the configuration has been saved and they can start querying using the `doc q` command. -1. GPT-3.5 Turbo -2. GPT-3.5 Turbo, GPT-4 8K (Early Access) -3. GPT-3.5 Turbo, GPT-4 8K (Early Access), GPT-4 32K (Early Access) +This code is essential for the Autodoc project as it allows users to manage their access to different LLMs and store their preferences in a configuration file. This configuration file can then be used by other parts of the project to determine which LLMs the user has access to and tailor the querying process accordingly. -After the user makes their selection, the new configuration object is created using the `makeConfigTemplate` function with the selected LLMs. The configuration object is then saved to the user configuration file in JSON format. +Example usage: -```typescript -const configTemplate = makeConfigTemplate(selectedLLMs); -await fs.promises.writeFile(configPath, JSON.stringify(configTemplate, null, 2)); -``` +```javascript +import { user } from './path/to/this/file'; -Finally, the user is informed that the configuration has been saved and they can start querying by running the `doc q` command. +// Create a new user configuration with default settings +await user(); -This code is essential for setting up the user's environment and preferences for the Autodoc project. It ensures that the user has the correct configuration file in place, which is necessary for the proper functioning of the project. The user configuration file is used by other parts of the project to determine which LLMs the user has access to and can query. - -For example, when a user runs the `doc q` command, the project will read the user configuration file to determine which LLMs are available for querying. This ensures that the user only queries the LLMs they have access to, preventing any unauthorized access or usage. +// Update the user configuration with a custom config object +await user({ llms: [LLMModels.GPT3, LLMModels.GPT4] }); +``` -In summary, the `index.ts` file in the `user` folder is responsible for managing the user configuration for the Autodoc project, ensuring that the user has the correct configuration file in place, and allowing the user to select the LLMs they have access to. This is essential for the proper functioning of the project and for maintaining the user's preferences and access to different LLMs. +In summary, the `index.ts` file in the `user` folder is a crucial part of the Autodoc project, allowing users to manage their LLM access and preferences. This configuration is then used by other parts of the project to provide a tailored experience based on the user's access to different LLMs. diff --git a/.autodoc/docs/markdown/src/cli/spinner.md b/.autodoc/docs/markdown/src/cli/spinner.md index 1d7643d..c8e8b15 100644 --- a/.autodoc/docs/markdown/src/cli/spinner.md +++ b/.autodoc/docs/markdown/src/cli/spinner.md @@ -1,39 +1,56 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/spinner.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\cli\spinner.ts) -This code provides a utility for managing a command-line spinner using the `ora` library. The spinner is a visual indicator that displays a series of characters in a loop, giving the user feedback that a process is running in the background. The code exports several functions to control the spinner's behavior, such as updating the text, stopping the spinner, and displaying success, error, or informational messages. +This code is responsible for managing a spinner, which is a visual element that indicates a process is running in the background. The spinner is created using the `ora` library, which provides a simple and customizable way to create spinners for command-line interfaces. -The `spinner` object is created as a singleton to ensure that there is only one instance of the spinner at any given time. This prevents multiple spinners from being displayed simultaneously, which could cause confusion for the user. The spinner is configured to use the 'dots' style. +The code starts by importing the `ora` library and creating a singleton spinner instance with the 'dots' style. This ensures that there will only be one spinner active at any given time. -The `updateSpinnerText` function is used to update the spinner's text. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the given message. For example: +There are several functions exported by this module to interact with the spinner: -```javascript -updateSpinnerText('Loading data...'); -``` +1. `updateSpinnerText(message: string)`: This function updates the spinner's text with the provided message. If the spinner is already spinning, it simply updates the text; otherwise, it starts the spinner with the new message. -The `stopSpinner` function stops the spinner if it is currently spinning: + Example usage: + ```javascript + updateSpinnerText('Loading data...'); + ``` -```javascript -stopSpinner(); -``` +2. `stopSpinner()`: This function stops the spinner if it is currently spinning. -The `spinnerError`, `spinnerSuccess`, and `spinnerInfo` functions are used to display error, success, and informational messages, respectively. These functions first check if the spinner is spinning and then call the appropriate `ora` method to display the message with the corresponding status symbol (e.g., a red cross for errors, a green checkmark for success, etc.): + Example usage: + ```javascript + stopSpinner(); + ``` -```javascript -spinnerError('An error occurred'); -spinnerSuccess('Operation completed successfully'); -spinnerInfo('Please wait...'); -``` +3. `spinnerError(message?: string)`: This function stops the spinner and marks it as failed with an optional error message. It only takes effect if the spinner is currently spinning. -In the larger project, this utility can be used to provide a consistent and user-friendly interface for displaying progress and status messages during long-running tasks or processes. + Example usage: + ```javascript + spinnerError('Failed to load data'); + ``` + +4. `spinnerSuccess(message?: string)`: This function stops the spinner and marks it as successful with an optional success message. It only takes effect if the spinner is currently spinning. + + Example usage: + ```javascript + spinnerSuccess('Data loaded successfully'); + ``` + +5. `spinnerInfo(message: string)`: This function displays an informational message without affecting the spinner's state. + + Example usage: + ```javascript + spinnerInfo('Connecting to server...'); + ``` + +In the larger project, this module can be used to provide visual feedback to users when a background process is running, such as loading data, connecting to a server, or performing a complex calculation. By using the exported functions, developers can easily update the spinner's text, stop it, or change its state to indicate success, failure, or display informational messages. ## Questions: 1. **What is the purpose of the `ora` package in this code?** - The `ora` package is used to create a spinner in the terminal, providing a visual indication of a running process. In this code, it is used to create a singleton spinner with the 'dots' style. + The `ora` package is used to create a spinner in the command line interface, providing a visual indication of a running process. In this code, it is used to create a singleton spinner with the 'dots' style. -2. **What are the different states of the spinner and how are they updated?** +2. **How does the `updateSpinnerText` function work?** - The spinner can have different states such as spinning, stopped, failed, succeeded, and displaying information. The functions `updateSpinnerText`, `stopSpinner`, `spinnerError`, `spinnerSuccess`, and `spinnerInfo` are used to update the spinner's state and text accordingly. + The `updateSpinnerText` function takes a message as an input and updates the spinner's text with the given message. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the new message. -3. **How does the `updateSpinnerText` function work and when should it be used?** +3. **What are the differences between `spinnerError`, `spinnerSuccess`, and `spinnerInfo` functions?** - The `updateSpinnerText` function updates the spinner's text with the provided message. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the new message. This function should be used when you want to change the spinner's text while it is spinning or start it with a new message. \ No newline at end of file + These functions are used to update the spinner's state and message based on the outcome of a process. `spinnerError` is called when there is an error, and it stops the spinner with a failure message. `spinnerSuccess` is called when the process is successful, and it stops the spinner with a success message. `spinnerInfo` is used to display an informational message without stopping the spinner. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/cli/summary.md b/.autodoc/docs/markdown/src/cli/summary.md index d36d7bf..29a248f 100644 --- a/.autodoc/docs/markdown/src/cli/summary.md +++ b/.autodoc/docs/markdown/src/cli/summary.md @@ -1,27 +1,42 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli) +[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\docs\json\src\cli) -The `spinner.ts` file in the `.autodoc/docs/json/src/cli` folder provides a utility for managing a command-line spinner using the `ora` library. The spinner is a visual indicator that displays a series of characters in a loop, giving the user feedback that a process is running in the background. The code exports several functions to control the spinner's behavior, such as updating the text, stopping the spinner, and displaying success, error, or informational messages. +The code in the `spinner.ts` file, located in the `.autodoc\docs\json\src\cli` folder, is responsible for managing a spinner, a visual element that indicates a background process is running. The spinner is created using the `ora` library, which provides a simple and customizable way to create spinners for command-line interfaces. -The `spinner` object is created as a singleton to ensure that there is only one instance of the spinner at any given time. This prevents multiple spinners from being displayed simultaneously, which could cause confusion for the user. The spinner is configured to use the 'dots' style. +The module exports several functions to interact with the spinner: -The `updateSpinnerText` function is used to update the spinner's text. If the spinner is already spinning, it updates the text directly; otherwise, it starts the spinner with the given message. For example: +1. `updateSpinnerText(message: string)`: Updates the spinner's text with the provided message. If the spinner is already spinning, it simply updates the text; otherwise, it starts the spinner with the new message. -```javascript -updateSpinnerText('Loading data...'); -``` + Example usage: + ```javascript + updateSpinnerText('Loading data...'); + ``` -The `stopSpinner` function stops the spinner if it is currently spinning: +2. `stopSpinner()`: Stops the spinner if it is currently spinning. -```javascript -stopSpinner(); -``` + Example usage: + ```javascript + stopSpinner(); + ``` -The `spinnerError`, `spinnerSuccess`, and `spinnerInfo` functions are used to display error, success, and informational messages, respectively. These functions first check if the spinner is spinning and then call the appropriate `ora` method to display the message with the corresponding status symbol (e.g., a red cross for errors, a green checkmark for success, etc.): +3. `spinnerError(message?: string)`: Stops the spinner and marks it as failed with an optional error message. It only takes effect if the spinner is currently spinning. -```javascript -spinnerError('An error occurred'); -spinnerSuccess('Operation completed successfully'); -spinnerInfo('Please wait...'); -``` + Example usage: + ```javascript + spinnerError('Failed to load data'); + ``` -In the larger project, this utility can be used to provide a consistent and user-friendly interface for displaying progress and status messages during long-running tasks or processes. +4. `spinnerSuccess(message?: string)`: Stops the spinner and marks it as successful with an optional success message. It only takes effect if the spinner is currently spinning. + + Example usage: + ```javascript + spinnerSuccess('Data loaded successfully'); + ``` + +5. `spinnerInfo(message: string)`: Displays an informational message without affecting the spinner's state. + + Example usage: + ```javascript + spinnerInfo('Connecting to server...'); + ``` + +In the larger project, this module can be used to provide visual feedback to users when a background process is running, such as loading data, connecting to a server, or performing a complex calculation. By using the exported functions, developers can easily update the spinner's text, stop it, or change its state to indicate success, failure, or display informational messages. diff --git a/.autodoc/docs/markdown/src/cli/utils/APIRateLimit.md b/.autodoc/docs/markdown/src/cli/utils/APIRateLimit.md index 79d41f4..628688c 100644 --- a/.autodoc/docs/markdown/src/cli/utils/APIRateLimit.md +++ b/.autodoc/docs/markdown/src/cli/utils/APIRateLimit.md @@ -1,34 +1,28 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/utils/APIRateLimit.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\cli\utils\APIRateLimit.ts) -The `APIRateLimit` class in this code snippet is designed to manage and limit the number of concurrent API calls made by the application. This is useful in situations where the API being called has a rate limit or when the application needs to control the number of simultaneous requests to avoid overloading the server. +The `APIRateLimit` class in this code snippet is designed to manage and limit the number of concurrent API calls made by the application. This is useful in situations where the API being called has a rate limit or when the application needs to prevent overwhelming the server with too many requests at once. -The class has a constructor that takes an optional `maxConcurrentCalls` parameter, which defaults to 50. This parameter determines the maximum number of API calls that can be made concurrently. +The class constructor takes an optional parameter `maxConcurrentCalls`, which defaults to 50, to set the maximum number of concurrent API calls allowed. It maintains a queue of API calls and keeps track of the number of calls in progress. -The main method of this class is `callApi(apiFunction: () => Promise): Promise`. This method takes a function `apiFunction` that returns a promise and wraps it in a rate-limited execution. The method returns a promise that resolves with the result of the API call or rejects with an error if the call fails. +The main method of this class is `callApi(apiFunction: () => Promise): Promise`. It takes a function `apiFunction` that returns a promise and wraps it in a new promise. The purpose of this wrapping is to control the execution of the API calls and ensure that they do not exceed the specified rate limit. -When `callApi` is called, it adds the `executeCall` function to the `queue`. The `executeCall` function is responsible for executing the API call, resolving or rejecting the promise, and managing the `inProgress` counter. After adding the `executeCall` function to the queue, the code checks if there are available slots for concurrent calls by comparing `inProgress` with `maxConcurrentCalls`. If there are available slots, it calls the `dequeueAndExecute` method. +When `callApi` is called, the provided `apiFunction` is added to the queue and the `dequeueAndExecute` method is triggered if there are available slots for concurrent calls. The `dequeueAndExecute` method checks if there are any API calls in the queue and if the number of in-progress calls is below the maximum limit. If both conditions are met, it dequeues the next API call and executes it. -The `dequeueAndExecute` method is responsible for executing the queued API calls while ensuring that the number of concurrent calls does not exceed the `maxConcurrentCalls` limit. It dequeues the next API call from the queue and executes it if there are available slots for concurrent calls. +The `executeCall` function inside `callApi` is responsible for actually calling the API function, resolving or rejecting the promise based on the result, and updating the number of in-progress calls. Once an API call is completed, the `dequeueAndExecute` method is called again to process any remaining calls in the queue. Here's an example of how this class can be used in the larger project: ```javascript const apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls -async function fetchData(id) { - // Simulate an API call - return new Promise((resolve) => setTimeout(() => resolve(`Data for ${id}`), 1000)); +async function fetchSomeData(id) { + // Call the API using the rate limiter + const result = await apiRateLimiter.callApi(() => fetch(`https://api.example.com/data/${id}`)); + return result; } - -async function getData(id) { - return apiRateLimiter.callApi(() => fetchData(id)); -} - -// Usage -getData(1).then(console.log); // Fetches data for ID 1, rate-limited ``` -In this example, the `APIRateLimit` class is used to limit the number of concurrent calls to the `fetchData` function, which simulates an API call. +In this example, the `APIRateLimit` class is used to limit the number of concurrent calls to the `fetch` function, ensuring that no more than 10 calls are made at once. ## Questions: 1. **What is the purpose of the `APIRateLimit` class?** @@ -36,8 +30,8 @@ In this example, the `APIRateLimit` class is used to limit the number of concurr 2. **How does the `callApi` method work and what is its return type?** - The `callApi` method takes an `apiFunction` as an argument, which is a function that returns a Promise. It adds the API call to a queue and manages the execution of queued calls based on the available slots for concurrent calls. The method returns a Promise of type `T`, where `T` is the expected return type of the `apiFunction`. + The `callApi` method takes an `apiFunction` as an argument, which is a function that returns a Promise. It adds the API call to a queue and executes it when there are available slots for concurrent calls. The method returns a Promise of type `T`, where `T` is the expected return type of the `apiFunction`. -3. **How does the `dequeueAndExecute` method work?** +3. **How can the maximum number of concurrent calls be configured?** - The `dequeueAndExecute` method is responsible for executing the queued API calls. It checks if there are any calls in the queue and if there are available slots for concurrent calls. If both conditions are met, it dequeues the next call from the queue and executes it. This method is called whenever a new API call is added to the queue or when an in-progress call is completed. \ No newline at end of file + The maximum number of concurrent calls can be configured by passing a value to the `maxConcurrentCalls` parameter in the constructor of the `APIRateLimit` class. If no value is provided, the default value is set to 50. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/cli/utils/FileUtil.md b/.autodoc/docs/markdown/src/cli/utils/FileUtil.md index 2f213f0..c2b665c 100644 --- a/.autodoc/docs/markdown/src/cli/utils/FileUtil.md +++ b/.autodoc/docs/markdown/src/cli/utils/FileUtil.md @@ -1,38 +1,44 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/utils/FileUtil.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\cli\utils\FileUtil.ts) -This code provides utility functions for handling file and folder paths in the autodoc project. The main purpose of these functions is to generate file names and GitHub URLs for files and folders. +This code provides utility functions for handling file and folder paths in the autodoc project. The main purpose of these functions is to generate file names and GitHub URLs for documentation files. -1. `getFileName(input: string, delimiter = '.', extension = '.md'): string`: This function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns a new file name with the given extension. If the delimiter is not found in the input string, the function appends the extension to the input string. If the delimiter is found, the function replaces the part after the last delimiter with the extension. For example: +1. `getFileName(input, delimiter, extension)`: This function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns a new string with the given extension. If the delimiter is found in the input string, the function removes the part of the string after the last occurrence of the delimiter and appends the extension. If the delimiter is not found, the function simply appends the extension to the input string. This function can be used to generate file names for documentation files with the desired extension. - ```javascript - getFileName("example.txt"); // returns "example.md" - getFileName("example"); // returns "example.md" + Example usage: + + ``` + getFileName('example.txt'); // returns 'example.md' + getFileName('example', '_', '.html'); // returns 'example.html' ``` -2. `githubFileUrl(githubRoot: string, inputRoot: string, filePath: string, linkHosted: boolean): string`: This function generates a GitHub URL for a file. It takes the GitHub root URL, the input root path, the file path, and a boolean flag `linkHosted`. If `linkHosted` is true, the function returns a URL pointing to the hosted version of the file. If `linkHosted` is false, the function returns a URL pointing to the file in the GitHub repository. For example: +2. `githubFileUrl(githubRoot, inputRoot, filePath, linkHosted)`: This function generates a GitHub URL for a file. It takes the GitHub repository root URL, the input root folder path, the file path, and a boolean flag indicating whether the URL should be for the hosted version of the file or the source code. It returns a string with the generated URL. + + Example usage: - ```javascript - githubFileUrl("https://github.com/user/repo", "/input", "/input/example.md", true); // returns "https://github.com/user/repo/example.md" - githubFileUrl("https://github.com/user/repo", "/input", "/input/example.md", false); // returns "https://github.com/user/repo/blob/master/example.md" ``` + githubFileUrl('https://github.com/user/repo', '/input', '/input/example.md', true); + // returns 'https://github.com/user/repo/example.md' + ``` + +3. `githubFolderUrl(githubRoot, inputRoot, folderPath, linkHosted)`: This function is similar to `githubFileUrl`, but it generates a GitHub URL for a folder instead of a file. It takes the same arguments as `githubFileUrl` and returns a string with the generated URL. -3. `githubFolderUrl(githubRoot: string, inputRoot: string, folderPath: string, linkHosted: boolean): string`: This function is similar to `githubFileUrl`, but it generates a GitHub URL for a folder instead of a file. If `linkHosted` is true, the function returns a URL pointing to the hosted version of the folder. If `linkHosted` is false, the function returns a URL pointing to the folder in the GitHub repository. For example: + Example usage: - ```javascript - githubFolderUrl("https://github.com/user/repo", "/input", "/input/folder", true); // returns "https://github.com/user/repo/folder" - githubFolderUrl("https://github.com/user/repo", "/input", "/input/folder", false); // returns "https://github.com/user/repo/tree/master/folder" + ``` + githubFolderUrl('https://github.com/user/repo', '/input', '/input/folder', true); + // returns 'https://github.com/user/repo/folder' ``` -These utility functions can be used in the autodoc project to generate file names and URLs for documentation files and folders, making it easier to manage and navigate the documentation structure. +These utility functions can be used throughout the autodoc project to generate file names and GitHub URLs for documentation files and folders, ensuring consistent naming and URL generation across the project. ## Questions: - 1. **What does the `getFileName` function do?** + 1. **What is the purpose of the `getFileName` function?** - The `getFileName` function takes an input string, an optional delimiter (default is '.'), and an optional extension (default is '.md'). It returns the input string with the specified extension, replacing the part after the last occurrence of the delimiter if it exists. + The `getFileName` function takes an input string, an optional delimiter, and an optional extension, and returns a new string with the given extension. If the delimiter is not found in the input string, the extension is simply appended to the input string. If the delimiter is found, the input string is sliced up to the last delimiter index and the extension is appended. -2. **What is the purpose of the `githubFileUrl` and `githubFolderUrl` functions?** +2. **What are the differences between the `githubFileUrl` and `githubFolderUrl` functions?** - Both `githubFileUrl` and `githubFolderUrl` functions are used to generate URLs for files and folders, respectively, in a GitHub repository. They take a `githubRoot`, `inputRoot`, a `filePath` or `folderPath`, and a `linkHosted` boolean flag. If `linkHosted` is true, the generated URL will point to the hosted version of the file or folder; otherwise, it will point to the file or folder in the GitHub repository. + Both functions take the same parameters: `githubRoot`, `inputRoot`, a path (either `filePath` or `folderPath`), and a `linkHosted` boolean. The main difference is in the returned URL: `githubFileUrl` returns a URL pointing to a file in the GitHub repository, while `githubFolderUrl` returns a URL pointing to a folder in the GitHub repository. The URL structure differs slightly, with `/blob/master/` for files and `/tree/master/` for folders. -3. **Why is the `inputRoot.length - 1` used in the `substring` method for both `githubFileUrl` and `githubFolderUrl` functions?** +3. **What is the purpose of the `linkHosted` parameter in the `githubFileUrl` and `githubFolderUrl` functions?** - The `inputRoot.length - 1` is used to remove the `inputRoot` part from the `filePath` or `folderPath` when generating the final URL. This ensures that the generated URL only contains the relevant path relative to the GitHub repository root. \ No newline at end of file + The `linkHosted` parameter is a boolean that determines whether the returned URL should point to the hosted version of the file or folder on GitHub Pages (if `true`) or to the file or folder within the GitHub repository itself (if `false`). Depending on the value of `linkHosted`, the functions will return different URL structures. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/cli/utils/LLMUtil.md b/.autodoc/docs/markdown/src/cli/utils/LLMUtil.md index b307c9d..aa0ca5c 100644 --- a/.autodoc/docs/markdown/src/cli/utils/LLMUtil.md +++ b/.autodoc/docs/markdown/src/cli/utils/LLMUtil.md @@ -1,35 +1,41 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/utils/LLMUtil.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\cli\utils\LLMUtil.ts) -This code defines and manages different language models (LLMs) and their associated costs for a project. It imports the `OpenAIChat` class from the `langchain/llms` module and the `LLMModelDetails` and `LLMModels` types from the `../../types.js` file. +This code defines and manages different language models (LLMs) and their associated costs for a project that utilizes OpenAI's GPT models. It imports the `OpenAIChat` class from the `langchain/llms` module and the `LLMModelDetails` and `LLMModels` types from the `../../types.js` file. -The `models` object contains three LLMs: GPT3, GPT4, and GPT432k. Each model has a set of properties, such as `name`, `inputCostPer1KTokens`, `outputCostPer1KTokens`, `maxLength`, and an instance of `OpenAIChat` with specific configurations. The `inputTokens`, `outputTokens`, `succeeded`, `failed`, and `total` properties are initialized to 0. +The `models` object contains three LLMs: GPT3, GPT4, and GPT432k. Each model has its own properties, such as `name`, `inputCostPer1KTokens`, `outputCostPer1KTokens`, `maxLength`, and an instance of the `OpenAIChat` class with the respective model name and API key. Additionally, each model has counters for input tokens, output tokens, succeeded, failed, and total files processed. + +The `printModelDetails` function takes an array of `LLMModelDetails` and prints a summary table to the console. It calculates the total cost for each model based on the input and output tokens and their respective costs per 1,000 tokens. It also calculates the total file count, succeeded, failed, tokens, and cost across all models. + +The `totalIndexCostEstimate` function calculates the total cost of indexing all models in the input array. It uses the same cost calculation as in `printModelDetails` but returns the total cost as a number. + +These functions can be used in the larger project to manage and analyze the usage and costs of different LLMs. For example, the `printModelDetails` function can be called to display a summary of the models' usage and costs: ```javascript -{ - name: LLMModels.GPT3, - inputCostPer1KTokens: 0.002, - outputCostPer1KTokens: 0.002, - maxLength: 3050, - llm: new OpenAIChat({ ... }), - inputTokens: 0, - outputTokens: 0, - succeeded: 0, - failed: 0, - total: 0, -} +import { models, printModelDetails } from './path/to/this/file'; + +// Process files with models... +// Update models' properties... + +printModelDetails(Object.values(models)); ``` -The `printModelDetails` function takes an array of `LLMModelDetails` and prints a summary table to the console. It calculates the total cost for each model based on the number of input and output tokens and their respective costs per 1,000 tokens. It also calculates the total file count, succeeded, failed, tokens, and cost across all models. +And the `totalIndexCostEstimate` function can be used to estimate the total cost of indexing all models: + +```javascript +import { models, totalIndexCostEstimate } from './path/to/this/file'; -The `totalIndexCostEstimate` function calculates the total cost for all models in the input array. It uses the same cost calculation as in `printModelDetails` but returns the total cost as a number. +// Process files with models... +// Update models' properties... -These functions can be used in the larger project to manage and analyze the usage and costs of different language models. For example, the `printModelDetails` function can provide a summary of the project's LLM usage, while the `totalIndexCostEstimate` function can help estimate the overall cost of using these models. +const totalCost = totalIndexCostEstimate(Object.values(models)); +console.log(`Total cost: ${totalCost}`); +``` ## Questions: - 1. **Question**: What is the purpose of the `models` object and what are the different models available? - **Answer**: The `models` object is a record that maps the available LLMModels (GPT3, GPT4, and GPT432k) to their respective details, such as name, input and output costs, maxLength, and an instance of OpenAIChat with the corresponding model. + 1. **Question:** What is the purpose of the `models` object and how are the different GPT models being used? + **Answer:** The `models` object is a record that maps different GPT models (GPT3, GPT4, and GPT432k) to their respective details, such as cost per tokens, maximum length, and an instance of `OpenAIChat` with the corresponding model configuration. -2. **Question**: How does the `printModelDetails` function work and what information does it display? - **Answer**: The `printModelDetails` function takes an array of LLMModelDetails and generates an output object containing the model name, file count, succeeded, failed, tokens, and cost. It then calculates the totals for each property and displays the information in a console table. +2. **Question:** How does the `printModelDetails` function work and what information does it display? + **Answer:** The `printModelDetails` function takes an array of `LLMModelDetails` as input, processes the information for each model, and then prints a summary table to the console. The table includes the model name, file count, succeeded and failed counts, total tokens, and cost. -3. **Question**: What is the purpose of the `totalIndexCostEstimate` function and how does it calculate the total cost? - **Answer**: The `totalIndexCostEstimate` function calculates the total cost of indexing the given models by iterating through the models array and summing up the input and output costs per 1K tokens for each model. \ No newline at end of file +3. **Question:** What is the purpose of the `totalIndexCostEstimate` function and how is it calculating the total cost? + **Answer:** The `totalIndexCostEstimate` function calculates the total cost of processing the given models by iterating through the input `models` array and summing up the costs based on the input and output tokens and their respective costs per 1K tokens. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/cli/utils/WaitUtil.md b/.autodoc/docs/markdown/src/cli/utils/WaitUtil.md index aa64ac5..001b655 100644 --- a/.autodoc/docs/markdown/src/cli/utils/WaitUtil.md +++ b/.autodoc/docs/markdown/src/cli/utils/WaitUtil.md @@ -1,44 +1,57 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/utils/WaitUtil.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\cli\utils\WaitUtil.ts) -The code in this file provides two utility functions, `wait` and `forTrue`, which are designed to help manage asynchronous operations in the larger project. Both functions return a `Promise`, which is a JavaScript object that represents the eventual completion (or failure) of an asynchronous operation and its resulting value. +The code in this file provides two utility functions, `wait` and `forTrue`, which are designed to help manage asynchronous operations in the larger project. Both functions return a `Promise`, making them suitable for use with `async/await` syntax. -### wait function +### wait -The `wait` function takes two arguments: `timeoutMs`, which is the number of milliseconds to wait before resolving the promise, and `value`, which is an optional value to be returned when the promise resolves. The function creates a new `Promise` and uses `setTimeout` to resolve it with the given `value` after the specified `timeoutMs` has passed. +The `wait` function takes two arguments: `timeoutMs`, a number representing the desired waiting time in milliseconds, and an optional `value` that defaults to `null`. It returns a `Promise` that resolves with the provided `value` after the specified `timeoutMs` has elapsed. This function can be used to introduce a delay in the execution of asynchronous code. Example usage: ```javascript -// Wait for 2 seconds and then log "Hello, world!" -wait(2000, "Hello, world!").then(console.log); +async function delayedEcho() { + console.log("Start"); + await wait(1000, "Hello"); + console.log("End"); +} + +delayedEcho(); // Output: Start -> (1 second delay) -> End ``` -### forTrue function +### forTrue -The `forTrue` function takes a single argument, `fn`, which is a function that returns a boolean value. The purpose of this function is to repeatedly check if the given function `fn` returns `true`. If it does, the promise resolves with `true`. If the function does not return `true` after 200 checks, the promise is rejected. +The `forTrue` function takes a single argument, `fn`, which is a function that returns a boolean value. It returns a `Promise` that resolves with `true` when the provided function `fn` returns `true`. The function `fn` is checked every 50 milliseconds, up to a maximum of 200 times (i.e., 10 seconds). If `fn` does not return `true` within this time, the `Promise` is rejected. -The function uses `setInterval` to repeatedly call the given function `fn` every 50 milliseconds. If `fn` returns `true`, the interval is cleared, and the promise is resolved. If the function has been called 200 times without returning `true`, the promise is rejected. +This function can be used to wait for a specific condition to be met before continuing the execution of asynchronous code. Example usage: ```javascript -// Check if a certain element is visible on the page -const isElementVisible = () => document.querySelector("#my-element").offsetParent !== null; +let condition = false; + +setTimeout(() => { + condition = true; +}, 3000); + +async function waitForCondition() { + console.log("Waiting for condition..."); + await forTrue(() => condition); + console.log("Condition met!"); +} -// Wait for the element to become visible, then log "Element is visible!" -forTrue(isElementVisible).then(() => console.log("Element is visible!")); +waitForCondition(); // Output: Waiting for condition... -> (3 second delay) -> Condition met! ``` -In summary, these utility functions help manage asynchronous operations by providing a way to wait for a certain amount of time or for a specific condition to be met. They can be used in various parts of the larger project to handle timing and conditional logic in an asynchronous manner. +In summary, this file provides two utility functions that help manage asynchronous operations by introducing delays and waiting for specific conditions to be met. These functions can be used in the larger project to control the flow of asynchronous code execution. ## Questions: 1. **What is the purpose of the `wait` function?** - The `wait` function is an asynchronous utility function that resolves a promise after a specified timeout in milliseconds. It can be used to introduce a delay in the execution of asynchronous code. + The `wait` function is an asynchronous utility function that resolves a promise after a specified timeout in milliseconds, optionally returning a value when the promise is resolved. -2. **How does the `forTrue` function work and what is its use case?** +2. **How does the `forTrue` function work?** - The `forTrue` function takes a function `fn` as an argument, which returns a boolean value. It repeatedly checks the result of `fn` every 50 milliseconds until it returns `true` or the maximum number of checks (200) is reached. This function can be used to wait for a specific condition to be met before proceeding with the execution of asynchronous code. + The `forTrue` function takes a function `fn` as an argument, which should return a boolean value. It checks the result of `fn` every 50 milliseconds and resolves the promise when `fn` returns `true`. If `fn` does not return `true` after 200 attempts, the promise is rejected. -3. **Is there any error handling or customization for the `forTrue` function, such as customizing the interval or maximum number of checks?** +3. **What is the use case for the `forTrue` function?** - Currently, there is no error handling or customization options for the `forTrue` function. The interval is hardcoded to 50 milliseconds, and the maximum number of checks is hardcoded to 200. To add customization, additional parameters could be added to the function signature and used in the implementation. \ No newline at end of file + The `forTrue` function can be used to wait for a certain condition to be met before proceeding with the execution of the code. This can be useful in situations where you need to wait for an asynchronous operation to complete or a specific state to be reached before continuing. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/cli/utils/summary.md b/.autodoc/docs/markdown/src/cli/utils/summary.md index 3815d86..8f3926f 100644 --- a/.autodoc/docs/markdown/src/cli/utils/summary.md +++ b/.autodoc/docs/markdown/src/cli/utils/summary.md @@ -1,48 +1,59 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/cli/utils) +[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\docs\json\src\cli\utils) -The code in the `.autodoc/docs/json/src/cli/utils` folder provides utility functions and classes that help manage various aspects of the autodoc project, such as rate-limiting API calls, handling file and folder paths, managing language models, and traversing file systems. +The `.autodoc\docs\json\src\cli\utils` folder contains utility functions and classes that assist in managing API rate limits, handling file and folder paths, managing language models, traversing file systems, and controlling asynchronous operations. These utilities can be used throughout the autodoc project to ensure consistent behavior and improve code organization. -`APIRateLimit.ts` contains the `APIRateLimit` class, which is designed to manage and limit the number of concurrent API calls made by the application. This is useful when the API being called has a rate limit or when the application needs to control the number of simultaneous requests to avoid overloading the server. For example: +`APIRateLimit.ts` provides the `APIRateLimit` class, which manages and limits the number of concurrent API calls made by the application. This is useful when working with rate-limited APIs or preventing server overload. Example usage: ```javascript const apiRateLimiter = new APIRateLimit(10); // Limit to 10 concurrent calls -async function getData(id) { - return apiRateLimiter.callApi(() => fetchData(id)); +async function fetchSomeData(id) { + const result = await apiRateLimiter.callApi(() => fetch(`https://api.example.com/data/${id}`)); + return result; } -getData(1).then(console.log); // Fetches data for ID 1, rate-limited ``` -`FileUtil.ts` provides utility functions for handling file and folder paths, such as generating file names and GitHub URLs for files and folders. These functions can be used to manage and navigate the documentation structure. For example: +`FileUtil.ts` offers utility functions for generating file names and GitHub URLs for documentation files. These functions ensure consistent naming and URL generation across the project. Example usage: ```javascript -getFileName("example.txt"); // returns "example.md" -githubFileUrl("https://github.com/user/repo", "/input", "/input/example.md", true); // returns "https://github.com/user/repo/example.md" +getFileName('example.txt'); // returns 'example.md' +githubFileUrl('https://github.com/user/repo', '/input', '/input/example.md', true); // returns 'https://github.com/user/repo/example.md' ``` -`LLMUtil.ts` defines and manages different language models (LLMs) and their associated costs for a project. It provides functions like `printModelDetails` and `totalIndexCostEstimate` to manage and analyze the usage and costs of different language models. For example, the `printModelDetails` function can provide a summary of the project's LLM usage, while the `totalIndexCostEstimate` function can help estimate the overall cost of using these models. - -`WaitUtil.ts` provides two utility functions, `wait` and `forTrue`, which help manage asynchronous operations in the larger project. They can be used in various parts of the project to handle timing and conditional logic in an asynchronous manner. For example: +`LLMUtil.ts` defines and manages different language models (LLMs) and their associated costs for a project utilizing OpenAI's GPT models. Functions like `printModelDetails` and `totalIndexCostEstimate` can be used to manage and analyze the usage and costs of different LLMs. Example usage: ```javascript -wait(2000, "Hello, world!").then(console.log); // Waits for 2 seconds and then logs "Hello, world!" -forTrue(isElementVisible).then(() => console.log("Element is visible!")); // Waits for an element to become visible, then logs "Element is visible!" +import { models, printModelDetails } from './path/to/this/file'; +printModelDetails(Object.values(models)); +const totalCost = totalIndexCostEstimate(Object.values(models)); +console.log(`Total cost: ${totalCost}`); ``` -`traverseFileSystem.ts` contains the `traverseFileSystem` function, which recursively traverses a given file system, processes folders and files, and filters out ignored files based on provided patterns. It is designed to be used for processing and generating documentation for a given project. For example: +`traverseFileSystem.ts` contains the `traverseFileSystem` function, which recursively traverses a given file system, processing files and folders based on provided parameters. This is useful for generating documentation or performing tasks that require processing files and folders in a directory structure. Example usage: ```javascript -const params = { - inputPath: './myProject', - projectName: 'My Project', +await traverseFileSystem({ + inputPath: './src', + projectName: 'myProject', + processFile: (params) => { /* Process file logic */ }, + processFolder: (params) => { /* Process folder logic */ }, ignore: ['node_modules/**', '.git/**'], - processFile: async (fileInfo) => { - // Process the file, e.g., generate documentation - }, - processFolder: async (folderInfo) => { - // Process the folder, e.g., create a folder in the output directory - }, -}; -traverseFileSystem(params); +}); +``` + +`WaitUtil.ts` provides two utility functions, `wait` and `forTrue`, which help manage asynchronous operations by introducing delays and waiting for specific conditions to be met. These functions can be used to control the flow of asynchronous code execution. Example usage: + +```javascript +async function delayedEcho() { + console.log("Start"); + await wait(1000, "Hello"); + console.log("End"); +} + +async function waitForCondition() { + console.log("Waiting for condition..."); + await forTrue(() => condition); + console.log("Condition met!"); +} ``` -In summary, the code in this folder provides various utility functions and classes that help manage different aspects of the autodoc project, making it easier to handle tasks such as rate-limiting, file and folder management, language model management, asynchronous operations, and file system traversal. +In summary, the utilities in this folder enhance the autodoc project by providing consistent behavior, improving code organization, and managing various aspects of the project, such as API rate limits, file and folder paths, language models, file system traversal, and asynchronous operations. diff --git a/.autodoc/docs/markdown/src/cli/utils/traverseFileSystem.md b/.autodoc/docs/markdown/src/cli/utils/traverseFileSystem.md index 573fcf0..e8c62aa 100644 --- a/.autodoc/docs/markdown/src/cli/utils/traverseFileSystem.md +++ b/.autodoc/docs/markdown/src/cli/utils/traverseFileSystem.md @@ -1,54 +1,49 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/cli/utils/traverseFileSystem.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\cli\utils\traverseFileSystem.ts) -The `traverseFileSystem` function in this code is an asynchronous function that recursively traverses a given file system, processes folders and files, and filters out ignored files based on provided patterns. It is designed to be used in the larger project for processing and generating documentation for a given project. +The `traverseFileSystem` function in this code is an asynchronous function that recursively traverses a given file system, processing files and folders based on the provided parameters. It is designed to be used in the larger project for generating documentation or performing other tasks that require processing files and folders in a directory structure. -The function takes an object of type `TraverseFileSystemParams` as its input, which contains the following properties: +The function takes an object of type `TraverseFileSystemParams` as its input, which contains various properties to control the traversal and processing behavior. These properties include: -- `inputPath`: The root folder path to start traversing. -- `projectName`: The name of the project being documented. -- `processFile`: An optional callback function to process files. -- `processFolder`: An optional callback function to process folders. -- `ignore`: An array of patterns to ignore files and folders. -- `filePrompt`: An optional prompt for processing files. -- `folderPrompt`: An optional prompt for processing folders. -- `contentType`: The type of content being processed. -- `targetAudience`: The target audience for the documentation. -- `linkHosted`: A flag indicating if the documentation should be linked to a hosted version. +- `inputPath`: The root path to start the traversal from. +- `projectName`: The name of the project being processed. +- `processFile`: An optional callback function to process a file. +- `processFolder`: An optional callback function to process a folder. +- `ignore`: An array of patterns to ignore during traversal. +- `filePrompt`, `folderPrompt`: Optional prompts for user interaction. +- `contentType`, `targetAudience`, `linkHosted`: Additional metadata for processing. -The function first checks if the provided `inputPath` exists. If not, it logs an error message and returns. It then defines a helper function `shouldIgnore` that checks if a given file or folder should be ignored based on the `ignore` patterns. +The function first checks if the provided `inputPath` exists using `fs.access`. If the path does not exist, it logs an error message and returns. It then defines a helper function `shouldIgnore` that checks if a given file or folder should be ignored based on the `ignore` patterns. -The main logic of the function is implemented in the `dfs` (depth-first search) function, which recursively traverses the file system. It reads the contents of the current folder, filters out ignored files and folders, and processes them accordingly. If an entry is a directory, it calls `dfs` recursively and then calls the `processFolder` callback if provided. If an entry is a file and is a text file, it calls the `processFile` callback if provided. +The main logic of the function is implemented in the `dfs` (depth-first search) function, which is called recursively to traverse the file system. It reads the contents of the current directory using `fs.readdir`, filters out ignored items, and processes the remaining items. + +For each item, if it is a directory, the `dfs` function is called recursively, and the `processFolder` callback is invoked if provided. If it is a file and its content is text (checked using `isText`), the `processFile` callback is invoked if provided. + +The traversal is performed using `Promise.all` to process items concurrently, improving performance. If an error occurs during traversal, it is logged and rethrown. Here's an example of how this function might be used in the larger project: ```javascript -import { traverseFileSystem } from './autodoc'; - -const params = { - inputPath: './myProject', - projectName: 'My Project', - ignore: ['node_modules/**', '.git/**'], - processFile: async (fileInfo) => { - // Process the file, e.g., generate documentation +await traverseFileSystem({ + inputPath: './src', + projectName: 'myProject', + processFile: (params) => { + // Process file logic here }, - processFolder: async (folderInfo) => { - // Process the folder, e.g., create a folder in the output directory + processFolder: (params) => { + // Process folder logic here }, -}; - -traverseFileSystem(params); + ignore: ['node_modules/**', '.git/**'], +}); ``` - -This example would traverse the `myProject` folder, ignoring any files and folders within `node_modules` and `.git`, and process the remaining files and folders using the provided callback functions. ## Questions: 1. **What is the purpose of the `traverseFileSystem` function?** - The `traverseFileSystem` function is an asynchronous function that traverses a given file system, processes files and folders based on the provided parameters, and ignores files and folders that match the specified ignore patterns. + The `traverseFileSystem` function is an asynchronous function that traverses a given file system, processes folders and files based on the provided parameters, and ignores files and folders based on the given ignore patterns. 2. **How does the `shouldIgnore` function work?** - The `shouldIgnore` function takes a file or folder name as input and returns a boolean value indicating whether the file or folder should be ignored based on the provided ignore patterns. It uses the `minimatch` library to check if the file or folder name matches any of the ignore patterns. + The `shouldIgnore` function takes a file name as input and returns a boolean value indicating whether the file should be ignored or not. It checks if the file name matches any of the ignore patterns provided in the `ignore` parameter using the `minimatch` library. 3. **What is the role of the `dfs` function inside `traverseFileSystem`?** - The `dfs` function is an asynchronous function that performs a depth-first search on the file system starting from the given `currentPath`. It processes folders and files based on the provided parameters and recursively calls itself for each subdirectory. \ No newline at end of file + The `dfs` function is an asynchronous function that performs a depth-first search on the file system starting from the given `currentPath`. It processes folders and files based on the provided parameters and recursively calls itself for each subdirectory found. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/const.md b/.autodoc/docs/markdown/src/const.md index fceb891..31c2de3 100644 --- a/.autodoc/docs/markdown/src/const.md +++ b/.autodoc/docs/markdown/src/const.md @@ -1,27 +1,35 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/const.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\const.ts) -The code in this file is responsible for managing the user configuration file for the Autodoc project. It imports two Node.js built-in modules, `path` and `os`, which are used to handle file paths and operating system-related utility functions, respectively. +The code in this file is responsible for managing the user configuration file for the autodoc project. It imports two Node.js built-in modules, `path` and `os`, which are used to handle file paths and operating system-related utility functions, respectively. -The `userConfigFileName` constant is defined as `'autodoc.user.json'`. This constant represents the name of the user configuration file that will be used by the Autodoc project. +The `userConfigFileName` constant is defined as `'autodoc.user.json'`, which represents the name of the user configuration file. This file is expected to store user-specific settings for the autodoc project in JSON format. -The `userConfigFilePath` constant is created using the `path.resolve()` function, which resolves a sequence of paths into an absolute path. It takes three arguments: +The `userConfigFilePath` constant is created using the `path.resolve()` function, which combines the provided arguments into an absolute file path. The `os.homedir()` function is used to get the current user's home directory, and `./.config/autodoc/` is appended to it as the folder where the user configuration file should be stored. Finally, the `userConfigFileName` constant is appended to the path, resulting in the complete file path for the user configuration file. -1. `os.homedir()`: This function returns the current user's home directory. It ensures that the user configuration file is stored in the user's home directory, making it user-specific. -2. `'./.config/autodoc/'`: This string specifies the subdirectory within the user's home directory where the configuration file will be stored. The `.config` directory is a common location for storing configuration files on Unix-based systems, and the `autodoc` subdirectory is used to keep the Autodoc configuration files organized. -3. `userConfigFileName`: This constant is used as the file name for the user configuration file. +By exporting both `userConfigFileName` and `userConfigFilePath`, other parts of the autodoc project can easily access and use these constants to read or write user-specific settings. For example, when the autodoc application starts, it can read the user configuration file from the specified path, and apply the settings accordingly. -The `userConfigFilePath` constant will store the absolute path to the user configuration file, which can be used by other parts of the Autodoc project to read or write user-specific settings. +Here's a code example of how these constants might be used in another part of the autodoc project: -In summary, this code is responsible for defining the location and name of the user configuration file for the Autodoc project. It ensures that the configuration file is stored in a user-specific directory and follows a standard naming convention. This allows the Autodoc project to easily manage user-specific settings and preferences. +```javascript +import { userConfigFilePath } from './path/to/this/file'; + +// Read user configuration from the file +const userConfig = JSON.parse(fs.readFileSync(userConfigFilePath, 'utf-8')); + +// Apply user settings +applyUserSettings(userConfig); +``` + +In summary, this code is responsible for defining the name and file path of the user configuration file for the autodoc project, allowing other parts of the project to easily access and manage user-specific settings. ## Questions: 1. **What is the purpose of the `userConfigFileName` and `userConfigFilePath` constants?** The `userConfigFileName` constant defines the name of the user configuration file for the autodoc project, while the `userConfigFilePath` constant defines the absolute path to this file, which is located in the user's home directory under the `.config/autodoc/` folder. -2. **Why are the `node:path` and `node:os` modules imported?** +2. **Why are the `node:path` and `node:os` modules being imported?** - The `node:path` module is imported to provide utilities for working with file and directory paths, such as the `path.resolve()` function used to construct the `userConfigFilePath`. The `node:os` module is imported to provide operating system-related utility methods, such as `os.homedir()` which returns the current user's home directory. + The `node:path` module is imported to provide utilities for working with file and directory paths, such as resolving the absolute path to the user configuration file. The `node:os` module is imported to provide operating system-related utility methods, such as getting the user's home directory. 3. **Is this code compatible with different operating systems?** - Yes, this code is compatible with different operating systems. The `os.homedir()` function from the `node:os` module returns the correct home directory path for the current user, regardless of the operating system. Additionally, the `path.resolve()` function from the `node:path` module handles path separators and other OS-specific details, ensuring the correct file path is generated. \ No newline at end of file + Yes, this code is compatible with different operating systems. The `os.homedir()` method returns the home directory of the current user, which is platform-specific, and the `path.resolve()` method takes care of handling the correct path separators for the current operating system. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/index.md b/.autodoc/docs/markdown/src/index.md index e178bd7..217e90f 100644 --- a/.autodoc/docs/markdown/src/index.md +++ b/.autodoc/docs/markdown/src/index.md @@ -1,6 +1,8 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/index.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\index.ts) -The code is a CLI (Command Line Interface) tool for the Autodoc project, which helps in generating documentation for a codebase. It uses the `commander` package to define and manage commands, and `inquirer` for interactive prompts. The main commands supported are `init`, `estimate`, `index`, `user`, and `q`. +This code is the main entry point for the Autodoc CLI tool, which provides a set of commands to help developers automatically generate documentation for their codebase. The tool uses the `commander` library to define and handle commands, and `inquirer` for interactive prompts. + +The available commands are: 1. `init`: Initializes the repository by creating an `autodoc.config.json` file in the current directory. If the file already exists, it uses the existing configuration. ```bash @@ -12,30 +14,33 @@ The code is a CLI (Command Line Interface) tool for the Autodoc project, which h autodoc estimate ``` -3. `index`: Traverses the codebase, writes documentation using LLM (Language Model), and creates a locally stored index. It prompts the user to confirm before starting the indexing process. +3. `index`: Traverses the codebase, writes documentation using LLM, and creates a locally stored index. Before starting the indexing process, it prompts the user for confirmation. It requires the `autodoc.config.json` file to be present. ```bash autodoc index ``` -4. `user`: Sets the Autodoc user configuration. If a user configuration file exists, it uses the existing configuration; otherwise, it creates a new one. +4. `user`: Sets the Autodoc user configuration. If a user configuration file exists, it uses the existing configuration. ```bash autodoc user ``` -5. `q`: Queries an Autodoc index. It requires both `autodoc.config.json` and user configuration files to be present. +5. `q`: Queries an Autodoc index. It requires both the `autodoc.config.json` and user configuration files to be present. ```bash autodoc q ``` -The code also handles unhandled promise rejections by logging the error stack, showing an error spinner, stopping the spinner, and exiting with an error code. +The code also listens for unhandled promise rejections and handles them gracefully by showing an error spinner, stopping the spinner, and exiting with an error code. -Overall, this CLI tool simplifies the process of generating documentation for a codebase by providing an easy-to-use interface for managing configurations and running the Autodoc project's core functionalities. +In the larger project, this CLI tool serves as the primary interface for users to interact with Autodoc, allowing them to easily generate and manage documentation for their codebase. ## Questions: - 1. **Question:** What is the purpose of the `autodoc.config.json` file and how is it used in the code? - **Answer:** The `autodoc.config.json` file is used to store the configuration for the Autodoc repository. It is read and parsed in various commands like `init`, `estimate`, `index`, and `q` to provide the necessary configuration for each command's execution. + 1. **What is the purpose of the Autodoc CLI Tool?** + + The Autodoc CLI Tool is designed to help developers automatically generate documentation for their codebase by traversing the code, writing docs via LLM, and creating a locally stored index. + +2. **How does the `estimate` command work and what does it return?** + + The `estimate` command reads the `autodoc.config.json` file and estimates the cost of running the `index` command on the repository. It provides an estimation of the resources required to generate the documentation. -2. **Question:** How does the `estimate` command work and what does it do? - **Answer:** The `estimate` command reads the `autodoc.config.json` file, parses it into a configuration object, and then calls the `estimate` function with the configuration. The purpose of this command is to estimate the cost of running the `index` command on the repository. +3. **What is the role of the `user` command and how does it interact with the user configuration file?** -3. **Question:** What is the purpose of the `user` command and how does it handle user configuration? - **Answer:** The `user` command is used to set the Autodoc user configuration. It reads the user configuration file specified by `userConfigFilePath`, parses it into a configuration object, and then calls the `user` function with the configuration. If the configuration file is not found, it calls the `user` function without any configuration, allowing the user to set up their configuration. \ No newline at end of file + The `user` command is responsible for setting the Autodoc user configuration. It reads the user configuration file (if it exists) and allows the user to update or create a new configuration. This configuration is then used in other commands, such as the `query` command, to interact with the Autodoc index. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/langchain/hnswlib.md b/.autodoc/docs/markdown/src/langchain/hnswlib.md index 696d07c..5b00af0 100644 --- a/.autodoc/docs/markdown/src/langchain/hnswlib.md +++ b/.autodoc/docs/markdown/src/langchain/hnswlib.md @@ -1,32 +1,36 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/langchain/hnswlib.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\langchain\hnswlib.ts) -The `HNSWLib` class in this code is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. It extends the `SaveableVectorStore` class and provides methods for adding documents, searching for similar documents, and saving/loading the index. +The `HNSWLib` class in this code is a specialized vector store that uses the Hierarchical Navigable Small World (HNSW) algorithm for efficient similarity search. It is built on top of the `hnswlib-node` library and extends the `SaveableVectorStore` class. The main purpose of this class is to store and search for documents based on their embeddings, which are high-dimensional vectors representing the documents' content. -The constructor takes an `Embeddings` object and an `HNSWLibArgs` object as arguments. The `Embeddings` object is used to convert text documents into numerical vectors, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing document metadata. +The constructor of the `HNSWLib` class takes an `Embeddings` object and an `HNSWLibArgs` object as arguments. The `Embeddings` object is used to convert documents into their corresponding vector representations, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing the documents. -The `addDocuments` method takes an array of `Document` objects, converts their text content into numerical vectors using the `Embeddings` object, and adds the vectors to the HNSW index. The `addVectors` method is responsible for initializing the index, resizing it if necessary, and adding the vectors and their corresponding metadata to the `InMemoryDocstore`. +The `addDocuments` method takes an array of `Document` objects, converts them into embeddings using the `Embeddings` object, and adds them to the HNSW index. The `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents along with their similarity scores. -The `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents in the index along with their similarity scores. It checks if the query vector has the correct dimensions and if `k` is within the valid range before performing the search. +The `save` and `load` methods allow for persisting the HNSW index, document store, and configuration options to disk and loading them back into memory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of texts or documents, respectively. -The `save` and `load` methods allow the HNSW index and its associated metadata to be saved to and loaded from a specified directory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of text strings or `Document` objects, respectively. - -Example usage: +Here's an example of how to use the `HNSWLib` class: ```javascript const embeddings = new Embeddings(/* ... */); -const hnswLib = await HNSWLib.fromTexts(texts, metadatas, embeddings); +const args = { space: 'cosine' }; +const hnswLib = new HNSWLib(embeddings, args); + +// Add documents to the index +await hnswLib.addDocuments(documents); -const queryVector = await embeddings.embedText("example query"); -const similarDocuments = await hnswLib.similaritySearchVectorWithScore(queryVector, 5); +// Perform a similarity search +const queryVector = /* ... */; +const k = 10; +const results = await hnswLib.similaritySearchVectorWithScore(queryVector, k); ``` -In the larger project, this class can be used to efficiently store and search for similar documents based on their embeddings, which can be useful for tasks such as document clustering, nearest neighbor search, and recommendation systems. +In the larger project, the `HNSWLib` class can be used to efficiently store and search for documents based on their content similarity, which can be useful for tasks such as document clustering, recommendation systems, or information retrieval. ## Questions: - 1. **Question:** What is the purpose of the `HNSWLib` class and how does it relate to the `SaveableVectorStore` class? - **Answer:** The `HNSWLib` class is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. It extends the `SaveableVectorStore` class, which provides a base class for vector stores that can be saved and loaded from disk. + 1. **Question**: What is the purpose of the `HNSWLib` class and how does it relate to the `SaveableVectorStore` class? + **Answer**: The `HNSWLib` class is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. It extends the `SaveableVectorStore` class, which provides a base class for vector stores that can be saved and loaded from disk. -2. **Question:** How does the `addDocuments` method work and what is its purpose? - **Answer:** The `addDocuments` method takes an array of `Document` objects, extracts their `pageContent`, and embeds them into vectors using the `embedDocuments` method from the `embeddings` object. It then adds these vectors and the corresponding documents to the HNSW index and the `docstore` respectively. +2. **Question**: How does the `addDocuments` method work and what is its purpose? + **Answer**: The `addDocuments` method takes an array of `Document` objects, extracts their `pageContent`, and embeds them using the provided `Embeddings` instance. It then adds the resulting vectors and documents to the HNSW index and the `InMemoryDocstore`, respectively. -3. **Question:** How does the `similaritySearchVectorWithScore` method work and what does it return? - **Answer:** The `similaritySearchVectorWithScore` method takes a query vector and a number `k` as input. It checks if the query vector has the same length as the number of dimensions and if `k` is not greater than the number of elements in the index. It then performs a k-nearest neighbors search on the HNSW index using the query vector and returns an array of `[Document, number]` tuples, where each tuple contains a document from the `docstore` and its corresponding distance score to the query vector. \ No newline at end of file +3. **Question**: How does the `similaritySearchVectorWithScore` method work and what does it return? + **Answer**: The `similaritySearchVectorWithScore` method takes a query vector and a number `k` as input, and searches for the `k` most similar vectors in the HNSW index. It returns an array of tuples, where each tuple contains a `Document` object and its corresponding similarity score to the query vector. \ No newline at end of file diff --git a/.autodoc/docs/markdown/src/langchain/summary.md b/.autodoc/docs/markdown/src/langchain/summary.md index bb0b9a8..e3292fb 100644 --- a/.autodoc/docs/markdown/src/langchain/summary.md +++ b/.autodoc/docs/markdown/src/langchain/summary.md @@ -1,23 +1,29 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src/langchain) +[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\docs\json\src\langchain) -The `hnswlib.ts` file in the `.autodoc/docs/json/src/langchain` folder contains the `HNSWLib` class, which is an implementation of a vector store using the Hierarchical Navigable Small World (HNSW) algorithm from the `hnswlib-node` library. This class is designed to efficiently store and search for similar documents based on their embeddings, making it useful for tasks such as document clustering, nearest neighbor search, and recommendation systems. +The `hnswlib.ts` file in the `.autodoc\docs\json\src\langchain` folder contains the `HNSWLib` class, which is a specialized vector store utilizing the Hierarchical Navigable Small World (HNSW) algorithm for efficient similarity search. This class is built on top of the `hnswlib-node` library and extends the `SaveableVectorStore` class. Its primary purpose is to store and search for documents based on their embeddings, which are high-dimensional vectors representing the documents' content. -The `HNSWLib` class extends the `SaveableVectorStore` class and provides methods for adding documents, searching for similar documents, and saving/loading the index. It takes an `Embeddings` object and an `HNSWLibArgs` object as arguments in its constructor. The `Embeddings` object is responsible for converting text documents into numerical vectors, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing document metadata. +The `HNSWLib` class constructor takes an `Embeddings` object and an `HNSWLibArgs` object as arguments. The `Embeddings` object is responsible for converting documents into their corresponding vector representations, while the `HNSWLibArgs` object contains configuration options for the HNSW index and an optional `InMemoryDocstore` object for storing the documents. -The `addDocuments` method accepts an array of `Document` objects, converts their text content into numerical vectors using the `Embeddings` object, and adds the vectors to the HNSW index. The `addVectors` method initializes the index, resizes it if necessary, and adds the vectors and their corresponding metadata to the `InMemoryDocstore`. +The `addDocuments` method accepts an array of `Document` objects, converts them into embeddings using the `Embeddings` object, and adds them to the HNSW index. The `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents along with their similarity scores. -The `similaritySearchVectorWithScore` method takes a query vector and a number `k`, and returns the top `k` most similar documents in the index along with their similarity scores. It checks if the query vector has the correct dimensions and if `k` is within the valid range before performing the search. +The `save` and `load` methods enable persisting the HNSW index, document store, and configuration options to disk and loading them back into memory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of texts or documents, respectively. -The `save` and `load` methods allow the HNSW index and its associated metadata to be saved to and loaded from a specified directory. The `fromTexts` and `fromDocuments` static methods provide convenient ways to create an `HNSWLib` instance from an array of text strings or `Document` objects, respectively. +In the larger project, the `HNSWLib` class can be employed to efficiently store and search for documents based on their content similarity, which can be beneficial for tasks such as document clustering, recommendation systems, or information retrieval. -Here's an example of how this code might be used: +Here's an example of how to use the `HNSWLib` class: ```javascript const embeddings = new Embeddings(/* ... */); -const hnswLib = await HNSWLib.fromTexts(texts, metadatas, embeddings); +const args = { space: 'cosine' }; +const hnswLib = new HNSWLib(embeddings, args); -const queryVector = await embeddings.embedText("example query"); -const similarDocuments = await hnswLib.similaritySearchVectorWithScore(queryVector, 5); +// Add documents to the index +await hnswLib.addDocuments(documents); + +// Perform a similarity search +const queryVector = /* ... */; +const k = 10; +const results = await hnswLib.similaritySearchVectorWithScore(queryVector, k); ``` -In the larger project, the `HNSWLib` class can be integrated with other components to build efficient and scalable systems for document similarity search, clustering, and recommendations based on text embeddings. +This code snippet demonstrates how to create an `HNSWLib` instance, add documents to the index, and perform a similarity search. The results can then be used for various purposes, such as finding related documents or generating recommendations based on content similarity. diff --git a/.autodoc/docs/markdown/src/summary.md b/.autodoc/docs/markdown/src/summary.md index 30a7739..6d65cdc 100644 --- a/.autodoc/docs/markdown/src/summary.md +++ b/.autodoc/docs/markdown/src/summary.md @@ -1,39 +1,39 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc/docs/json/src) +[View code on GitHub](https://github.com/context-labs/autodoc/.autodoc\docs\json\src) -The `.autodoc/docs/json/src` folder contains the core components of the Autodoc project, which aims to automatically generate documentation for a given code repository using OpenAI's language models (LLMs). The main files in this folder are `const.ts`, `index.ts`, and `types.ts`. +The `.autodoc\docs\json\src` folder contains the core components of the autodoc project, which is designed to automatically generate documentation for a given code repository using OpenAI's language models (LLMs). The folder consists of three main files: `const.ts`, `index.ts`, and `types.ts`, as well as two subfolders: `cli` and `langchain`. -`const.ts` manages the user configuration file for the Autodoc project. It defines the location and name of the user configuration file, ensuring that it is stored in a user-specific directory and follows a standard naming convention. This allows the Autodoc project to easily manage user-specific settings and preferences. +`const.ts` defines the name and file path of the user configuration file for the autodoc project. This file stores user-specific settings in JSON format. Other parts of the project can easily access and use these constants to read or write user-specific settings. For example: -`index.ts` is a CLI (Command Line Interface) tool for the Autodoc project, which simplifies the process of generating documentation for a codebase. It provides an easy-to-use interface for managing configurations and running the Autodoc project's core functionalities. The main commands supported are `init`, `estimate`, `index`, `user`, and `q`. For example: +```javascript +import { userConfigFilePath } from './path/to/this/file'; + +// Read user configuration from the file +const userConfig = JSON.parse(fs.readFileSync(userConfigFilePath, 'utf-8')); -```bash -autodoc init -autodoc estimate -autodoc index -autodoc user -autodoc q +// Apply user settings +applyUserSettings(userConfig); ``` -`types.ts` defines the types and interfaces for the Autodoc project, providing the foundation for processing code repositories and generating documentation using OpenAI's language models. It includes types such as `AutodocUserConfig`, `AutodocRepoConfig`, `FileSummary`, `FolderSummary`, and more. +`index.ts` serves as the main entry point for the Autodoc CLI tool, providing a set of commands for developers to generate and manage documentation for their codebase. The available commands include `init`, `estimate`, `index`, `user`, and `q`. The CLI tool uses the `commander` library for command handling and `inquirer` for interactive prompts. -The `cli` subfolder contains the `spinner.ts` file, which provides a utility for managing a command-line spinner using the `ora` library. This utility can be used to provide a consistent and user-friendly interface for displaying progress and status messages during long-running tasks or processes. For example: +`types.ts` defines the types and interfaces for the autodoc project, such as `AutodocUserConfig`, `AutodocRepoConfig`, `FileSummary`, `FolderSummary`, and more. These types are used to configure and run the autodoc tool, allowing users to generate documentation for their code repositories using OpenAI's LLMs. -```javascript -updateSpinnerText('Loading data...'); -stopSpinner(); -spinnerError('An error occurred'); -spinnerSuccess('Operation completed successfully'); -spinnerInfo('Please wait...'); -``` +The `cli` subfolder contains the `spinner.ts` file, which manages a spinner for visual feedback during background processes. It exports functions like `updateSpinnerText`, `stopSpinner`, `spinnerError`, `spinnerSuccess`, and `spinnerInfo` for easy interaction with the spinner. -The `langchain` subfolder contains the `hnswlib.ts` file, which implements a vector store using the Hierarchical Navigable Small World (HNSW) algorithm. This class is designed to efficiently store and search for similar documents based on their embeddings, making it useful for tasks such as document clustering, nearest neighbor search, and recommendation systems. For example: +The `langchain` subfolder contains the `hnswlib.ts` file, which provides the `HNSWLib` class for efficient similarity search using the Hierarchical Navigable Small World (HNSW) algorithm. This class is used to store and search for documents based on their embeddings, which are high-dimensional vectors representing the documents' content. Example usage: ```javascript const embeddings = new Embeddings(/* ... */); -const hnswLib = await HNSWLib.fromTexts(texts, metadatas, embeddings); +const args = { space: 'cosine' }; +const hnswLib = new HNSWLib(embeddings, args); + +// Add documents to the index +await hnswLib.addDocuments(documents); -const queryVector = await embeddings.embedText("example query"); -const similarDocuments = await hnswLib.similaritySearchVectorWithScore(queryVector, 5); +// Perform a similarity search +const queryVector = /* ... */; +const k = 10; +const results = await hnswLib.similaritySearchVectorWithScore(queryVector, k); ``` -In summary, the code in this folder provides the core components and utilities for the Autodoc project, enabling the automatic generation of documentation for code repositories using OpenAI's language models. The CLI tool simplifies the process, while the types and interfaces lay the foundation for processing and generating documentation. The additional utilities, such as the spinner and HNSWLib, enhance the user experience and provide efficient search capabilities. +In summary, the code in this folder is responsible for the core functionality of the autodoc project, including user configuration management, CLI tool commands, type definitions, spinner management, and efficient similarity search using the HNSW algorithm. diff --git a/.autodoc/docs/markdown/src/types.md b/.autodoc/docs/markdown/src/types.md index 40258f5..7850747 100644 --- a/.autodoc/docs/markdown/src/types.md +++ b/.autodoc/docs/markdown/src/types.md @@ -1,28 +1,27 @@ -[View code on GitHub](https://github.com/context-labs/autodoc/src/types.ts) +[View code on GitHub](https://github.com/context-labs/autodoc/src\types.ts) -This code defines the types and interfaces for the `autodoc` project, which aims to automatically generate documentation for a given code repository. The project uses OpenAI's language models (LLMs) to process and generate summaries, questions, and other relevant information for files and folders within the repository. +This code defines the types and interfaces for the `autodoc` project, which aims to automatically generate documentation for a given code repository. The project uses OpenAI's language models (LLMs) to process and generate summaries, questions, and other relevant information for files and folders in the repository. -The code starts by importing `OpenAIChat` from the `langchain/llms` package. It then defines several types and interfaces that are used throughout the project: +The `AutodocUserConfig` and `AutodocRepoConfig` types define the configuration options for the user and repository, respectively. These include settings such as the LLM models to use, repository URL, output directory, and content type. -- `AutodocUserConfig`: Represents the user configuration for the autodoc project, including the LLM models to be used. -- `AutodocRepoConfig`: Represents the configuration for a specific repository, including its name, URL, root directory, output directory, LLM models, and other settings. -- `FileSummary` and `FolderSummary`: Represent the summaries and questions generated for files and folders, respectively. -- `ProcessFileParams`, `ProcessFolderParams`, and `TraverseFileSystemParams`: Define the parameters for processing files, folders, and traversing the file system, respectively. -- `ProcessFile` and `ProcessFolder`: Define the function types for processing files and folders, respectively. -- `LLMModels`: Enumerates the available LLM models, such as GPT-3.5-turbo, GPT-4, and GPT-4-32k. -- `LLMModelDetails`: Represents the details of an LLM model, including its name, cost per 1K tokens, maximum length, and other statistics. +`FileSummary` and `FolderSummary` types represent the generated summaries for files and folders, including their paths, URLs, and checksums. The `ProcessFileParams` and `ProcessFolderParams` types define the parameters required for processing files and folders, such as the file or folder name, path, and content type. -For example, when using this code in the larger project, you might define a `ProcessFile` function that takes a `ProcessFileParams` object as input and generates a summary and questions for the file using the specified LLM model. Similarly, you could define a `ProcessFolder` function that processes all files and subfolders within a folder, generating summaries and questions for each. +`ProcessFile` and `ProcessFolder` are function types that take the respective parameters and return a promise. These functions are responsible for processing the files and folders, generating summaries, and updating the documentation. -The `TraverseFileSystemParams` type allows you to configure how the file system is traversed, including specifying which files and folders to ignore, and what prompts to use for generating summaries and questions. +`TraverseFileSystemParams` type defines the parameters for traversing the file system, including the input path, project name, and optional `processFile` and `processFolder` functions. It also includes settings for ignoring certain files or folders and content type preferences. -Overall, this code provides the foundation for the `autodoc` project by defining the types and interfaces needed to process code repositories and generate documentation using OpenAI's language models. +The `LLMModels` enum lists the available language models, such as GPT-3.5 Turbo, GPT-4, and GPT-4 32k. The `LLMModelDetails` type provides information about each model, including the cost per 1K tokens, maximum length, and success/failure statistics. + +In the larger project, these types and interfaces would be used to configure and run the `autodoc` tool, allowing users to automatically generate documentation for their code repositories using OpenAI's language models. For example, a user could provide an `AutodocRepoConfig` object to configure the tool, and then use the `TraverseFileSystem` function to process the repository and generate the documentation. ## Questions: - 1. **Question:** What is the purpose of the `LLMModels` enum and how is it used in the code? - **Answer:** The `LLMModels` enum defines the available language models for the autodoc project. It is used in the `AutodocUserConfig` and `AutodocRepoConfig` types to specify which language models should be used for processing files and folders. + 1. **What is the purpose of the `AutodocUserConfig` and `AutodocRepoConfig` types?** + + The `AutodocUserConfig` type is used to define the user configuration for the autodoc project, which includes an array of LLMModels. The `AutodocRepoConfig` type is used to define the repository configuration for the autodoc project, which includes various properties such as name, repository URL, root, output, LLMModels, and more. + +2. **What are the different LLMModels available in the `LLMModels` enum?** + + The `LLMModels` enum lists the available language models for the autodoc project. Currently, there are three models: GPT3 (gpt-3.5-turbo), GPT4 (gpt-4), and GPT432k (gpt-4-32k). -2. **Question:** What are the `ProcessFile` and `ProcessFolder` types and how are they used in the code? - **Answer:** `ProcessFile` and `ProcessFolder` are types for functions that process a file or a folder, respectively. They are used as optional parameters in the `TraverseFileSystemParams` type, allowing developers to provide custom processing functions when traversing the file system. +3. **What is the purpose of the `ProcessFile` and `ProcessFolder` types?** -3. **Question:** What is the purpose of the `TraverseFileSystemParams` type and how is it used in the code? - **Answer:** The `TraverseFileSystemParams` type defines the parameters required for traversing the file system. It is used to pass configuration options, such as input path, project name, custom processing functions, and other settings, to a function that will traverse the file system and process files and folders accordingly. \ No newline at end of file + The `ProcessFile` type is a function type that takes a `ProcessFileParams` object as input and returns a Promise. It is used to process a single file in the autodoc project. The `ProcessFolder` type is a function type that takes a `ProcessFolderParams` object as input and returns a Promise. It is used to process a folder in the autodoc project. \ No newline at end of file diff --git a/.autodoc/docs/markdown/tsconfig.md b/.autodoc/docs/markdown/tsconfig.md index 68f5e10..64aee1e 100644 --- a/.autodoc/docs/markdown/tsconfig.md +++ b/.autodoc/docs/markdown/tsconfig.md @@ -1,30 +1,31 @@ [View code on GitHub](https://github.com/context-labs/autodoc/tsconfig.json) -This code is a configuration file for the TypeScript compiler in a project. The purpose of this configuration is to define various options and settings that the TypeScript compiler should use when transpiling TypeScript code into JavaScript. This is important for ensuring that the compiled output is consistent and compatible with the intended runtime environment. - -Here's a brief explanation of the key options set in this configuration: - -- `"rootDir": "src"`: Specifies the root directory containing the TypeScript source files. This tells the compiler where to look for the input files. -- `"outDir": "dist"`: Specifies the output directory for the compiled JavaScript files. This is where the transpiled code will be saved. -- `"strict": true`: Enables strict type checking, which enforces stronger type safety and helps catch potential issues during development. -- `"target": "es2020"`: Sets the target ECMAScript version for the compiled output. In this case, the output will be compatible with ECMAScript 2020 (ES11) features. -- `"module": "ES2020"`: Specifies the module system to use in the compiled output. This setting is aligned with the target ECMAScript version. -- `"sourceMap": true`: Generates source map files alongside the compiled output. This helps with debugging by mapping the compiled code back to the original TypeScript source. -- `"esModuleInterop": true` and `"allowSyntheticDefaultImports": true`: These options enable better compatibility with different module systems and allow for more flexible import statements. -- `"moduleResolution": "node"`: Sets the module resolution strategy to Node.js-style, which is the most common approach for resolving module imports in JavaScript projects. -- `"declaration": true`: Generates TypeScript declaration files (`.d.ts`) alongside the compiled output. These files provide type information for the compiled code, which can be useful for other TypeScript projects that depend on this one. -- `"skipLibCheck": true`: Skips type checking of declaration files, which can speed up the compilation process. - -In the larger project, this configuration file ensures that the TypeScript compiler produces consistent and compatible JavaScript output, making it easier to integrate the compiled code with other parts of the project or with external dependencies. +The code provided is a configuration file for the TypeScript compiler in a project. It specifies various options that control how the TypeScript compiler should process the source code and generate the output JavaScript files. This configuration file is typically named `tsconfig.json` and is placed at the root of a TypeScript project. + +The `compilerOptions` object contains several key-value pairs that define the behavior of the TypeScript compiler: + +- `rootDir`: Specifies the root directory of the source files. In this case, it is set to "src", meaning that the source files are located in the "src" folder. +- `outDir`: Specifies the output directory for the compiled JavaScript files. In this case, it is set to "dist", meaning that the compiled files will be placed in the "dist" folder. +- `strict`: Enables strict type checking, which helps catch potential issues in the code. +- `target`: Specifies the ECMAScript target version for the output JavaScript files. In this case, it is set to "es2020", meaning that the output files will be compatible with ECMAScript 2020 features. +- `module`: Specifies the module system to be used. In this case, it is set to "ES2020", meaning that the output files will use the ECMAScript 2020 module system. +- `sourceMap`: Generates source map files, which help in debugging the compiled code by mapping it back to the original TypeScript source files. +- `esModuleInterop`: Enables compatibility with ECMAScript modules for importing CommonJS modules. +- `moduleResolution`: Specifies the module resolution strategy. In this case, it is set to "node", meaning that the Node.js module resolution algorithm will be used. +- `allowSyntheticDefaultImports`: Allows default imports from modules with no default export. +- `declaration`: Generates TypeScript declaration files (`.d.ts`) alongside the compiled JavaScript files, which can be useful for other projects that depend on this one. +- `skipLibCheck`: Skips type checking of declaration files, which can speed up the compilation process. + +Overall, this configuration file helps ensure that the TypeScript compiler processes the source code according to the specified options, resulting in compiled JavaScript files that are compatible with the desired ECMAScript version and module system, while also providing useful features like source maps and strict type checking. ## Questions: 1. **What is the purpose of the `rootDir` and `outDir` options in the configuration?** - The `rootDir` option specifies the root folder of the source files, while the `outDir` option specifies the output directory for the compiled files. + The `rootDir` option specifies the root directory of the input files, while the `outDir` option specifies the output directory for the compiled files. 2. **What does the `strict` option do in the configuration?** - The `strict` option enables a set of strict type-checking options in the TypeScript compiler, ensuring a higher level of type safety in the code. + The `strict` option enables a wide range of type checking behavior that results in stronger guarantees of program correctness. 3. **What is the significance of the `target` and `module` options in the configuration?** - The `target` option sets the ECMAScript target version for the compiled JavaScript output, while the `module` option specifies the module system to be used in the generated code. In this case, both are set to "es2020", indicating that the output will be ECMAScript 2020 compliant. \ No newline at end of file + The `target` option specifies the ECMAScript target version for the output code, and the `module` option specifies the module system used in the output code. In this case, both are set to "es2020", which means the output code will be compatible with ECMAScript 2020 features and module system. \ No newline at end of file diff --git a/README.md b/README.md index 8e44aff..8212860 100644 --- a/README.md +++ b/README.md @@ -142,7 +142,7 @@ You should see a screen like this: Markdownify -This screen estimates the cost of indexing your repository. You can also access this screen via the `doc estimate` command. +This screen estimates the cost of indexing your repository. You can also access this screen via the `doc estimate` command. If you've already indexed once, then `doc index` will only reindex files that have been changed on the second go. For every file in your project, Autodoc calculates the number of tokens in the file based on the file content. The more lines of code, the larger the number of tokens. Using this number, it determine which model it will use on per file basis, always choosing the cheapest model whose context length supports the number of tokens in the file. If you're interested in helping make model selection configurable in Autodoc, check out [this issue](https://github.com/context-labs/autodoc/issues/9). diff --git a/package-lock.json b/package-lock.json index ebd487b..f35613e 100644 --- a/package-lock.json +++ b/package-lock.json @@ -1,12 +1,12 @@ { "name": "@context-labs/autodoc", - "version": "0.0.7", + "version": "0.0.8", "lockfileVersion": 2, "requires": true, "packages": { "": { "name": "@context-labs/autodoc", - "version": "0.0.7", + "version": "0.0.8", "license": "MIT", "dependencies": { "@dqbd/tiktoken": "^1.0.2", @@ -22,7 +22,8 @@ "marked": "^4.3.0", "marked-terminal": "^5.1.1", "minimatch": "^7.4.3", - "ora": "^6.2.0" + "ora": "^6.2.0", + "ts-md5": "^1.3.1" }, "bin": { "doc": "dist/index.js" @@ -5043,6 +5044,14 @@ "node": ">=8.0" } }, + "node_modules/ts-md5": { + "version": "1.3.1", + "resolved": "https://registry.npmjs.org/ts-md5/-/ts-md5-1.3.1.tgz", + "integrity": "sha512-DiwiXfwvcTeZ5wCE0z+2A9EseZsztaiZtGrtSaY5JOD7ekPnR/GoIVD5gXZAlK9Na9Kvpo9Waz5rW64WKAWApg==", + "engines": { + "node": ">=12" + } + }, "node_modules/tsconfig-paths": { "version": "3.14.2", "resolved": "https://registry.npmjs.org/tsconfig-paths/-/tsconfig-paths-3.14.2.tgz", @@ -8835,6 +8844,11 @@ "is-number": "^7.0.0" } }, + "ts-md5": { + "version": "1.3.1", + "resolved": "https://registry.npmjs.org/ts-md5/-/ts-md5-1.3.1.tgz", + "integrity": "sha512-DiwiXfwvcTeZ5wCE0z+2A9EseZsztaiZtGrtSaY5JOD7ekPnR/GoIVD5gXZAlK9Na9Kvpo9Waz5rW64WKAWApg==" + }, "tsconfig-paths": { "version": "3.14.2", "resolved": "https://registry.npmjs.org/tsconfig-paths/-/tsconfig-paths-3.14.2.tgz", diff --git a/package.json b/package.json index 982ed97..82f67ec 100644 --- a/package.json +++ b/package.json @@ -40,7 +40,8 @@ "marked": "^4.3.0", "marked-terminal": "^5.1.1", "minimatch": "^7.4.3", - "ora": "^6.2.0" + "ora": "^6.2.0", + "ts-md5": "^1.3.1" }, "devDependencies": { "@types/commander": "^2.12.2", diff --git a/src/cli/commands/index/convertJsonToMarkdown.ts b/src/cli/commands/index/convertJsonToMarkdown.ts index e871399..98eed82 100644 --- a/src/cli/commands/index/convertJsonToMarkdown.ts +++ b/src/cli/commands/index/convertJsonToMarkdown.ts @@ -87,7 +87,7 @@ export const convertJsonToMarkdown = async ({ await fs.writeFile(outputPath, markdown, 'utf-8'); }; - updateSpinnerText(`Creating ${files} mardown files...`); + updateSpinnerText(`Creating ${files} markdown files...`); await traverseFileSystem({ inputPath: inputRoot, projectName, @@ -99,5 +99,5 @@ export const convertJsonToMarkdown = async ({ targetAudience, linkHosted, }); - spinnerSuccess(`Created ${files} mardown files...`); + spinnerSuccess(`Created ${files} markdown files...`); }; diff --git a/src/cli/commands/index/processRepository.ts b/src/cli/commands/index/processRepository.ts index 290a054..64379bf 100644 --- a/src/cli/commands/index/processRepository.ts +++ b/src/cli/commands/index/processRepository.ts @@ -1,5 +1,6 @@ import fs from 'node:fs/promises'; import path from 'node:path'; +import { Md5 } from 'ts-md5'; import { OpenAIChat } from 'langchain/llms'; import { encoding_for_model } from '@dqbd/tiktoken'; import { APIRateLimit } from '../../utils/APIRateLimit.js'; @@ -69,6 +70,20 @@ export const processRepository = async ( linkHosted, }): Promise => { const content = await fs.readFile(filePath, 'utf-8'); + + //calculate the hash of the file + const newChecksum = await calculateChecksum([content]); + + //if an existing .json file exists, it will check the checksums and decide if a reindex is needed + const reindex = await reindexCheck( + path.join(outputRoot, filePath.substring(0, filePath.lastIndexOf('\\'))), + fileName.replace(/\.[^/.]+$/, '.json'), + newChecksum, + ); + if (!reindex) { + return; + } + const markdownFilePath = path.join(outputRoot, filePath); const url = githubFileUrl(repositoryUrl, inputRoot, filePath, linkHosted); const summaryPrompt = createCodeFileSummary( @@ -140,6 +155,7 @@ export const processRepository = async ( url, summary, questions, + checksum: newChecksum, }; const outputPath = getFileName(markdownFilePath, '.', '.json'); @@ -195,6 +211,16 @@ export const processRepository = async ( const contents = (await fs.readdir(folderPath)).filter( (fileName) => !shouldIgnore(fileName), ); + + //get the checksum of all the files in the folder + const newChecksum = await calculateChecksum(contents); + + //if an existing summary.json file exists, it will check the checksums and decide if a reindex is needed + const reindex = await reindexCheck(folderPath, 'summary.json', newChecksum); + if (!reindex) { + return; + } + // eslint-disable-next-line prettier/prettier const url = githubFolderUrl(repositoryUrl, inputRoot, folderPath, linkHosted); const allFiles: (FileSummary | null)[] = await Promise.all( @@ -259,6 +285,7 @@ export const processRepository = async ( folders: folders.filter(Boolean), summary, questions: '', + checksum: newChecksum, }; const outputPath = path.join(folderPath, 'summary.json'); @@ -366,3 +393,47 @@ export const processRepository = async ( */ return models; }; + +//reads all the files, and returns a checksum +async function calculateChecksum(contents: string[]): Promise { + const checksums: string[] = []; + for (const content of contents) { + const checksum = Md5.hashStr(content); + checksums.push(checksum); + } + const concatenatedChecksum = checksums.join(''); + const finalChecksum = Md5.hashStr(concatenatedChecksum); + return finalChecksum; +} + +//checks if a summary.json file exists, and if it does, compares the checksums to see if it needs to be re-indexed or not. +async function reindexCheck( + contentPath: string, + name: string, + newChecksum: string, +): Promise { + const jsonPath = path.join(contentPath, name); + + let summaryExists = false; + try { + await fs.access(jsonPath); + summaryExists = true; + } catch (error) {} + + if (summaryExists) { + const fileContents = await fs.readFile(jsonPath, 'utf8'); + const fileContentsJSON = JSON.parse(fileContents); + + const oldChecksum = fileContentsJSON.checksum; + + if (oldChecksum === newChecksum) { + console.log(`Skipping ${jsonPath} because it has not changed`); + return false; + } else { + console.log(`Reindexing ${jsonPath} because it has changed`); + return true; + } + } + //if no summary then generate one + return true; +} diff --git a/src/cli/commands/init/index.ts b/src/cli/commands/init/index.ts index 084073e..450ea8a 100644 --- a/src/cli/commands/init/index.ts +++ b/src/cli/commands/init/index.ts @@ -47,7 +47,7 @@ export const makeConfigTemplate = ( chatPrompt: '', contentType: 'code', targetAudience: 'smart developer', - linkHosted: true, + linkHosted: false, }; }; diff --git a/src/cli/utils/traverseFileSystem.ts b/src/cli/utils/traverseFileSystem.ts index f2e15a7..c2afbf8 100644 --- a/src/cli/utils/traverseFileSystem.ts +++ b/src/cli/utils/traverseFileSystem.ts @@ -36,6 +36,7 @@ export const traverseFileSystem = async ( await dfs(folderPath); await processFolder?.({ + inputPath, folderName, folderPath, projectName, diff --git a/src/types.ts b/src/types.ts index eb85968..a462d41 100644 --- a/src/types.ts +++ b/src/types.ts @@ -25,6 +25,7 @@ export type FileSummary = { url: string; summary: string; questions: string; + checksum: string; }; export type ProcessFileParams = { @@ -47,9 +48,11 @@ export type FolderSummary = { folders: FolderSummary[]; summary: string; questions: string; + checksum: string; }; export type ProcessFolderParams = { + inputPath: string; folderName: string; folderPath: string; projectName: string;