Microsoft has released MarkItDown, an open-source tool that converts many file types, including PDF, PowerPoint, and Excel files, into Markdown. It prioritizes preserving essential document structure such as headings, lists, tables, and links, which makes the output well suited to text-analysis pipelines and LLM ingestion.
https://github.com/microsoft/markitdown
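
For anyone who wants to try it from Python, here is a minimal sketch. It assumes the package is installed (for example with `pip install 'markitdown[all]'`) and that the `MarkItDown().convert(...)` call and the `.text_content` attribute behave as the project README describes; check the repo for the current API. The helper name `to_markdown` and its `converter` parameter are my own, not part of the library.

```python
from pathlib import Path


def to_markdown(path, converter=None):
    """Convert one file to Markdown text.

    `converter` is a hypothetical injection point: any object with a
    .convert(path) method whose result has a .text_content attribute.
    When omitted, a MarkItDown instance is used (assumed README API).
    """
    if converter is None:
        # Imported lazily so the helper can be defined without the package.
        from markitdown import MarkItDown
        converter = MarkItDown()
    result = converter.convert(str(path))
    return result.text_content


if __name__ == "__main__":
    import sys

    # Write a .md file next to each input file named on the command line.
    for name in sys.argv[1:]:
        out = Path(name).with_suffix(".md")
        out.write_text(to_markdown(name), encoding="utf-8")
        print(f"{name} -> {out}")
```

Saved as a script, this would be run with one or more file paths as arguments. The returned string is plain Markdown, so it can go straight into a chunker or an LLM prompt.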