Indexing INDEX.HTML Files


  from mixpeek import Mixpeek

  mix = Mixpeek(mixpeek_key="API_KEY")
  
  # index our INDEX.HTML file
  mix.extract("file.index.html")
  
  # now we have clean INDEX.HTML data
  [
    {
      "filename": "file.index.html",
      "content": "This is the content of the index.html file",
      "embedding": [0.1, 0.2, 0.3, ...],
      "metadata": {
        "author": "John Doe",
        "date": "2022-01-01"
      }
    }
  ]
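
Once records of this shape are available, one common next step is to rank them against a query vector by cosine similarity. The sketch below is only an illustration under stated assumptions: it uses plain Python and does not call Mixpeek, the second record, the shortened three-value embeddings, and the query vector are made up, and `cosine_similarity` and `rank_by_similarity` are hypothetical helper names rather than part of any library.

```python
import math

# Hypothetical records shaped like the sample output above.
# The second record and all embedding values are invented for illustration.
records = [
    {
        "filename": "file.index.html",
        "content": "This is the content of the index.html file",
        "embedding": [0.1, 0.2, 0.3],
        "metadata": {"author": "John Doe", "date": "2022-01-01"},
    },
    {
        "filename": "other.index.html",
        "content": "A second, unrelated page",
        "embedding": [0.3, -0.2, 0.1],
        "metadata": {"author": "Jane Doe", "date": "2022-02-01"},
    },
]


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        # A zero vector has no direction, so treat it as dissimilar.
        return 0.0
    return dot / (norm_a * norm_b)


def rank_by_similarity(query_embedding, items):
    """Sort records from most to least similar to the query embedding."""
    return sorted(
        items,
        key=lambda r: cosine_similarity(query_embedding, r["embedding"]),
        reverse=True,
    )


# Made-up query vector, assumed to come from the same embedding model.
query = [0.1, 0.2, 0.25]
best = rank_by_similarity(query, records)[0]
print(best["filename"])
```

Cosine similarity compares direction rather than magnitude, which is why it is a frequent default for embedding search; a real deployment would more likely delegate this ranking to a vector index than loop over records in Python.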

Read the Docs

Become a multimodal maker.

Upgrade your software with multimodal understanding in one line of code.