I am building a digital human assistant that allows humans to talk to an llm in natural voice as if talking to a person and ask the assisant to perform task on their computer such as taking screenshots, searching online and saving results to a local file, clicking on a desktop, creating files, etc..