A speech recognition researcher I knew spent some time at Eastern Washington university because they had a lot of transcribed Washington state proceedings, which was open access enough to go into his company’s speech corpus, I guess (I only found out because I mentioned my mom graduated from there). Anyways, these people turn over a lot of rocks to realize their huge corpuses (erm, corpi?).
Whether that is “open access” enough for commercial use is an interesting question. I thought that the SCOTUS recordings, for example, can not be used for commercial applications, but that might be a restriction imposed by the organization that processes and publishes the data, not the proceedings themselves.