Being a person exposed to ai ,llms and investing due to my job and being a healthy Investors always fascinated me can we leveraged both.
In a cost effective and experimental way due to work I was exploring into RAG systems and after using it with investor conference material of navine flourine , I was truly amazed that it actually did something in reality as I put around 20+ conference documents.
So does anyone know or tried something like this as I want some inputs as this system that I am building is currently building is good to do many type of docs but have only started less then 2 week ago with some issues and mostly solved in coding process.
So would like to discuss with the community’s over it and if is fruitful would also love to share( note I have made as a side project but have also devoted some time of company so need to ask the job company before sharing as it was a part of their poc)
As a note this is my first post as a first thread so would also love some comments also how to write better
Like their is no limit of documents like I have put around 15+ docs though it took around.
For the case of prompt it was earlier designed to be a chat like model at least in terms of persona now I am switching to a financial analyst persona(testing is required).
In current scheme was using while con call but I am also testing summarization + embedding technique with a better model.
Like was using tinny llama but using mistral , phi or even llama 2 40B could yield better result.
Like as a question it was able to translate the growth and explain what molecules company make and some other like future capes.
But right now comparison don’t work.
I am also trying to find Claude 2 like large model so a great amount of data can be their in case of context
what metrics are you using to evaluate your mode response, I extensively work on LLMs and from my experience what i can say is the model response is not always reliable, the model provides you incorrect response confidently, also the more data you provide to the model, the more chances of it to hallucinate, so i think it’s very important to put some metrics in place for the response evaluation especially in cases of analyzing financial documents where the numbers are of utmost importance.
I am actually want to work on that only like i want some more real world question and answers to check it.
Working is done that phi2 and other also for this I am using both lmstudio and ollama support.
I am doing a RAG system that is used and using the embedding extraction to talk to database of these pdf and for halisunation I was earlier using tinyllama and after move to phi2 it work better.
Will need suggestions how to test as in my organisation I am not responsible for testing mainly for integration and making.
In future can add finance or chat finance based fine tuned models to solve it.
In rag the best case I found is to stuff accurate and good data using relevance and limit.
Like in navinflourine after phi2 it was 8 and 40% relevenace while for itc it was 20 and 60.
For tech stack I am using Qdrant,Hugging face text embedding, semantic kernel , .net api format mainly and ollama with everything being in a docker container and all stored locally
As promised, I have published a fork copy of my main project that has this code and have started to implement summarization and others in a new branch name also in below link can fork , star , contribute and also any comments will be welcomed
Like I personally tried and it don’t hallucination with phi 2 that much with around 60% relevance and 10 as a limit of responses.
I still need to try to solve these questions as a analyst and not a chat systems persona.
Also have seen better results with summarised pdf report embedding that I am currently working on so whole system is integrated and also planning to launch a simple pdf upload chat app that will run as a chain of prompts.
Update
Have implemented and covered the part of persona and the summarisation based embedding with some tweaks in code for parallel programming to solve speed problems
I am currently starting to make a simple react UI where a person can chat in a simple way so do check and would appreciate it if someone could contribute to UI part and testing part as I am not that good with that.
Hi @Contrabets great initiative. I am wondering if you tried to use AI to identify and flag technical patterns which have high degree of success probability?
I can help with the testing part as a user. I am not from an engineering background. But I have worked in Paytm as a product manager for 3 years before becoming a full time investor. Message me if you think we can do something together here.
like i we can start testing after getting the UI done will try to wrap it up in weekend.
Also as everything is running locally so atleast 16gb ram and if possible a good GPU would be needed (gpu is not mandatory as such but will make things fast)
like for that will need more of a visual model , may make something that can get the data from google finance then make chart image and then use it with visual models.
Like i personally think it is possible as with gpt 4 have tried to analyse charts may be a good perosna and tools can help.
lets try that too but would be a seprate use case can try
Agree it’s a different usecase. As you’d know there are a lot Technical experts in this forum so do seek out their guidance when you decide to take up this one. Good luck and wish you a lot of learning and success.