Use of Large Language Models For stock research

Contrabets · January 20, 2024, 5:09pm

Being a person exposed to ai ,llms and investing due to my job and being a healthy Investors always fascinated me can we leveraged both.

In a cost effective and experimental way due to work I was exploring into RAG systems and after using it with investor conference material of navine flourine , I was truly amazed that it actually did something in reality as I put around 20+ conference documents.

So does anyone know or tried something like this as I want some inputs as this system that I am building is currently building is good to do many type of docs but have only started less then 2 week ago with some issues and mostly solved in coding process.

So would like to discuss with the community’s over it and if is fruitful would also love to share( note I have made as a side project but have also devoted some time of company so need to ask the job company before sharing as it was a part of their poc)

As a note this is my first post as a first thread so would also love some comments also how to write better

Charanjeev_Kohli · January 21, 2024, 2:52am

Can you please explain, you put 20+ documents of conf calls, what prompt you gave & how it performed?

Contrabets · January 21, 2024, 9:18am

Like their is no limit of documents like I have put around 15+ docs though it took around.

For the case of prompt it was earlier designed to be a chat like model at least in terms of persona now I am switching to a financial analyst persona(testing is required).

In current scheme was using while con call but I am also testing summarization + embedding technique with a better model.

Like was using tinny llama but using mistral , phi or even llama 2 40B could yield better result.

Like as a question it was able to translate the growth and explain what molecules company make and some other like future capes.

But right now comparison don’t work.

I am also trying to find Claude 2 like large model so a great amount of data can be their in case of context

Contrabets · January 21, 2024, 1:08pm

Also for test would require some people help to test the functionality.

As everything is dockerized so by using just postman a person can question , put pdfs etc in it.

I know making a cool UI would make things easy but firstly I am a back end first person and testing with a front end will only delay this at the end

MomentumStocks · January 21, 2024, 1:37pm

This sounds very interesting
Where can I download the docker image to run similar tests ?
Maybe we can run a small group here to share our findings

Contrabets · January 21, 2024, 2:16pm

Like I need some revaluation of the summary approach and a prompt change so after that will share the repo link here

sougataG · January 22, 2024, 6:45am

Sounds very interesting. I have tried Phi-2 on LLM studio. How are you uploading the documents? Are you also training the model?

Harshal_Shinde · January 22, 2024, 7:19am

what metrics are you using to evaluate your mode response, I extensively work on LLMs and from my experience what i can say is the model response is not always reliable, the model provides you incorrect response confidently, also the more data you provide to the model, the more chances of it to hallucinate, so i think it’s very important to put some metrics in place for the response evaluation especially in cases of analyzing financial documents where the numbers are of utmost importance.

Contrabets · January 22, 2024, 7:28am

I am actually want to work on that only like i want some more real world question and answers to check it.

Working is done that phi2 and other also for this I am using both lmstudio and ollama support.

I am doing a RAG system that is used and using the embedding extraction to talk to database of these pdf and for halisunation I was earlier using tinyllama and after move to phi2 it work better.

Will need suggestions how to test as in my organisation I am not responsible for testing mainly for integration and making.

In future can add finance or chat finance based fine tuned models to solve it.

In rag the best case I found is to stuff accurate and good data using relevance and limit.

Like in navinflourine after phi2 it was 8 and 40% relevenace while for itc it was 20 and 60.

For tech stack I am using Qdrant,Hugging face text embedding, semantic kernel , .net api format mainly and ollama with everything being in a docker container and all stored locally

Contrabets · January 22, 2024, 7:30am

In this week only I am planning to public a fork of my project to my personal git and we all can contribute and make it useful and fast.

Like i think model picking and chunking could solve a great deal of hallucination problems

Contrabets · January 22, 2024, 11:52am

As promised, I have published a fork copy of my main project that has this code and have started to implement summarization and others in a new branch name also in below link can fork , star , contribute and also any comments will be welcomed

Contrabets · January 23, 2024, 8:12pm

Like I personally tried and it don’t hallucination with phi 2 that much with around 60% relevance and 10 as a limit of responses.

I still need to try to solve these questions as a analyst and not a chat systems persona.

Also have seen better results with summarised pdf report embedding that I am currently working on so whole system is integrated and also planning to launch a simple pdf upload chat app that will run as a chain of prompts.

Update

Have implemented and covered the part of persona and the summarisation based embedding with some tweaks in code for parallel programming to solve speed problems

Contrabets · January 31, 2024, 1:26pm

I am currently starting to make a simple react UI where a person can chat in a simple way so do check and would appreciate it if someone could contribute to UI part and testing part as I am not that good with that.

iivans · January 31, 2024, 4:24pm

Hi @Contrabets great initiative. I am wondering if you tried to use AI to identify and flag technical patterns which have high degree of success probability?

aadhar.aggarwal · February 1, 2024, 9:58am

Hi

I can help with the testing part as a user. I am not from an engineering background. But I have worked in Paytm as a product manager for 3 years before becoming a full time investor. Message me if you think we can do something together here.

Contrabets · February 1, 2024, 1:08pm

like i we can start testing after getting the UI done will try to wrap it up in weekend.

Also as everything is running locally so atleast 16gb ram and if possible a good GPU would be needed (gpu is not mandatory as such but will make things fast)

Contrabets · February 1, 2024, 1:12pm

like for that will need more of a visual model , may make something that can get the data from google finance then make chart image and then use it with visual models.

Like i personally think it is possible as with gpt 4 have tried to analyse charts may be a good perosna and tools can help.

lets try that too but would be a seprate use case can try

iivans · February 1, 2024, 4:26pm

Agree it’s a different usecase. As you’d know there are a lot Technical experts in this forum so do seek out their guidance when you decide to take up this one. Good luck and wish you a lot of learning and success.

kartik_sharma · February 1, 2024, 5:16pm

Hello,
Im working on a system which helps an investor to take decision and apply strategies.

Lets call it Screener on steroids

It can do sentiment analysis and help you gauge general public opinion on the company.
Internet based Multi Modal RAG with sources - Con-calls, presentations, Videos, Images.

Thanks

sarguna · February 7, 2024, 11:34pm

Let us know how we can test, I am Senior Test Engineer with Test Automation backgroung, also doing AI ML course

This would be an intresting for me test this model.