Skip to content
View AkimfromParis's full-sized avatar

Block or report AkimfromParis

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
akimfromparis/README.md

Akim Mousterou | ムステロ・アキム

AI/NLP Engineer from Paris, France | パリ、フランスからの自然言語処理エンジニア

  • Master's degree, in Natural Language Processing - Multilingual engineering, Japanese at I.N.A.L.C.O

(I.N.A.L.C.O 東洋言語文化学院、自然言語処理修士 | 卒業)

  • Master's Degree in International Business (EMIB) at ESCP Europe

(ESCPヨーロッパ・ビジネススクール、経営管理修士 | 卒業)

I have managed AI and business initiatives for prestigious brands, publishing companies, technology companies, and financial institutions for over 10 years. Born and raised in Paris, I am fluent in French, English, Japanese (JLPT N2), and Spanish. I am passionate about quantitative finance, network effects, opinion mining, and natural language processing.

Research papers:

Open-source contributor to LLM-jp in the "evaluation" working group led by Prof. Yusuke Miyao (University of Tokyo - 東京大学).

Please feel free to connect with me on GitHub, LinkedIn, or HuggingFace ! 😃


Libérté, égalité, architecture open-sourcé... Open-source RAG LlamaIndex and multilingual LLM from Mistral AI in a low-resource environment over financial statements:

  • Vanilla RAG (retrieval-augmented generation) with LlamaIndex and pgvecto.rs from TensorChord
  • Quantized model of Mistal8x7B from Mistral AI and LlamaCPP inference tool
  • Test of BGE-M3 embedding model from BAAI
  • Plus benchmark in German on Porsche AG, in French on Hermès, and in Italian on Brunello Cucinelli

Generative AI for all - Quick implementation with an open-source RAG LlamaIndex and Japanese LLM from ELYZA, Inc. in a low-resource environment over legal documents:

  • RAG (retrieval-augmented generation) is LlamaIndex with a vanilla Hybrid search (combining retrieval from both text search and vector search)
  • Japanese LLM “ELYZA-japanese-Llama-2-7b-instruct” created by Japanese startup, ELYZA, Inc.
  • Open-source database PostgreSQL transformed into a vector database by the great library PG Vector
  • Plus Q&A analysis in Japanese, embedding pricing war, and generative AI strategy of France, USA, and Japan

NER-Luxury is a fine-tuned XLM-Roberta model for the subtask N.E.R (Named Entity Recognition) in English. NER-Luxury is domain-specific for the fashion and luxury industry with bespoke labels. NER-Luxury is trying to be a bridge between the aesthetic side and the quantitative side of the fashion and luxury industry.

  • 38.063 sentences in English (covering the beauty, fashion, and luxury industries)
  • 32 labels from companies, groups, and holdings to luxury brands, models, and magazines
  • Loss: 0.3990, Accuracy: 0.9427, F1: 0.7879

To promote communication between NLP practitioners, I created a vocabulary deck of 420 technical words for Anki Pro (learning software based on space repetition) in Japanese, English, and French for natural language processing.

  • Test on earning calls of Fast Retailing Co., Ltd 2022 with Whisper OpenAI

  • LDA analysis on shareholder's letter

  • Sustainability as a strategy in fashion and in NLP

  • Introduction from Pregroup Grammar, DisCoCat, to Lambeq
  • Specificities of the Japanese language
  • Pregroup Grammar in free word order

  • Quick financial analysis
  • Why is TikT0k a mistake in luxury?
  • Black Scholes (closed form) and the Greeks implemented in Python and in C++
  • Linguistic specificities for NLP in Japanese
  • Leveraging spaCy framework, and Ginza model, and building a custom NER model
  • Insights on Masayoshi Son (Softbank), Hiroshi Mikitani (Rakuten), and Haruhiko Kuroda (Bank of Japan)
  • Few thoughts on NLP in Japanese
  • Naive OLS Linear regression model
  • Facebook's Prophet model
  • Bayesian Linear regression model with STAN
  • And a quick introduction to the Hierarchical model
  • Natural Language Processing using NLTK and Vador
  • Time Series for sentiment analysis
  • "Meme" as a marketing or investment strategy?
  • Insights on the human perceptions of leadership on Twitter
  • Financial analysis of luxury groups in 2020 during a high volatility state with the spread of SARS-Cov2
  • Found an increasing correlation on Y-o-Y between luxury groups and the Facebook Inc. ecosystem
  • Conclusion for stakeholders and shareholders

Popular repositories Loading

  1. RAG-Japanese RAG-Japanese Public

    Open source RAG with Llama Index for Japanese LLM in low resource settting

    Jupyter Notebook 10 3

  2. Mixtral_RAG Mixtral_RAG Public

    Open-source RAG using Mixtal 8x7B for financial statements in German, French, and Italian

    Jupyter Notebook 6

  3. quantumNLP_jp quantumNLP_jp Public

    Quantum Natural Language Processing for Japanese. Why the quantum approach will be better than large language models for the Japanese language.

    Jupyter Notebook 3 2

  4. ner_japanese ner_japanese Public

    The biggest tech investor Masayoshi Son is back on Twitter since 2020 but in Japanese. What he is saying?

    Jupyter Notebook 1

  5. anki_jap_ai_vocabulary anki_jap_ai_vocabulary Public

    Anki flashcards in Japanese for AI, ML, DL, NLP & CV - Collaborative deck V1

    1

  6. luxury_risk luxury_risk Public

    How the spread of SARS-Cov2 impacted luxury groups in 2020? How Instagram became more important than Vogue?

    Jupyter Notebook