embedding-finetuned / README.md
smokxy's picture
pytorch_model.bin upload/update
fe85a47 verified
|
raw
history blame
42 kB
---
language: []
library_name: sentence-transformers
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:900
- loss:GISTEmbedLoss
base_model: BAAI/bge-small-en-v1.5
datasets: []
metrics:
- cosine_accuracy@1
- cosine_accuracy@5
- cosine_accuracy@10
- cosine_precision@1
- cosine_precision@5
- cosine_precision@10
- cosine_recall@1
- cosine_recall@5
- cosine_recall@10
- cosine_ndcg@5
- cosine_ndcg@10
- cosine_ndcg@100
- cosine_mrr@5
- cosine_mrr@10
- cosine_mrr@100
- cosine_map@100
- dot_accuracy@1
- dot_accuracy@5
- dot_accuracy@10
- dot_precision@1
- dot_precision@5
- dot_precision@10
- dot_recall@1
- dot_recall@5
- dot_recall@10
- dot_ndcg@5
- dot_ndcg@10
- dot_ndcg@100
- dot_mrr@5
- dot_mrr@10
- dot_mrr@100
- dot_map@100
widget:
- source_sentence: Who are the CSCs engaged to enrol non-loanee farmers?
sentences:
- '''notification or /and on National Crop Insurance Portal multiplied by sown area
for notified crop. 3.1.3 Special efforts shall be made to ensure maximum
coverage of SC/ ST/ Women farmers under the Scheme. Further Panchayat Raj Institutions
(PRIs) may be involved in extension and awareness creation amongst farmers
and obtaining feed-back of farmers about the implementation of the Scheme 3.1.4 The
implementing Insurance Company selected as L1 will be responsible for taking necessary measures
to ensure at least 10% incremental increase in coverage of non-loanee farmers.
However other empanelled Insurance Companies which have participated in the bidding
and are keen for enrolment of non loanee farmers in the cluster may also be allowed
to enrol non-loanee farmers at L1 premium rate. The interested companies have
to inform their willingness in writing within seven days of finalisation of tender/issuance
of work order to L1. It will however be the responsibility of all the Insurance
Companies engaged in this process to ensure that duplicate enrolment does not
happen in the given cluster/district. Engaging companies other than L1 for enrolling
non loanee farmers will be taken up on a pilot basis in Districts notified by
State Govt. They shall enrol non loanee farmers as per conditions laid down
in Para 17.5. 3.1.5 These Insurance Company will maintain separate data of
such non loanee farmers covered by them and enter the said data on the portal
as per seasonality discipline detailed in Para 16.2. They shall be liable for
payment of claims to such farmers. 3.1.6 The exchange of information, co-witnessing
of CCEs and sharing of yield data etc for the cluster by Government/NCIP will
be limited to L1 Company only and it will be binding on other companies to accept
it. However, the requisition for payment of Government subsidy in respect of non-loanee enrolled
by them will be submitted directly to the Govt designated agency.'''
- '''Name of Implementing Agency (NABARD/NCDC):.............................................
Address: ...........................................................................................................
...........................................................................................................
................................................................................................................. Phone
Number: ............................................................................. (Each
page of the application form should be signed by Branch head and Zonal Manager) Name
and Address of the applicant Bank Branch : 1 a) Complete Postal Address (*with
pin-code) : 1 b) Phone No. with STD : 1 c) Fax No.: 1 d) E-Mail Address: 1
e) Details of the authorised Designation Mobile No. E-Mail Address. person
of the Bank submitting the Claim: 2 Name of Borrower FPO : 2 a) Constitution: Producer
Organization 2 b) Registered Office Address (*with pin-code): (i). Phone No. (ii).
Fax No. (iii). E-mail Address 2 c) Business Office Address (if any) (i). Phone
No. (ii). Fax No. (iii). E-mail Address 2 d) Name of CEO : Mobile No. 2 e)
Credit Facility for which guarantee cover sought : Old New Expansion Technical
Upgradation 2 f ) Give details of components:- Inputs: Processing: Marketing: Any
other: Total Investment: 3 Banking Facilities Sanctioned by sanctioning authority
(Rs. in Lakh):- (i). Term-Loan : Date of Sanction: Amount Outstanding: IRAC
Status: IRAC Status: (ii).Cash Credit : Date of Sanction: Amount Outstanding: 3
a) Sanctioning Office: Branch: ZO / RO: HO: 3 b) Designation of Sanctioning
Authority : 3 c) Sanctioning authority approval vide : 3 d) Sanction /
Appraisal Note No. Dated: 3 e) Agenda No. / Minutes conveying sanction : 4 Name
and Address of Controlling Office of the Branch (*with pin-code): 4.a). Name
of Controlling Authority : 4.b). Mobile No.: 4.c). Fax. No. : 4.d). E-Mail
Address. : 5 Present status of FPO Activity : (Give component wise details) 5.
a) 5. b). 5. c). 5. d). 5. e). 5. f ) 6 Status of Accounts 6. a).
Term-Loan: Amount of Disbursement till date : Outstanding as on date : i).'''
- '''8.1 CSCs under Ministry of Electronics and Information Technology (MeITY)
have been engaged to enrol non-loanee farmers. The Insurance Companies are
required to enter into a separate agreement with CSC and pay service charges
as fixed by DAC&FW, GOI per farmer per village per season. No other agreement
or payment is required to be made for this purpose. Nodal agency for engagement
with Ministry of Agriculture and Farmers Welfare and Insurance Companies will
be CSC-SPV, a company established under MeITY for carrying out e-governance
initiatives of GoI. 8.2 No charges/fee shall be borne or paid by the farmers
being enrolled through CSCs i.e. CSC-SPV and CSC-VLE 8.3 As per IRDA circular,
no separate qualification/certification will be required for the VLEs of CSCs
to facilitate enrolment of non-loanee farmers. 8.4 All empanelled Insurance
Companies will compulsorily be required to enter into an agreement with CSC
for enrolment of non-loanee farmers and for provision of other defined services
to farmers. 8.5 Other designated intermediaries may be linked with the Portal
in due course. 8.6 Empanelled Insurance Companies have to necessarily register
on the portal and submit list and details of agents/intermediaries engaged
for enrolment of non-loanee farmers in the beginning of each season within
10 days of award of work in the State. Further all agents/intermediaries have
to work strictly as per the provisions of the Scheme and IRDA regulations'''
- source_sentence: What role does the N-PMAFSC play in the modification of the scheme?
sentences:
- '''Eligible FPOs shall apply for the Equity Grant in the prescribed Application
Form (**Annexure-I**) only. Other mandatory documents required to be submitted
along with the Application are listed below: (i) Shareholder List and Share
Capital contribution by each member verified and certified by a Chartered Accountant
(CA)/Co-operative Auditors prior to submission(Enclosure-I of Annexure-I). (ii)
Resolution of the Board of Directors/Governing Body to seek Equity Grant for members
**(Enclosure-II of Annexure-I).** (iii) Consent of shareholders, stating name
of shareholder, gender, number of shares held, face value of shares, land holding,
signifying consent for Implementing Agencies to directly transfer the Equity Grant
sanctioned to the FPO on their behalf, to FPO Bank account, against the consideration
of additional shares of equivalent value to be issued to them by FPO and on exit-
transfer of the shares as per rules **(Enclosure-III of Annexure-I).** (iv) If
the FPO is in operation for more than one or more financial year then it shall
provide copy of the Audited Financial Statements of FPO for all years of existence
of the FPO, verified and certified by a Chartered Accountant (CA)/ Cooperative
Auditors prior to submission. (v) In case FPO is in operation for period of less
than one financial year, Photocopy of Bank Account Statement for last six months
authenticated by the Branch Manager of the \''Bank\'' is required (vi) Business
Plan of FPO and budget for next 18 months. (vii) Names, photographs, and identity
proof (anyone from among ration card, Aadhaar card, election identification card,
passport) of Representatives/ Directors authorized by the Board for executing
and signing all documents under the Scheme. Each page of the Application Form
and accompanying documents shall be signed by a minimum of two Board Member /Authorized
Representatives of the FPO.'''
- '''20.3.1 For addressing the issue of reliability of CCEs in terms of their
accuracy, representativeness and timeliness, innovative technologies such as
satellite remote sensing, drone, modeling, AWS/ARG, real time transmission of
data etc. should be utilized. This will ensure accurate assessment of yield and timely
payment of claims to farmers. Various studies carried out by national and international
organizations, including MNCFC, NRSC, SAC, CCAFS, IRRI, IFPRI, World Bank, etc.
have shown that the use of satellite, weather, soil and crop data, along with
images/video capture of crop growth at various stages and accurate sample CCE
data collection can improve the yield data quality/ timeliness and support timely
claim processing and payments. 20.3.2 States, with the support of national
centres as mentioned above, SRSC and SAUs, need to carry out adequate number
of pilot studies for improved yield estimation using technology, as mentioned above,
and small number of good quality CCEs. When a significant correlation is observed
between remote sensing and weather estimated yield and yield estimated through
CCEs, States and Insurance Companies can use these technologies in estimating
the crop yields at IU level, subject to the satisfaction of both States and Insurance
Companies about the accuracy of the yield estimates, to service the claims.'''
- '''(i) Coordinate with all the Implementing Agencies, State Level Consultative
Committee and District level Monitoring Committee (D-MC) for smooth implementation.
It will also consider feedback received from other relevant Ministries and Organizations
on Clusters identification for consideration. (ii) It will monitor the progress
either by holding the meetings of Implementing Agencies and other stakeholders
or by other means. (iii) It will allocate the produce clusters/districts/States
to Implementing Agencies for formation and promotion of FPOs. (iv) It will undertake
scrutiny of Action Plan of Implementing Agencies(IAs), consider recommendation
of release of fund to Implementing Agencies based on previous utilization as due with
respect to funding under the Scheme. (v) It will provide policy inputs to DAC&FW
for modification in the Scheme to better suit in the formation and promotion
of FPOs to make them economically sustainable. (vi) It will provide aid and advice
to Implementing Agencies as may be required for smooth functioning of the scheme. (vii)
Based on suggestions received from various Implementing Agencies, other Ministries,
States and experience/need, N-PMAFSC may examine and recommend revision of the
minimum membership norm per FPO to DAC&FW. (viii) It may seek detailed input
and analysis as may be required from time to time from NPMA and also seek assistance
of DMI in verification etc.'''
- source_sentence: When should non-loanee farmers apply for the Rabi season?
sentences:
- ''' Date……………………………… ……………………………… Signature of Branch Manager with branch seal Name……………………………………
… Designation …………………………………… ……………………………… ……………………………… Signature of Authorized
Person in zonal office Name………………………………… Designation …………………………………… 5. Promoter''s
request letter List of Enclosures 1. Recommendation 9. List of shareholders addressed
to the Bank Manager on original letter head of FPO confirmed by promoter and
bank with amount of CGC sought on Bank''s Original letterhead with date and
dispatch number duly signed by the Branch Manager on each page. 2. Sanction letter
of 6. Implementation Schedule 10. Affidavit of promoters that confirmed by
the bank. they have not availed CGC from any other institution for sanctioned
Credit Facility. sanctioning authority addressed to recommending branch. 3.
Bank''s approved 7. Up-to-date statement of account of 11. Field inspection
report of Term loan and Cash Credit (if Sanctioned). Bank official as on recent
date. Appraisal/Process note bearing signature of sanctioning authority. 4.
Potential Impact on 8. a).Equity Certificate, C.A/CS * Pin Code at Column No.
1. a), certificate/RCS certificate 2. b), 2. c), 4. a) and 9. a) is Mandatory b).
FORM-2, FORM-5 and FORM-23 filed with ROC for Company/RCS. small farmer producers 1.
Social Impact, 2. Environmental Impact 3.'''
- ''' \''Tenure of Guarantee Cover\'' means the agreed tenure of the Term loan/ composite
credit i.e. the maximum period of Guarantee Cover from the Guarantee start-up
which shall run through the agreed tenure of the term credit, and where working
capital facilities or Term loan alone are extended and/or continuing working
capital arrangements granted along with the Term Loan, for a period of 5 years or
block of 5 years and/or loan / working capital credit or composite credit facilities''
termination date, whichever is earlier or such period as may be specified by
the NABARD or NCDC, as case may be.'''
- '''for loanee and within 30 days for non loanee i.e. 15th Aug for Kharif and
15th Jan for Rabi for loanee and 31st Aug for Kharif and 31st Jan for Rabi
for Non Loanee 13 Within 7 days from the date of intimation by ICs CSCs/Banks/
Intermediary Cut-off date for CSCs/Banks/Intermediary to correct/update the paid
application intimated by ICs on Crop Insurance Portal 14 Cut-off date for Insurer
to accept the corrected/updated applications Within 7 days from the date of
submission of correction/updation by the Bank/CSC Insurance Companies 15 Within
7 days from acceptance of proposal by concerned Insurance Company on Portal Cut-off
date for Banks/ICs to hand over insurance acknowledgement receipt along with folio
to the insured farmer Banks/ICs for enrolment through their intermediaries 16 Cut
off date for processing of applications by ICs and auto approval of application
of insured farmers on crop insurance Portal 60 days from the cut off date for
enrolment/debit of premium from farmers i.e. 15th September for Kharif and 15th
February for Rabi seasons 17 Before cut off date of enrolment of farmers Insurance
Companies/GOI /State Cut off date for raising bills/requisitions with supporting
documents for releasing of advance premium subsidy based on 50% of 80% of respective
share of Centre/State in corresponding previous season 18 Release of advance
upfront premium subsidy (First Instalment)i.e. 50% of 80% of respective share
of Centre/State in corresponding previous season Within 15days of cut off date
of enrolment of farmers i.e. 31st July for Kharif Upto 15th August* 19 *state
may fix earlier dates for early Kharif crops Training and registration of field
level workers assigned for conduct of CCEs and reporting of the same on crop
insurance Portal through smart phones/CCE Agri App Upto31st August* *state 20 Registration
of mobile number of representative of ICs for co-witnessing of CCEs may fix earlier
dates for early Kharif crops At least 7 days before tentative date for conducting
CCEs 21 a) Uploading of tentative schedule/date for conducting CCEs (crop-wise/IU
wise) followed by SMS on one day notice through CCEs app.'''
- source_sentence: What is the requirement for the shareholder list and share capital
contribution?
sentences:
- '''To substantiate the fact, the most successful example is of dairy co-operative
in India where professional managers have contributed immensely to make it a success.
There are other so many examples which prove the absolute requirement of professional
managers. The number of professional staff could depend on geographical spread
of business operation, diversity of activities and volume of business. However,
an FPO should have minimum a CEO/Manager and an Accountant. Accountant is required
in FPO to look after its day to day accounting work. Based on requirement, FPO
can engage other staff also. 10.3 The CEO/Manager is to be appointed by the
executive body of the FPO who should be either graduate in agriculture / agriculture
marketing / agri-business management or BBA or equivalent. Locally available professionals
with 10+2 and preferably diploma in agriculture / agriculture marketing / agri-business
management or in such other related areas may be preferable. The accountant should
have educational qualification of 10+2 with Mathematics as a compulsory subject
or alternatively with Commerce or Accountancy background. If any members of the
FPO meet the above criteria, they may be considered preferably in the selection
process. 10.4 Under the scheme, financial support towards salary of CEO/Manager
up to @ Rs. 25,000/- per month and of Accountant up to @ Rs.10,000/- per
month with annual increment up to 5% is to be provided from the earmarked financial
support for first 3 years only. Thereafter, FPOs will manage from their own resources
to pay the salary of CEO/Manager and Accountant. In order to create interest
of good professional activities of CEO/Accountant, the FPO may also offer higher
payment with their own sources of funds on above of Govt. support. One CEO will
provide full time services to one FPO at a time only.'''
- '''i. Shareholder List and Share Capital contribution by each Member verified
and certified by a Chartered Accountant (CA) prior to submission (Format attached,
Annexure I- Enclosure-I). ii. Resolution of FPO Board/Governing Council to seek
Equity Grant for Members (Format attached, Annexure I- Enclosure-II). iii. Consent
of Shareholders, stating name of shareholder, gender, number of shares held, face
value of shares, land holding, and signature, signifying consent for Implementing
Agency to directly transfer the Equity Grant sanctioned to the FPC on their behalf,
to FPC Bank account, against the consideration of additional shares of equivalent
value to be issued to them by FPC and on exit- transfer of the shares as per rules
(Format attached, Annexure I-Enclosure-III). iv. Audited Financials of FPO for
a minimum 1 year/for all years of existence of the FPO if formed less than three
years prior to application/ for the last 3 years for FPO in existence for 3 years
or more, verified and certified by a Chartered Accountant (CA) prior to submission.
v. Photocopy of FPO Bank Account Statement for last six months authenticated by
Branch Manager. vi. Business plan and budget for next 18 months. vii. Names, photographs,
and identity proof (one from among ration card, Aadhaar card, election identification
card, and passport of Representatives/ Directors authorized by the Board for executing
and signing all documents under the Scheme. viii. Each page of Application Form and
accompanying documents should be signed by a minimum of two Board Member Authorised
Representatives of FPO;'''
- '''9.1 The Formation and Incubation cost of CBBO, limited to maximum of Rs. 25 lakh
/ FPO of support or actual which is lesser, is to be provided for five years
from the year of formation. It includes cost towards undertaking baseline survey,
mobilization of farmers, organizing awareness programmes and conducting exposure
visits, professional hand holdings, incubation, cost of engaging CBBOs and other
overheads etc. There is also a provision for cost of NPMA towards manpower,
establishment, travel and advisory and maintaining MIS portal. This also includes
a provision towards cost for development of appropriate overall ICT based MIS
web portal for the Scheme.'''
- source_sentence: What does the term 'shareholder members' refer to?
sentences:
- '''(i) Coordinate with all the Implementing Agencies, State Level Consultative
Committee and District level Monitoring Committee (D-MC) for smooth implementation.
It will also consider feedback received from other relevant Ministries and Organizations
on Clusters identification for consideration. (ii) It will monitor the progress
either by holding the meetings of Implementing Agencies and other stakeholders
or by other means. (iii) It will allocate the produce clusters/districts/States
to Implementing Agencies for formation and promotion of FPOs. (iv) It will undertake
scrutiny of Action Plan of Implementing Agencies(IAs), consider recommendation
of release of fund to Implementing Agencies based on previous utilization as due with
respect to funding under the Scheme. (v) It will provide policy inputs to DAC&FW
for modification in the Scheme to better suit in the formation and promotion
of FPOs to make them economically sustainable. (vi) It will provide aid and advice
to Implementing Agencies as may be required for smooth functioning of the scheme. (vii)
Based on suggestions received from various Implementing Agencies, other Ministries,
States and experience/need, N-PMAFSC may examine and recommend revision of the
minimum membership norm per FPO to DAC&FW. (viii) It may seek detailed input
and analysis as may be required from time to time from NPMA and also seek assistance
of DMI in verification etc.'''
- '''19.1 It has been seen, during first two years of implementation of PMFBY,
there are various types of yield disputes, which unnecessarily delays the claim
settlement. Following figure shows the procedures to be adopted in various cases. Figure.
Procedures to be followed in different yield dispute cases 19.2 Wherever
the yield estimates reported at IU level are abnormally low or high vis-à-vis
the general crop condition the Insurance Company in consultation with State Govt.
can make use of various products (e.g. Satellite based Vegetation Index, Weather
parameters, etc.) or other technologies (including statistical test, crop models
etc.) to confirm yield estimates. If Insurance Company witnesses any anomaly/deficiency
in the actual yield data(partial /consolidated) received from the State Govt.,
the same shall be brought into the notice of concerned State department within
7 days from date of receipt of yield data with specific observations/remarks under
intimation to Govt. of India and anomaly, if any, may be resolved in next 7 days
by the State Level Coordination Committee (SLCC) headed by Additional Chief
Secretary/Principal Secretary/Secretary of the concerned department. This committee
shall be authorized to decide all such cases and the decision in such cases shall
be final. The SLCC may refer the case to State Level Technical Advisory Committee
(STAC) for dispute resolution (Constitution of STAC is defined in Para 19.5).
In case the matter stands unresolved even after examination by STAC, it may be
escalated to TAC along with all relevant documents including minutes of meetings/records
of discussion and report of the STAC and SLCC. Reference to TAC can be made thereafter
only in conditions specified in Para 19.7.1 However, data with anomalies which
is not reported within 7 days will be treated as accepted to insurance company.'''
- '''Date: To, (i) The Managing Director Small Farmers'' Agri-Business Consortium
(SFAC), NCUI Auditorium, August Kranti Marg, Hauz Khas, New Delhi 110016. (ii)The
Managing Director National Co-operative Development Corporation (NCDC), 4, Siri
Institutional Area, Hauz Khas, New Delhi 110016. (iii) The Chief General Manager
National Bank for Agriculture and Rural Development (NABARD), Regional Office
--------------------------------------------------------------- (iv) To any other
additional Implementing Agency allowed/designated, as the case may be. Sub: Application
for Equity Grant under scheme of Formation and Promotion of 10,000 Farmer Producer
Organizations (FPOs) Dear Sir/Madam, We herewith apply for Equity Grant as per
the provisions under the captioned scheme. 1. The details of the FPO are as under- S.
No. Particulars to be furnished Details 1. Name of the FPO 2. Correspondence
address of FPO 3. Contact details of FPO 4. Registration Number 5. Date
of registration/incorporation of FPO 6. Brief account of business of FPO 7. Number
of Shareholder Members 8. Number of Small, Marginal and Landless Shareholder
Members'''
pipeline_tag: sentence-similarity
model-index:
- name: SentenceTransformer based on BAAI/bge-small-en-v1.5
results:
- task:
type: information-retrieval
name: Information Retrieval
dataset:
name: val evaluator
type: val_evaluator
metrics:
- type: cosine_accuracy@1
value: 0.43
name: Cosine Accuracy@1
- type: cosine_accuracy@5
value: 0.87
name: Cosine Accuracy@5
- type: cosine_accuracy@10
value: 0.92
name: Cosine Accuracy@10
- type: cosine_precision@1
value: 0.43
name: Cosine Precision@1
- type: cosine_precision@5
value: 0.17399999999999996
name: Cosine Precision@5
- type: cosine_precision@10
value: 0.09199999999999997
name: Cosine Precision@10
- type: cosine_recall@1
value: 0.43
name: Cosine Recall@1
- type: cosine_recall@5
value: 0.87
name: Cosine Recall@5
- type: cosine_recall@10
value: 0.92
name: Cosine Recall@10
- type: cosine_ndcg@5
value: 0.6778743824685509
name: Cosine Ndcg@5
- type: cosine_ndcg@10
value: 0.6934417324625011
name: Cosine Ndcg@10
- type: cosine_ndcg@100
value: 0.712214063928892
name: Cosine Ndcg@100
- type: cosine_mrr@5
value: 0.6126666666666668
name: Cosine Mrr@5
- type: cosine_mrr@10
value: 0.618761904761905
name: Cosine Mrr@10
- type: cosine_mrr@100
value: 0.623424850876552
name: Cosine Mrr@100
- type: cosine_map@100
value: 0.6234248508765518
name: Cosine Map@100
- type: dot_accuracy@1
value: 0.43
name: Dot Accuracy@1
- type: dot_accuracy@5
value: 0.87
name: Dot Accuracy@5
- type: dot_accuracy@10
value: 0.92
name: Dot Accuracy@10
- type: dot_precision@1
value: 0.43
name: Dot Precision@1
- type: dot_precision@5
value: 0.17399999999999996
name: Dot Precision@5
- type: dot_precision@10
value: 0.09199999999999997
name: Dot Precision@10
- type: dot_recall@1
value: 0.43
name: Dot Recall@1
- type: dot_recall@5
value: 0.87
name: Dot Recall@5
- type: dot_recall@10
value: 0.92
name: Dot Recall@10
- type: dot_ndcg@5
value: 0.6778743824685509
name: Dot Ndcg@5
- type: dot_ndcg@10
value: 0.6934417324625011
name: Dot Ndcg@10
- type: dot_ndcg@100
value: 0.712214063928892
name: Dot Ndcg@100
- type: dot_mrr@5
value: 0.6126666666666668
name: Dot Mrr@5
- type: dot_mrr@10
value: 0.618761904761905
name: Dot Mrr@10
- type: dot_mrr@100
value: 0.623424850876552
name: Dot Mrr@100
- type: dot_map@100
value: 0.6234248508765518
name: Dot Map@100
---
# SentenceTransformer based on BAAI/bge-small-en-v1.5
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [BAAI/bge-small-en-v1.5](https://huggingface.co/BAAI/bge-small-en-v1.5). It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
## Model Details
### Model Description
- **Model Type:** Sentence Transformer
- **Base model:** [BAAI/bge-small-en-v1.5](https://huggingface.co/BAAI/bge-small-en-v1.5) <!-- at revision 5c38ec7c405ec4b44b94cc5a9bb96e735b38267a -->
- **Maximum Sequence Length:** 512 tokens
- **Output Dimensionality:** 384 tokens
- **Similarity Function:** Cosine Similarity
<!-- - **Training Dataset:** Unknown -->
<!-- - **Language:** Unknown -->
<!-- - **License:** Unknown -->
### Model Sources
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
### Full Model Architecture
```
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': True}) with Transformer model: BertModel
(1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
```
## Usage
### Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
```bash
pip install -U sentence-transformers
```
Then you can load this model and run inference.
```python
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("smokxy/embedding-finetuned")
# Run inference
sentences = [
"What does the term 'shareholder members' refer to?",
"'Date: To, (i) The Managing Director Small Farmers' Agri-Business Consortium (SFAC), NCUI Auditorium, August Kranti Marg, Hauz Khas, New Delhi 110016. (ii)The Managing Director National Co-operative Development Corporation (NCDC), 4, Siri Institutional Area, Hauz Khas, New Delhi 110016. (iii) The Chief General Manager National Bank for Agriculture and Rural Development (NABARD), Regional Office --------------------------------------------------------------- (iv) To any other additional Implementing Agency allowed/designated, as the case may be. Sub: Application for Equity Grant under scheme of Formation and Promotion of 10,000 Farmer Producer Organizations (FPOs) Dear Sir/Madam, We herewith apply for Equity Grant as per the provisions under the captioned scheme. 1. The details of the FPO are as under- S. No. Particulars to be furnished Details 1. Name of the FPO 2. Correspondence address of FPO 3. Contact details of FPO 4. Registration Number 5. Date of registration/incorporation of FPO 6. Brief account of business of FPO 7. Number of Shareholder Members 8. Number of Small, Marginal and Landless Shareholder Members'",
"'19.1 It has been seen, during first two years of implementation of PMFBY, there are various types of yield disputes, which unnecessarily delays the claim settlement. Following figure shows the procedures to be adopted in various cases. Figure. Procedures to be followed in different yield dispute cases 19.2 Wherever the yield estimates reported at IU level are abnormally low or high vis-à-vis the general crop condition the Insurance Company in consultation with State Govt. can make use of various products (e.g. Satellite based Vegetation Index, Weather parameters, etc.) or other technologies (including statistical test, crop models etc.) to confirm yield estimates. If Insurance Company witnesses any anomaly/deficiency in the actual yield data(partial /consolidated) received from the State Govt., the same shall be brought into the notice of concerned State department within 7 days from date of receipt of yield data with specific observations/remarks under intimation to Govt. of India and anomaly, if any, may be resolved in next 7 days by the State Level Coordination Committee (SLCC) headed by Additional Chief Secretary/Principal Secretary/Secretary of the concerned department. This committee shall be authorized to decide all such cases and the decision in such cases shall be final. The SLCC may refer the case to State Level Technical Advisory Committee (STAC) for dispute resolution (Constitution of STAC is defined in Para 19.5). In case the matter stands unresolved even after examination by STAC, it may be escalated to TAC along with all relevant documents including minutes of meetings/records of discussion and report of the STAC and SLCC. Reference to TAC can be made thereafter only in conditions specified in Para 19.7.1 However, data with anomalies which is not reported within 7 days will be treated as accepted to insurance company.'",
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 384]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
```
<!--
### Direct Usage (Transformers)
<details><summary>Click to see the direct usage in Transformers</summary>
</details>
-->
<!--
### Downstream Usage (Sentence Transformers)
You can finetune this model on your own dataset.
<details><summary>Click to expand</summary>
</details>
-->
<!--
### Out-of-Scope Use
*List how the model may foreseeably be misused and address what users ought not to do with the model.*
-->
## Evaluation
### Metrics
#### Information Retrieval
* Dataset: `val_evaluator`
* Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator)
| Metric | Value |
|:--------------------|:-----------|
| cosine_accuracy@1 | 0.43 |
| cosine_accuracy@5 | 0.87 |
| cosine_accuracy@10 | 0.92 |
| cosine_precision@1 | 0.43 |
| cosine_precision@5 | 0.174 |
| cosine_precision@10 | 0.092 |
| cosine_recall@1 | 0.43 |
| cosine_recall@5 | 0.87 |
| cosine_recall@10 | 0.92 |
| cosine_ndcg@5 | 0.6779 |
| cosine_ndcg@10 | 0.6934 |
| cosine_ndcg@100 | 0.7122 |
| cosine_mrr@5 | 0.6127 |
| cosine_mrr@10 | 0.6188 |
| cosine_mrr@100 | 0.6234 |
| **cosine_map@100** | **0.6234** |
| dot_accuracy@1 | 0.43 |
| dot_accuracy@5 | 0.87 |
| dot_accuracy@10 | 0.92 |
| dot_precision@1 | 0.43 |
| dot_precision@5 | 0.174 |
| dot_precision@10 | 0.092 |
| dot_recall@1 | 0.43 |
| dot_recall@5 | 0.87 |
| dot_recall@10 | 0.92 |
| dot_ndcg@5 | 0.6779 |
| dot_ndcg@10 | 0.6934 |
| dot_ndcg@100 | 0.7122 |
| dot_mrr@5 | 0.6127 |
| dot_mrr@10 | 0.6188 |
| dot_mrr@100 | 0.6234 |
| dot_map@100 | 0.6234 |
<!--
## Bias, Risks and Limitations
*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
-->
<!--
### Recommendations
*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
-->
## Training Details
### Training Hyperparameters
#### Non-Default Hyperparameters
- `eval_strategy`: steps
- `gradient_accumulation_steps`: 4
- `learning_rate`: 1e-05
- `weight_decay`: 0.01
- `num_train_epochs`: 1.0
- `warmup_ratio`: 0.1
- `load_best_model_at_end`: True
#### All Hyperparameters
<details><summary>Click to expand</summary>
- `overwrite_output_dir`: False
- `do_predict`: False
- `eval_strategy`: steps
- `prediction_loss_only`: True
- `per_device_train_batch_size`: 8
- `per_device_eval_batch_size`: 8
- `per_gpu_train_batch_size`: None
- `per_gpu_eval_batch_size`: None
- `gradient_accumulation_steps`: 4
- `eval_accumulation_steps`: None
- `learning_rate`: 1e-05
- `weight_decay`: 0.01
- `adam_beta1`: 0.9
- `adam_beta2`: 0.999
- `adam_epsilon`: 1e-08
- `max_grad_norm`: 1.0
- `num_train_epochs`: 1.0
- `max_steps`: -1
- `lr_scheduler_type`: linear
- `lr_scheduler_kwargs`: {}
- `warmup_ratio`: 0.1
- `warmup_steps`: 0
- `log_level`: passive
- `log_level_replica`: warning
- `log_on_each_node`: True
- `logging_nan_inf_filter`: True
- `save_safetensors`: True
- `save_on_each_node`: False
- `save_only_model`: False
- `restore_callback_states_from_checkpoint`: False
- `no_cuda`: False
- `use_cpu`: False
- `use_mps_device`: False
- `seed`: 42
- `data_seed`: None
- `jit_mode_eval`: False
- `use_ipex`: False
- `bf16`: False
- `fp16`: False
- `fp16_opt_level`: O1
- `half_precision_backend`: auto
- `bf16_full_eval`: False
- `fp16_full_eval`: False
- `tf32`: None
- `local_rank`: 0
- `ddp_backend`: None
- `tpu_num_cores`: None
- `tpu_metrics_debug`: False
- `debug`: []
- `dataloader_drop_last`: False
- `dataloader_num_workers`: 0
- `dataloader_prefetch_factor`: None
- `past_index`: -1
- `disable_tqdm`: False
- `remove_unused_columns`: True
- `label_names`: None
- `load_best_model_at_end`: True
- `ignore_data_skip`: False
- `fsdp`: []
- `fsdp_min_num_params`: 0
- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
- `fsdp_transformer_layer_cls_to_wrap`: None
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
- `deepspeed`: None
- `label_smoothing_factor`: 0.0
- `optim`: adamw_torch
- `optim_args`: None
- `adafactor`: False
- `group_by_length`: False
- `length_column_name`: length
- `ddp_find_unused_parameters`: None
- `ddp_bucket_cap_mb`: None
- `ddp_broadcast_buffers`: False
- `dataloader_pin_memory`: True
- `dataloader_persistent_workers`: False
- `skip_memory_metrics`: True
- `use_legacy_prediction_loop`: False
- `push_to_hub`: False
- `resume_from_checkpoint`: None
- `hub_model_id`: None
- `hub_strategy`: every_save
- `hub_private_repo`: False
- `hub_always_push`: False
- `gradient_checkpointing`: False
- `gradient_checkpointing_kwargs`: None
- `include_inputs_for_metrics`: False
- `eval_do_concat_batches`: True
- `fp16_backend`: auto
- `push_to_hub_model_id`: None
- `push_to_hub_organization`: None
- `mp_parameters`:
- `auto_find_batch_size`: False
- `full_determinism`: False
- `torchdynamo`: None
- `ray_scope`: last
- `ddp_timeout`: 1800
- `torch_compile`: False
- `torch_compile_backend`: None
- `torch_compile_mode`: None
- `dispatch_batches`: None
- `split_batches`: None
- `include_tokens_per_second`: False
- `include_num_input_tokens_seen`: False
- `neftune_noise_alpha`: None
- `optim_target_modules`: None
- `batch_eval_metrics`: False
- `batch_sampler`: batch_sampler
- `multi_dataset_batch_sampler`: proportional
</details>
### Training Logs
| Epoch | Step | Training Loss | loss | val_evaluator_cosine_map@100 |
|:---------:|:------:|:-------------:|:----------:|:----------------------------:|
| **0.531** | **15** | **0.511** | **0.1405** | **0.6234** |
| 0.9912 | 28 | - | 0.1405 | 0.6234 |
* The bold row denotes the saved checkpoint.
### Framework Versions
- Python: 3.10.14
- Sentence Transformers: 3.0.1
- Transformers: 4.41.1
- PyTorch: 2.3.0+cu121
- Accelerate: 0.27.2
- Datasets: 2.19.1
- Tokenizers: 0.19.1
## Citation
### BibTeX
#### Sentence Transformers
```bibtex
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
```
#### GISTEmbedLoss
```bibtex
@misc{solatorio2024gistembed,
title={GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embedding Fine-tuning},
author={Aivin V. Solatorio},
year={2024},
eprint={2402.16829},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
```
<!--
## Glossary
*Clearly define terms in order to be accessible across audiences.*
-->
<!--
## Model Card Authors
*Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
-->
<!--
## Model Card Contact
*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
-->